llama.cpp
ggml-org/llama.cpp
8.2
Inference Engines
★ 118.5k◇ 20.0kC++MITtoday
vLLM
vllm-project/vllm
8.6
Inference Engines
★ 84.7k◇ 18.6kPythonApache-2.0today
gpt4all
nomic-ai/gpt4all
7.1
Inference Engines
★ 77.4k◇ 8.3kC++MIT1y ago
ray
ray-project/ray
8.4
Inference Engines
★ 43.0k◇ 7.7kPythonApache-2.01d ago
gitleaks
gitleaks/gitleaks
8.2
Inference Engines
★ 27.9k◇ 2.1kGoMIT4d ago
llm-action
liguodongiot/llm-action
6.8
Inference Engines
★ 24.6k◇ 2.8kHTMLApache-2.03d ago
litgpt
Lightning-AI/litgpt
7.8
Inference Engines
★ 13.4k◇ 1.5kPythonApache-2.02d ago
OpenLLM
bentoml/OpenLLM
7.4
Inference Engines
★ 12.4k◇ 819PythonApache-2.06d ago
mistral-inference
mistralai/mistral-inference
7.2
Inference Engines
★ 10.8k◇ 1.1kJupyter NotebookApache-2.012d ago
openvino
openvinotoolkit/openvino
7.9
Inference Engines
★ 10.4k◇ 3.3kC++Apache-2.02d ago
PowerInfer
Tiiny-AI/PowerInfer
6.9
Inference Engines
★ 9.6k◇ 586C++MIT1mo ago
BentoML
bentoml/BentoML
8.0
Inference Engines
★ 8.7k◇ 979PythonApache-2.06d ago
lmdeploy
InternLM/lmdeploy
7.5
Inference Engines
★ 7.9k◇ 700PythonApache-2.02d ago
openevolve
algorithmicsuperintelligence/openevolve
6.6
Inference Engines
★ 6.6k◇ 1.1kPythonApache-2.03mo ago
plano
katanemo/plano
7.4
Inference Engines
★ 6.6k◇ 433RustApache-2.03d ago
flashinfer
flashinfer-ai/flashinfer
7.6
Inference Engines
★ 5.9k◇ 1.1kPythonApache-2.0today
kserve
kserve/kserve
7.7
Inference Engines
★ 5.6k◇ 1.5kGoApache-2.03d ago
shimmy
Michael-A-Kuykendall/shimmy
6.3
Inference Engines
★ 5.5k◇ 530RustApache-2.011d ago
Awesome-LLM-Inference
xlite-dev/Awesome-LLM-Inference
6.6
Inference Engines
★ 5.4k◇ 417PythonGPL-3.05d ago
gpustack
gpustack/gpustack
7.0
Inference Engines
★ 5.2k◇ 556PythonApache-2.0today
eko
FellouAI/eko
7.0
Inference Engines
★ 4.9k◇ 439TypeScriptMIT3mo ago
lemonade
lemonade-sdk/lemonade
7.2
Inference Engines
★ 4.7k◇ 371C++Apache-2.0today
RuVector
ruvnet/RuVector
7.0
Inference Engines
★ 4.3k◇ 568RustMITtoday
ruvector
ruvnet/ruvector
7.0
Inference Engines
★ 4.3k◇ 568RustMITtoday
optillm
algorithmicsuperintelligence/optillm
6.5
Inference Engines
★ 4.2k◇ 368PythonApache-2.01mo ago
lorax
predibase/lorax
6.8
Inference Engines
★ 3.8k◇ 322PythonApache-2.01mo ago
deepsparse
neuralmagic/deepsparse
5.9
Inference Engines
★ 3.2k◇ 191PythonNOASSERTION1y ago
spiceai
spiceai/spiceai
7.1
Inference Engines
★ 3.0k◇ 207RustApache-2.0today
distributed-llama
b4rtaz/distributed-llama
6.1
Inference Engines
★ 3.0k◇ 237C++MIT2mo ago
Medusa
FasterDecoding/Medusa
5.4
Inference Engines
★ 2.8k◇ 202Jupyter NotebookApache-2.02y ago
kvcached
ovg-project/kvcached
5.8
Inference Engines
★ 1.1k◇ 120PythonApache-2.016d ago
nobodywho
nobodywho-ooo/nobodywho
6.4
Inference Engines
★ 1.0k◇ 70RustEUPL-1.21d ago
ZhiLight
zhihu/ZhiLight
5.3
Inference Engines
★ 906◇ 102C++Apache-2.03mo ago
mlxstudio
jjang-ai/mlxstudio
5.2
Inference Engines
★ 830◇ 575d ago
yalm
andrewkchan/yalm
3.7
Inference Engines
★ 590◇ 64C++9mo ago
KuiperLLama
zjhellofss/KuiperLLama
4.0
Inference Engines
★ 547◇ 143C++8mo ago
openinfer
openinfer-project/openinfer
5.8
Inference Engines
★ 481◇ 70RustApache-2.0today
tessera
zengxiao-he/tessera
4.3
Inference Engines
★ 386◇ 4PythonNOASSERTION23d ago
swiftLLM
interestingLSY/swiftLLM
3.8
Inference Engines
★ 329◇ 37PythonApache-2.01y ago