llama.cpp
ggml-org/llama.cpp
8.0
Inference Engines
★ 104.0k◇ 16.9kC++MITtoday
gpt4all
nomic-ai/gpt4all
7.2
Inference Engines
★ 77.3k◇ 8.3kC++MIT10mo ago
vLLM
vllm-project/vllm
8.6
Inference Engines
★ 76.9k◇ 15.7kPythonApache-2.0today
ray
ray-project/ray
8.6
Inference Engines
★ 42.2k◇ 7.4kPythonApache-2.0today
gitleaks
gitleaks/gitleaks
8.2
Inference Engines
★ 26.0k◇ 2.0kGoMIT21d ago
llm-action
liguodongiot/llm-action
6.7
Inference Engines
★ 24.0k◇ 2.8kHTMLApache-2.01mo ago
litgpt
Lightning-AI/litgpt
7.9
Inference Engines
★ 13.3k◇ 1.4kPythonApache-2.0today
OpenLLM
bentoml/OpenLLM
7.3
Inference Engines
★ 12.3k◇ 804PythonApache-2.02d ago
mistral-inference
mistralai/mistral-inference
6.9
Inference Engines
★ 10.8k◇ 1.0kJupyter NotebookApache-2.01mo ago
openvino
openvinotoolkit/openvino
8.2
Inference Engines
★ 10.1k◇ 3.2kC++Apache-2.0today
PowerInfer
Tiiny-AI/PowerInfer
6.8
Inference Engines
★ 9.3k◇ 562C++MIT2mo ago
BentoML
bentoml/BentoML
8.0
Inference Engines
★ 8.6k◇ 950PythonApache-2.0today
lmdeploy
InternLM/lmdeploy
7.5
Inference Engines
★ 7.8k◇ 685PythonApache-2.0today
plano
katanemo/plano
7.4
Inference Engines
★ 6.3k◇ 400RustApache-2.0today
openevolve
algorithmicsuperintelligence/openevolve
6.8
Inference Engines
★ 6.0k◇ 949PythonApache-2.029d ago
flashinfer
flashinfer-ai/flashinfer
7.5
Inference Engines
★ 5.4k◇ 899PythonApache-2.0today
kserve
kserve/kserve
7.7
Inference Engines
★ 5.3k◇ 1.4kGoApache-2.0today
Awesome-LLM-Inference
xlite-dev/Awesome-LLM-Inference
6.6
Inference Engines
★ 5.1k◇ 361PythonGPL-3.07d ago
eko
FellouAI/eko
7.3
Inference Engines
★ 4.9k◇ 435TypeScriptMIT1mo ago
gpustack
gpustack/gpustack
7.0
Inference Engines
★ 4.9k◇ 499PythonApache-2.0today
shimmy
Michael-A-Kuykendall/shimmy
6.2
Inference Engines
★ 4.0k◇ 345RustApache-2.021d ago
ruvector
ruvnet/ruvector
6.7
Inference Engines
★ 3.8k◇ 465RustMIT1d ago
RuVector
ruvnet/RuVector
6.7
Inference Engines
★ 3.8k◇ 465RustMIT1d ago
lorax
predibase/lorax
6.1
Inference Engines
★ 3.8k◇ 312PythonApache-2.010mo ago
lemonade
lemonade-sdk/lemonade
7.0
Inference Engines
★ 3.6k◇ 264C++Apache-2.0today
optillm
algorithmicsuperintelligence/optillm
6.5
Inference Engines
★ 3.4k◇ 267PythonApache-2.028d ago
deepsparse
neuralmagic/deepsparse
6.1
Inference Engines
★ 3.2k◇ 191PythonNOASSERTION10mo ago
distributed-llama
b4rtaz/distributed-llama
6.3
Inference Engines
★ 2.9k◇ 225C++MIT1d ago
spiceai
spiceai/spiceai
6.9
Inference Engines
★ 2.9k◇ 185RustApache-2.0today
Medusa
FasterDecoding/Medusa
5.4
Inference Engines
★ 2.7k◇ 197Jupyter NotebookApache-2.01y ago
ZhiLight
zhihu/ZhiLight
5.5
Inference Engines
★ 904◇ 102C++Apache-2.029d ago
kvcached
ovg-project/kvcached
5.6
Inference Engines
★ 858◇ 100PythonApache-2.08d ago
nobodywho
nobodywho-ooo/nobodywho
6.2
Inference Engines
★ 793◇ 55RustEUPL-1.2today
yalm
andrewkchan/yalm
3.8
Inference Engines
★ 570◇ 59C++7mo ago
KuiperLLama
zjhellofss/KuiperLLama
4.1
Inference Engines
★ 527◇ 137C++5mo ago
mlxstudio
jjang-ai/mlxstudio
4.8
Inference Engines
★ 496◇ 32today
swiftLLM
interestingLSY/swiftLLM
3.9
Inference Engines
★ 323◇ 37PythonApache-2.010mo ago