omlx
jundot/omlx
7.8
Model Serving
★ 17.2k◇ 1.5kPythonApache-2.0today
TensorRT-LLM
NVIDIA/TensorRT-LLM
7.1
Model Serving
★ 14.0k◇ 2.5kPythonNOASSERTIONtoday
vllm-omni
vllm-project/vllm-omni
7.5
Model Serving
★ 5.3k◇ 1.2kPythonApache-2.0today
Olares
beclab/Olares
7.0
Model Serving
★ 5.0k◇ 302GoAGPL-3.01d ago
Deep-Learning-in-Production
ahkarami/Deep-Learning-in-Production
4.5
Model Serving
★ 4.4k◇ 6851y ago
AI-Infra-from-Zero-to-Hero
HuaizhengZhang/AI-Infra-from-Zero-to-Hero
6.2
Model Serving
★ 4.1k◇ 401MIT11mo ago
LightLLM
ModelTC/LightLLM
6.5
Model Serving
★ 4.1k◇ 335PythonApache-2.02d ago
chitu
thu-pacman/chitu
6.8
Model Serving
★ 3.1k◇ 266PythonApache-2.01d ago
ramalama
containers/ramalama
7.5
Model Serving
★ 2.9k◇ 344PythonMIT1d ago
inference
roboflow/inference
7.0
Model Serving
★ 2.3k◇ 277PythonNOASSERTION2d ago
vllm-ascend
vllm-project/vllm-ascend
7.2
Model Serving
★ 2.3k◇ 1.5kC++Apache-2.0today
envd
tensorchord/envd
6.8
Model Serving
★ 2.2k◇ 169GoApache-2.01mo ago
sie
superlinked/sie
6.6
Model Serving
★ 2.1k◇ 183PythonApache-2.01d ago
aici
microsoft/aici
4.9
Model Serving
★ 2.1k◇ 84RustMIT1y ago
mlrun
mlrun/mlrun
7.2
Model Serving
★ 1.7k◇ 308PythonApache-2.0today
kitops
kitops-ml/kitops
7.0
Model Serving
★ 1.4k◇ 176GoApache-2.02d ago
hopsworks
logicalclocks/hopsworks
5.8
Model Serving
★ 1.3k◇ 158JavaAGPL-3.01y ago
rtp-llm
alibaba/rtp-llm
6.0
Model Serving
★ 1.2k◇ 219CudaApache-2.0today
truss
basetenlabs/truss
6.8
Model Serving
★ 1.2k◇ 109PythonMIT2d ago
Nanoflow
efeslab/Nanoflow
4.7
Model Serving
★ 965◇ 50Jupyter Notebook3mo ago
mosec
mosecorg/mosec
6.5
Model Serving
★ 902◇ 73PythonApache-2.03d ago
model_server
openvinotoolkit/model_server
6.5
Model Serving
★ 892◇ 260C++Apache-2.02d ago
pipeless
pipeless-ai/pipeless
4.9
Model Serving
★ 849◇ 52RustApache-2.02y ago
Yatai
bentoml/Yatai
6.1
Model Serving
★ 844◇ 76TypeScriptNOASSERTION29d ago
ServerlessLLM
ServerlessLLM/ServerlessLLM
5.8
Model Serving
★ 687◇ 74PythonApache-2.01mo ago
timber
kossisoroyce/timber
5.4
Model Serving
★ 685◇ 23PythonNOASSERTION2mo ago
fastapi-ml-skeleton
eightBEC/fastapi-ml-skeleton
4.5
Model Serving
★ 604◇ 91PythonApache-2.05mo ago
pinferencia
underneathall/pinferencia
4.7
Model Serving
★ 543◇ 83PythonApache-2.03y ago
ome
ome-projects/ome
6.1
Model Serving
★ 472◇ 83GoApache-2.01d ago
JetStream
AI-Hypercomputer/JetStream
4.8
Model Serving
★ 447◇ 66PythonApache-2.05mo ago
xFasterTransformer
intel/xFasterTransformer
4.3
Model Serving
★ 436◇ 75C++Apache-2.09mo ago
gpu-rest-engine
NVIDIA/gpu-rest-engine
3.7
Model Serving
★ 422◇ 93C++BSD-3-Clause7y ago
stable-diffusion-deploy
Lightning-Universe/stable-diffusion-deploy
4.6
Model Serving
★ 391◇ 39PythonApache-2.02y ago
TurboOCR
aiptimizer/TurboOCR
5.1
Model Serving
★ 305◇ 37C++MITtoday
pmetal
Epistates/pmetal
5.0
Model Serving
★ 300◇ 21RustNOASSERTION23d ago
podman-desktop-extension-ai-lab
containers/podman-desktop-extension-ai-lab
5.9
Model Serving
★ 291◇ 82TypeScriptApache-2.06d ago
BMW-YOLOv4-Inference-API-GPU
BMW-InnovationLab/BMW-YOLOv4-Inference-API-GPU
4.1
Model Serving
★ 277◇ 67PythonBSD-3-Clause4y ago
llm-server
raketenkater/llm-server
4.8
Model Serving
★ 237◇ 12GoMIT3d ago
ggrun
raketenkater/ggrun
4.8
Model Serving
★ 237◇ 12GoMIT3d ago
BMW-YOLOv4-Inference-API-CPU
BMW-InnovationLab/BMW-YOLOv4-Inference-API-CPU
3.9
Model Serving
★ 218◇ 58PythonNOASSERTION4y ago