STACKQUADRANT

gpustack/gpustack

Inference Engines

Performance-optimized AI inference on your GPUs. Unlock superior throughput by selecting and tuning engines like vLLM or SGLang.

7.0
GitHub Metrics
Stars
5.2k
Forks
556
Open Issues
615
Watchers
41
Contributors
42
Weekly Commits
0
Language
Python
License
Apache-2.0
Last Commit
Jun 28, 2026
Created
May 11, 2024
Latest Release
v2.2.0
Release Date
Jun 18, 2026
Synced: Jun 29, 2026
Quality Scores
Documentation Qualityw: 20%
7.3

Has docs site (https://gpustack.ai). Description: 128 chars. Stars signal: 5,232. Contributors: 42. Score: 7.3/10

Community Healthw: 20%
6.2

Stars: 5,232. Contributors: 42. Watchers: 41. Forks: 556. Issue ratio: 11.8%. Score: 6.2/10

Maintenance Velocityw: 15%
7.6

Last commit: 1d ago. Weekly commits: 0. Latest release: v2.2.0. Maturity bonus: 2.1y old. Score: 7.6/10

API Design & DXw: 20%
6.4

Stars/issues ratio: 9. Dynamic language: Python. Has documentation site. Permissive license: Apache-2.0. Popularity signal: 5,232 stars. Score: 6.4/10

Production Readinessw: 15%
7.3

Battle-tested: 5,232 stars. Peer review: 42 contributors. Versioned: v2.2.0. Licensed: Apache-2.0. Age: 2.1 years. Maintenance: last commit 1d ago. Score: 7.3/10

Ecosystem Integrationw: 10%
7.9

Fork interest: 556. Major ecosystem: Python. Integration-friendly: Apache-2.0. Adoption: 5,232 stars. Has web presence. Score: 7.9/10

Tags
ascendcudadeepseekdistributed-inferencegenaihigh-performance-inferenceinferencellamallmllm-inference
Radar
Documentation Quality
Community Health
Maintenance Velocity
API Design & DX
Production Readiness
Ecosystem Integration