STACKQUADRANT

gpustack/gpustack

Inference Engines

Performance-optimized AI inference on your GPUs. Unlock superior throughput by selecting and tuning engines like vLLM or SGLang.

7.0
GitHub Metrics
Stars
4.9k
Forks
499
Open Issues
543
Watchers
38
Contributors
41
Weekly Commits
0
Language
Python
License
Apache-2.0
Last Commit
Apr 16, 2026
Created
May 11, 2024
Latest Release
v2.1.1
Release Date
Mar 26, 2026
Synced: Apr 16, 2026
Quality Scores
Documentation Qualityw: 20%
7.3

Has docs site (https://gpustack.ai). Description: 128 chars. Stars signal: 4,851. Contributors: 41. Score: 7.3/10

Community Healthw: 20%
6.1

Stars: 4,851. Contributors: 41. Watchers: 38. Forks: 499. Issue ratio: 11.2%. Score: 6.1/10

Maintenance Velocityw: 15%
7.6

Last commit: 0d ago. Weekly commits: 0. Latest release: v2.1.1. Maturity bonus: 1.9y old. Score: 7.6/10

API Design & DXw: 20%
6.4

Stars/issues ratio: 9. Dynamic language: Python. Has documentation site. Permissive license: Apache-2.0. Popularity signal: 4,851 stars. Score: 6.4/10

Production Readinessw: 15%
7.1

Battle-tested: 4,851 stars. Peer review: 41 contributors. Versioned: v2.1.1. Licensed: Apache-2.0. Age: 1.9 years. Maintenance: last commit 0d ago. Score: 7.1/10

Ecosystem Integrationw: 10%
7.9

Fork interest: 499. Major ecosystem: Python. Integration-friendly: Apache-2.0. Adoption: 4,851 stars. Has web presence. Score: 7.9/10

Tags
ascendcudadeepseekdistributed-inferencegenaihigh-performance-inferenceinferencellamallmllm-inference
Radar
Documentation Quality
Community Health
Maintenance Velocity
API Design & DX
Production Readiness
Ecosystem Integration