STACKQUADRANT

AI-Hypercomputer/JetStream

Model Serving

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

5.0
GitHub Metrics
Stars
425
Forks
63
Open Issues
27
Watchers
22
Contributors
38
Weekly Commits
0
Language
Python
License
Apache-2.0
Last Commit
Jan 5, 2026
Created
Mar 1, 2024
Latest Release
v0.3
Release Date
Dec 18, 2024
Synced: Apr 16, 2026
Quality Scores
Documentation Qualityw: 20%
4.7

No dedicated docs site. Description: 143 chars. Stars signal: 425. Contributors: 38. Score: 4.7/10

Community Healthw: 20%
5.1

Stars: 425. Contributors: 38. Watchers: 22. Forks: 63. Issue ratio: 6.4%. Score: 5.1/10

Maintenance Velocityw: 15%
4.1

Last commit: 101d ago. Weekly commits: 0. Latest release: v0.3. Maturity bonus: 2.1y old. Score: 4.1/10

API Design & DXw: 20%
5.8

Stars/issues ratio: 16. Dynamic language: Python. No dedicated API docs. Permissive license: Apache-2.0. Popularity signal: 425 stars. Score: 5.8/10

Production Readinessw: 15%
4.6

Battle-tested: 425 stars. Peer review: 38 contributors. Versioned: v0.3. Licensed: Apache-2.0. Age: 2.1 years. Maintenance: last commit 101d ago. Score: 4.6/10

Ecosystem Integrationw: 10%
6.0

Fork interest: 63. Major ecosystem: Python. Integration-friendly: Apache-2.0. Adoption: 425 stars. Score: 6/10

Tags
gemmagptgpuinferencejaxlarge-language-modelsllamallama2llmllm-inference
Radar
Documentation Quality
Community Health
Maintenance Velocity
API Design & DX
Production Readiness
Ecosystem Integration