AI-Hypercomputer/JetStream

Model Serving

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).


Overall score: 4.8/10

GitHub Metrics

Stars: 448

Forks: 66

Open Issues

Watchers: 22

Contributors: 38

Weekly Commits: 0

Language: Python

License: Apache-2.0

Last Commit: Jan 5, 2026

Created: Mar 1, 2024

Latest Release: v0.3

Release Date: Dec 18, 2024

Synced: Jun 29, 2026

Quality Scores

Documentation Quality (weight: 20%)

4.8

No dedicated docs site. Description: 143 chars. Stars signal: 448. Contributors: 38. Score: 4.8/10

Community Health (weight: 20%)

5.1

Stars: 448. Contributors: 38. Watchers: 22. Forks: 66. Issue ratio: 6.0%. Score: 5.1/10

Maintenance Velocity (weight: 15%)

3.1

Last commit: 174d ago. Weekly commits: 0. Latest release: v0.3. Maturity bonus: 2.3y old. Score: 3.1/10

API Design & DX (weight: 20%)

5.8

Stars/issues ratio: 17. Dynamic language: Python. No dedicated API docs. Permissive license: Apache-2.0. Popularity signal: 448 stars. Score: 5.8/10

Production Readiness (weight: 15%)

4.0

Battle-tested: 448 stars. Peer review: 38 contributors. Versioned: v0.3. Licensed: Apache-2.0. Age: 2.3 years. Maintenance: last commit 174d ago. Score: 4.0/10

Ecosystem Integration (weight: 10%)

6.1

Fork interest: 66. Major ecosystem: Python. Integration-friendly: Apache-2.0. Adoption: 448 stars. Score: 6.1/10
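
The six category weights above sum to 100%, and the 4.8 shown at the top of the page is consistent with a weighted mean of the category scores. The page does not state its formula, so the sketch below is an assumption: the `weighted_score` helper and the `CATEGORIES` table are hypothetical names introduced only for illustration, using the scores and weights listed on this page.

```python
# Category scores and weights copied from the Quality Scores section.
# ASSUMPTION: the overall score is a plain weighted mean; the page does
# not document how it is actually computed.
CATEGORIES = {
    "Documentation Quality": (4.8, 0.20),
    "Community Health": (5.1, 0.20),
    "Maintenance Velocity": (3.1, 0.15),
    "API Design & DX": (5.8, 0.20),
    "Production Readiness": (4.0, 0.15),
    "Ecosystem Integration": (6.1, 0.10),
}


def weighted_score(categories):
    """Return the weighted mean of (score, weight) pairs."""
    total_weight = sum(weight for _, weight in categories.values())
    weighted_sum = sum(score * weight for score, weight in categories.values())
    return weighted_sum / total_weight


overall = weighted_score(CATEGORIES)
print(f"{overall:.1f}")  # prints 4.8
```

Dividing by the total weight keeps the result on the 0 to 10 scale even if a category were dropped or the weights did not sum exactly to 1.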