STACKQUADRANT

Aquila2

FlagAI-Open/Aquila2
3.6

The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.

Fine-tuning Tools
44531Python1y ago

xFasterTransformer

intel/xFasterTransformer
4.5

xFasterTransformer — open-source AI/LLM project.

Model Serving
43674C++Apache-2.07mo ago

JetStream

AI-Hypercomputer/JetStream
5.0

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Model Serving
42663PythonApache-2.03mo ago

gpu-rest-engine

NVIDIA/gpu-rest-engine
3.9

A REST API for Caffe using Docker and Go

Model Serving
42393C++BSD-3-Clause7y ago

InternEvo

InternLM/InternEvo
5.0

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

Fine-tuning Tools
42068PythonApache-2.07mo ago

awsome-distributed-training

awslabs/awsome-distributed-training
5.7

Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.

Fine-tuning Tools
407182ShellMIT-01d ago

Awesome-LLM-Prompt-Optimization

jxzhangjhu/Awesome-LLM-Prompt-Optimization
3.0

Awesome-LLM-Prompt-Optimization: a curated list of advanced prompt optimization and tuning methods in Large Language Models

Prompt Engineering
406212y ago

tiger

tigerlab-ai/tiger
4.3

Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)

Fine-tuning Tools
40127Jupyter NotebookApache-2.02y ago

awesome-azure-openai-llm

kimtth/awesome-azure-openai-llm
4.3

A curated collection of resources for 🌌 Azure OpenAI, 🦙 LLMs (RAG, Agents).

Agent Frameworks
39952Python1d ago

stable-diffusion-deploy

Lightning-Universe/stable-diffusion-deploy
4.7

Learn to serve Stable Diffusion models on cloud infrastructure at scale. This Lightning App shows load-balancing, orchestrating, pre-provisioning, dynamic batching, GPU-inference, micro-services working together via the Lightning Apps framework.

Model Serving
39139PythonApache-2.02y ago

llm-leaderboard

JonathanChavezTamales/llm-leaderboard
4.8

A comprehensive set of LLM benchmark scores and provider prices. (deprecated, read more in README)

Evaluation & Testing
36140JavaScriptNOASSERTION5mo ago

APOLLO

zhuhanqing/APOLLO
4.4

APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Oustanding Paper Honorable Mention

Fine-tuning Tools
34418PythonNOASSERTION4mo ago

alphora

opencmit/alphora
5.4

A Production-Ready Framework for Building Composable AI Agents

Agent Frameworks
34433PythonApache-2.026d ago

palico-ai

palico-ai/palico-ai
4.5

Build, Improve Performance, and Productionize your LLM Application with an Integrated Framework

Evaluation & Testing
34228TypeScriptMIT1y ago

Awesome-MLSys-Blogger

MLSys-Learner-Resources/Awesome-MLSys-Blogger
3.6

The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)

Fine-tuning Tools
3359HTML1y ago

ReaLHF

openpsi-project/ReaLHF
3.7

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Fine-tuning Tools
33522PythonApache-2.011mo ago

swiftLLM

interestingLSY/swiftLLM
3.9

A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of vLLM).

Inference Engines
32337PythonApache-2.010mo ago

vibe-log-cli

vibe-log/vibe-log-cli
4.9

A CLI tool for logging and analyzing Claude Code and Cursor ai-driven coding session.

Prompt Engineering
31519TypeScriptMIT4mo ago

rhesis

rhesis-ai/rhesis
5.4

The testing platform for AI teams. Bring engineers, PMs, and domain experts together to generate tests, simulate (adversarial) conversations, and trace every failure to its root cause.

Evaluation & Testing
31224PythonNOASSERTIONtoday

llms-tools

PetroIvaniuk/llms-tools
4.7

A list of LLMs Tools & Projects

Evaluation & Testing
30640Apache-2.01mo ago

promptimal

shobrook/promptimal
3.5

A very fast, very minimal prompt optimizer

Prompt Engineering
30115PythonMIT1y ago

athina-evals

athina-ai/athina-evals
4.1

Python SDK for running evaluations on LLM generated responses

Evaluation & Testing
29921Python10mo ago

podman-desktop-extension-ai-lab

containers/podman-desktop-extension-ai-lab
5.9

Work with LLMs on a local environment using containers

Model Serving
29180TypeScriptApache-2.0today

SiLLM

armbues/SiLLM
3.8

SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.

Fine-tuning Tools
28526PythonMIT10mo ago

BMW-YOLOv4-Inference-API-GPU

BMW-InnovationLab/BMW-YOLOv4-Inference-API-GPU
4.1

This is a repository for an nocode object detection inference API using the Yolov3 and Yolov4 Darknet framework.

Model Serving
27867PythonBSD-3-Clause3y ago

Nano

bd4sur/Nano
4.3

电子鹦鹉 / Toy Language Model

Fine-tuning Tools
27513Ctoday

openclaw-optimization-guide

OnlyTerp/openclaw-optimization-guide
5.0

Make your OpenClaw AI agent faster, smarter, and cheaper. Speed optimization, memory architecture, context management, model selection, and one-shot development guide.

Prompt Engineering
24029TypeScriptMIT9d ago

ContribAI

tang-vu/ContribAI
5.5

Autonomous AI agent that contributes to open source — discovers repos, analyzes code, generates fixes, and submits PRs

Agent Frameworks
23386RustNOASSERTION2d ago

npi

sheet0/npi
3.8

Action library for AI Agent

Agent Frameworks
22811PythonApache-2.01y ago

BMW-YOLOv4-Inference-API-CPU

BMW-InnovationLab/BMW-YOLOv4-Inference-API-CPU
3.9

This is a repository for an nocode object detection inference API using the Yolov4 and Yolov3 Opencv.

Model Serving
21858PythonNOASSERTION3y ago

pocketgroq

jgravelle/pocketgroq
3.8

PocketGroq is a powerful Python library that simplifies integration with the Groq API, offering advanced features for natural language processing, web scraping, and autonomous agent capabilities. Key Features Seamless integration with Groq API for text generation and completion Chain of Thought (CoT) reasoning for complex problem-solving and more.

Agent Frameworks
21759Python1y ago

prompt-optimizer-studio

XBigRoad/prompt-optimizer-studio
4.7

可同时输入多个初版提示词,开启多轮自动优化,连续得到高分后得到最终结果。

Prompt Engineering
20018TypeScriptAGPL-3.012d ago

flutter-skill

ai-dashboad/flutter-skill
5.1

AI-powered E2E testing for 10 platforms. 253 MCP tools. Zero config. Works with Claude, Cursor, Windsurf, Copilot. Test Flutter, React Native, iOS, Android, Web, Electron, Tauri, KMP, .NET MAUI — all from natural language.

Evaluation & Testing
19624DartMIT1d ago

promptolution

automl/promptolution
4.8

A unified, modular Framework for Prompt Optimization

Prompt Engineering
1269PythonApache-2.01mo ago

qaskills

PramodDutta/qaskills
4.0

QA Skills Directory QA Skills is a curated directory of testing-specific skills for AI coding agents (Claude Code, Cursor, Copilot, etc.).

Evaluation & Testing
1044TypeScript2d ago

SkillFoundry

samibs/skillfoundry
3.4

SkillFoundry — a leading open-source project in the AI/LLM ecosystem.

AI DevOps
6TypeScriptMITtoday

StackQuadrant

samibs/StackQuadrant
3.1

StackQuadrant — a leading open-source project in the AI/LLM ecosystem.

AI DevOps
1TypeScript23d ago
← prev10 / 10