STACKQUADRANT

veScale

volcengine/veScale
5.5

Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs

Fine-tuning Tools
1.0k61PythonApache-2.01mo ago

cashclaw

moltlaunch/cashclaw
5.6

An autonomous agent that takes work, does work, gets paid, and gets better at it.

Agent Frameworks
966204TypeScriptMIT1mo ago

start-llms

louisfb01/start-llms
5.5

A complete guide to start and improve your LLM skills in 2026 with little background in the field and stay up-to-date with the latest news and state-of-the-art techniques!

Fine-tuning Tools
961124MIT2mo ago

Nanoflow

efeslab/Nanoflow
4.9

A throughput-oriented high-performance serving framework for LLMs

Model Serving
95248Jupyter Notebook18d ago

mcp-framework

QuantGeekDev/mcp-framework
6.5

The Typescript MCP Framework

LLM Frameworks
908105TypeScriptMITtoday

ZhiLight

zhihu/ZhiLight
5.5

A highly optimized LLM inference acceleration engine for Llama and its variants.

Inference Engines
904102C++Apache-2.029d ago

mosec

mosecorg/mosec
6.5

A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine

Model Serving
89872PythonApache-2.01d ago

kvcached

ovg-project/kvcached
5.6

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Inference Engines
858100PythonApache-2.09d ago

model_server

openvinotoolkit/model_server
6.5

A scalable inference server for models optimized with OpenVINO™

Model Serving
857248C++Apache-2.0today

cerebellum

theredsix/cerebellum
5.0

Browser automation system that uses AI-driven planning to navigate web pages and perform goals.

Fine-tuning Tools
85657PythonMIT1mo ago

pipeless

pipeless-ai/pipeless
4.9

An open-source computer vision framework to build and deploy apps in minutes

Model Serving
85052RustApache-2.01y ago

Yatai

bentoml/Yatai
5.3

Model Deployment at Scale on Kubernetes 🦄️

Model Serving
83977TypeScriptNOASSERTION1y ago

scenario

langwatch/scenario
5.9

Agentic testing for agentic codebases

Evaluation & Testing
83758TypeScriptMITtoday

Adan

sail-sg/Adan
4.5

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Fine-tuning Tools
81470PythonApache-2.010mo ago

DeepMCPAgent

cryxnet/DeepMCPAgent
5.4

Model-agnostic plug-n-play LangChain/LangGraph agents powered entirely by MCP tools over HTTP/SSE.

Agent Frameworks
812127PythonApache-2.06mo ago

skales

skalesapp/skales
5.9

Free AI Desktop Agent for Windows, macOS & Linux - Automate email, calendar, browser, code generation. 13+ AI providers, Ollama, Telegram remote control. No Docker, no terminal. 🦎

Agent Frameworks
806137TypeScriptNOASSERTION3d ago

nobodywho

nobodywho-ooo/nobodywho
6.2

NobodyWho is an inference engine that lets you run LLMs locally and efficiently on any device.

Inference Engines
79455RustEUPL-1.2today

RAG-FiT

IntelLabs/RAG-FiT
5.4

Framework for enhancing LLMs for RAG tasks using fine-tuning.

Fine-tuning Tools
76961PythonApache-2.04mo ago

blades

go-kratos/blades
6.3

Blades is a Go-based multimodal AI Agent framework.

Agent Frameworks
75792GoMIT12d ago

awesome-ai-apps

rohitg00/awesome-ai-apps
5.2

A curated collection of awesome AI Agents and LLM Apps built with multiple tech stacks, showcasing real-world implementations using OpenAI, Gemini, local models, and various AI frameworks.

LLM Frameworks
753155HTMLApache-2.02mo ago

Trace

microsoft/Trace
5.7

End-to-end Generative Optimization for AI Agents

Prompt Engineering
73058PythonMIT4mo ago

LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing

ghimiresunil/LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing
5.3

LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.

Fine-tuning Tools
727121Jupyter NotebookMIT1mo ago

pydantic-deepagents

vstorm-co/pydantic-deepagents
6.3

Python Deep Agent framework built on top of Pydantic-AI, designed to help you quickly build production-grade autonomous AI agents with planning, filesystem operations, subagent delegation, skills, and structured outputs—in just 10 lines of code.

Agent Frameworks
68074PythonMITtoday

ServerlessLLM

ServerlessLLM/ServerlessLLM
5.9

Serverless LLM Serving for Everyone.

Model Serving
67470PythonApache-2.01mo ago

AI-Compass

tingaicompass/AI-Compass
5.3

“AI-Compass”将为社区指引在 AI 技术海洋中航行的方向,无论你是初学者还是进阶开发者,都能在这里找到通往 AI 各大方向的路径。旨在帮助开发者系统性地了解 AI 的核心概念、主流技术、前沿趋势,并通过实践掌握从理论到落地的全过程。

Fine-tuning Tools
67485Pythontoday

timber

kossisoroyce/timber
5.6

Ollama for classical ML models. AOT compiler that turns XGBoost, LightGBM, scikit-learn, CatBoost & ONNX models into native C99 inference code. One command to load, one command to serve. 336x faster than Python inference.

Model Serving
66720PythonNOASSERTIONtoday

long-context-attention

feifeibear/long-context-attention
5.7

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Fine-tuning Tools
66479PythonApache-2.03mo ago

gollm

teilomillet/gollm
6.2

Unified Go interface for Language Model (LLM) providers. Simplifies LLM integration with flexible prompt management and common task functions.

Prompt Engineering
65163GoApache-2.026d ago

Awesome-LLM-Eval

onejune2018/Awesome-LLM-Eval
5.0

Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.

Evaluation & Testing
63158MIT4mo ago

Awesome-LLM-in-Social-Science

ValueByte-AI/Awesome-LLM-in-Social-Science
5.0

Awesome papers involving LLMs in Social Science.

Evaluation & Testing
61046MIT1mo ago

daydreams

daydreamsai/daydreams
6.2

Daydreams is a set of tools for building agents for commerce

Agent Frameworks
604131TypeScriptMIT1mo ago

fastapi-ml-skeleton

eightBEC/fastapi-ml-skeleton
4.7

FastAPI Skeleton App to serve machine learning models production-ready.

Model Serving
60493PythonApache-2.03mo ago

LLMTornado

lofcz/LLMTornado
6.5

The .NET library to build AI agents with 30+ built-in connectors.

Agent Frameworks
596100C#MIT1d ago

ICLR2025-Papers-with-Code

yinizhilian/ICLR2025-Papers-with-Code
3.3

历年ICLR论文和开源项目合集,包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025.

Fine-tuning Tools
571291y ago

yalm

andrewkchan/yalm
3.8

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

Inference Engines
57059C++7mo ago

LLM-FineTuning-Large-Language-Models

rohan-paul/LLM-FineTuning-Large-Language-Models
3.6

LLM (Large Language Model) FineTuning

Fine-tuning Tools
570137Jupyter Notebook1y ago

langtest

Pacific-AI-Corp/langtest
6.1

Deliver safe & effective language models

Evaluation & Testing
55549PythonApache-2.01d ago

langtest

PacificAI/langtest
6.1

Deliver safe & effective language models

Evaluation & Testing
55549PythonApache-2.01d ago

pinferencia

underneathall/pinferencia
4.7

Python + Inference - Model Deployment library in Python. Simplest model inference server ever.

Model Serving
54582PythonApache-2.03y ago

KuiperLLama

zjhellofss/KuiperLLama
4.1

校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。

Inference Engines
529137C++5mo ago

voice-ai

rapidaai/voice-ai
5.9

Rapida is an open-source, end-to-end voice AI orchestration platform for building real-time conversational voice agents with audio streaming, STT, TTS, VAD, multi-channel integration, agent state management, and observability.

Agent Frameworks
51884GoNOASSERTIONtoday

continuous-eval

relari-ai/continuous-eval
4.7

Data-Driven Evaluation for LLM-Powered Applications

Evaluation & Testing
51638PythonApache-2.01y ago

mlxstudio

jjang-ai/mlxstudio
4.8

MLX Studio - Home of JANG_Q - Image Gen/Edit + Chat/Code All in one - + OpenClaw (Anthropic API)

Inference Engines
50333today

LLM-VM

anarchy-ai/LLM-VM
4.8

irresponsible innovation. Try now at https://chat.dev/

Fine-tuning Tools
491135PythonMIT1y ago

agency

operand/agency
5.0

A fast and minimal framework for building agentic systems

Agent Frameworks
48026PythonMIT8d ago

aimock

CopilotKit/aimock
5.8

Mock everything your AI app talks to — LLM APIs, MCP, A2A, vector DBs, search. One package, one port, zero dependencies.

Evaluation & Testing
47324TypeScriptMITtoday

Finetune_LLMs

mallorbc/Finetune_LLMs
3.8

Repo for fine-tuning Casual LLMs

Fine-tuning Tools
46086PythonAGPL-3.02y ago

agentsilex

howl-anderson/agentsilex
5.0

A transparent, minimal, and hackable agent framework. ~300 lines of readable code. Full control, no magic.

Agent Frameworks
44744PythonMIT3mo ago