STACKQUADRANT

gollm

teilomillet/gollm
5.9

Unified Go interface for Language Model (LLM) providers. Simplifies LLM integration with flexible prompt management and common task functions.

Prompt Engineering
67064GoApache-2.03mo ago

Awesome-LLM-Eval

onejune2018/Awesome-LLM-Eval
4.7

Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.

Evaluation & Testing
64776MIT7mo ago

aimock

CopilotKit/aimock
6.3

Mock everything your AI app talks to — LLM APIs, MCP, A2A, vector DBs, search. One package, one port, zero dependencies.

Evaluation & Testing
63744TypeScriptMIT1d ago

Awesome-LLM-in-Social-Science

ValueByte-AI/Awesome-LLM-in-Social-Science
5.1

Awesome papers involving LLMs in Social Science.

Evaluation & Testing
63349MIT20d ago

LLMTornado

lofcz/LLMTornado
6.5

The .NET library to build AI agents with 30+ built-in connectors.

Agent Frameworks
621106C#MIT1d ago

daydreams

daydreamsai/daydreams
5.9

Daydreams is a set of tools for building agents for commerce

Agent Frameworks
608133TypeScriptMIT3mo ago

fastapi-ml-skeleton

eightBEC/fastapi-ml-skeleton
4.5

FastAPI Skeleton App to serve machine learning models production-ready.

Model Serving
60491PythonApache-2.05mo ago

agent-skills-eval

darkrishabh/agent-skills-eval
5.3

A test runner for agentskills.io-style AI agent skills

Evaluation & Testing
60330TypeScriptMIT4d ago

yalm

andrewkchan/yalm
3.7

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

Inference Engines
59064C++9mo ago

ICLR2025-Papers-with-Code

yinizhilian/ICLR2025-Papers-with-Code
3.3

历年ICLR论文和开源项目合集,包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025.

Fine-tuning Tools
587331y ago

LLM-FineTuning-Large-Language-Models

rohan-paul/LLM-FineTuning-Large-Language-Models
3.6

LLM (Large Language Model) FineTuning

Fine-tuning Tools
576140Jupyter Notebook1y ago

awesome-evals

benchflow-ai/awesome-evals
4.7

A curated, non-BS library of the best resources for building and evaluating AI agents — papers, blogs, talks, tools, benchmarks. Maintained by BenchFlow.

Evaluation & Testing
57642NOASSERTION1d ago

gitagent

open-gitagent/gitagent
5.8

A framework-agnostic, git-native standard for defining AI agents

Agent Frameworks
573113TypeScriptMITtoday

iFixAi

ifixai-ai/iFixAi
6.2

The open-source diagnostic for AI misalignment. 32 tests across fabrication, manipulation, deception, unpredictability, and opacity. Provider-agnostic. Runs against OpenAI, Anthropic, Bedrock, Azure, Gemini, and more. Letter grade in under 5 minutes, content-addressed manifest for bit-identical replay. Built by iMe.

Evaluation & Testing
572114PythonApache-2.0today

langtest

Pacific-AI-Corp/langtest
5.8

Deliver safe & effective language models

Evaluation & Testing
56250PythonApache-2.02mo ago

langtest

PacificAI/langtest
5.8

Deliver safe & effective language models

Evaluation & Testing
56250PythonApache-2.02mo ago

KuiperLLama

zjhellofss/KuiperLLama
4.0

校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。

Inference Engines
547143C++8mo ago

pinferencia

underneathall/pinferencia
4.7

Python + Inference - Model Deployment library in Python. Simplest model inference server ever.

Model Serving
54383PythonApache-2.03y ago

continuous-eval

relari-ai/continuous-eval
4.7

Data-Driven Evaluation for LLM-Powered Applications

Evaluation & Testing
51738PythonApache-2.01y ago

Athena-Public

winstonkoh87/Athena-Public
5.9

The Linux OS for AI Agents — Persistent memory, autonomy, and time-awareness for any LLM. Own the state. Rent the intelligence.

LLM Frameworks
51269PythonMIT8d ago

LLM-VM

anarchy-ai/LLM-VM
4.8

irresponsible innovation. Try now at https://chat.dev/

Fine-tuning Tools
491137PythonMIT2y ago

openinfer

openinfer-project/openinfer
6.0

Pure Rust + CUDA LLM inference engine — no PyTorch, OpenAI-compatible, serves Qwen3 to Kimi-K2

Inference Engines
48870RustApache-2.0today

agency

operand/agency
5.0

A fast and minimal framework for building agentic systems

Agent Frameworks
48628PythonMIT18d ago

ome

ome-projects/ome
6.1

Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

Model Serving
47283GoApache-2.0today

Finetune_LLMs

mallorbc/Finetune_LLMs
3.8

Repo for fine-tuning Casual LLMs

Fine-tuning Tools
46586PythonAGPL-3.02y ago

fakecloud

faiscadev/fakecloud
5.7

Free, open-source AWS emulator. LocalStack alternative: 26 services, 1,924 operations, 100% conformance. No account, no auth token, no paid tier.

Evaluation & Testing
45631RustAGPL-3.0today

awsome-distributed-training

awslabs/awsome-distributed-training
5.6

Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.

Fine-tuning Tools
451196ShellMIT-0today

agentsilex

howl-anderson/agentsilex
4.8

A transparent, minimal, and hackable agent framework. ~300 lines of readable code. Full control, no magic.

Agent Frameworks
45145PythonMIT5mo ago

JetStream

AI-Hypercomputer/JetStream
4.8

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Model Serving
44866PythonApache-2.05mo ago

Aquila2

FlagAI-Open/Aquila2
3.6

The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.

Fine-tuning Tools
44632Python1y ago

xFasterTransformer

intel/xFasterTransformer
4.3

xFasterTransformer — open-source AI/LLM project.

Model Serving
43675C++Apache-2.09mo ago

awesome-on-policy-distillation

chrisliu298/awesome-on-policy-distillation
4.2

A curated collection of papers, technical reports, frameworks, and tools for on-policy distillation (OPD) of large language models

Fine-tuning Tools
42512CC0-1.01d ago

gpu-rest-engine

NVIDIA/gpu-rest-engine
3.7

A REST API for Caffe using Docker and Go

Model Serving
42293C++BSD-3-Clause7y ago

InternEvo

InternLM/InternEvo
4.8

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

Fine-tuning Tools
42067PythonApache-2.010mo ago

Awesome-LLM-Prompt-Optimization

jxzhangjhu/Awesome-LLM-Prompt-Optimization
4.3

Awesome-LLM-Prompt-Optimization: a curated list of advanced prompt optimization and tuning methods in Large Language Models

Prompt Engineering
412233d ago

tiger

tigerlab-ai/tiger
4.3

Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)

Fine-tuning Tools
40327Jupyter NotebookApache-2.02y ago

awesome-azure-openai-llm

kimtth/awesome-azure-openai-llm
4.6

A curated collection of resources for 🌌 Azure OpenAI, 🦙 LLMs (RAG, Agents).

Agent Frameworks
40258Python25d ago

tessera

zengxiao-he/tessera
4.3

From teacher to tiles — a from-scratch LLM distillation & serving engine: custom Triton/CUDA kernels, FSDP distillation, paged-KV continuous batching, speculative decoding, a Rust gateway, a JAX oracle, and interpretability tooling.

Inference Engines
3944PythonNOASSERTION24d ago

stable-diffusion-deploy

Lightning-Universe/stable-diffusion-deploy
4.6

Learn to serve Stable Diffusion models on cloud infrastructure at scale. This Lightning App shows load-balancing, orchestrating, pre-provisioning, dynamic batching, GPU-inference, micro-services working together via the Lightning Apps framework.

Model Serving
39139PythonApache-2.02y ago

LightRFT

opendilab/LightRFT
5.1

LightRFT: Light, Efficient, Omni-modal & Reward-model Driven Reinforcement Fine-Tuning Framework

Fine-tuning Tools
38811PythonApache-2.02mo ago

rhesis

rhesis-ai/rhesis
5.5

The testing platform for AI teams. Bring engineers, PMs, and domain experts together to generate tests, simulate (adversarial) conversations, and trace every failure to its root cause.

Evaluation & Testing
37326PythonNOASSERTIONtoday

APOLLO

zhuhanqing/APOLLO
4.2

APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Oustanding Paper Honorable Mention

Fine-tuning Tools
36419PythonNOASSERTION7mo ago

Dulus

KevRojo/Dulus
5.3

Open-source autonomous AI agent — runs on Claude-web, Gemini-web, Kimi-web, Deepseek-web and more for free and Every paid model via liteLLM. No API key required.

Agent Frameworks
36027PythonGPL-3.05d ago

llm-leaderboard

JonathanChavezTamales/llm-leaderboard
4.7

A comprehensive set of LLM benchmark scores and provider prices. (deprecated, read more in README)

Evaluation & Testing
36040JavaScriptNOASSERTION8mo ago

openclaw-optimization-guide

OnlyTerp/openclaw-optimization-guide
5.0

Make your OpenClaw AI agent faster, smarter, and cheaper. Speed optimization, memory architecture, context management, model selection, and one-shot development guide.

Prompt Engineering
35543JavaScriptMIT10d ago

alphora

opencmit/alphora
5.3

A Production-Ready Framework for Building Composable AI Agents

Agent Frameworks
34733PythonApache-2.01mo ago

palico-ai

palico-ai/palico-ai
4.5

Build, Improve Performance, and Productionize your LLM Application with an Integrated Framework

Evaluation & Testing
34328TypeScriptMIT1y ago

Awesome-MLSys-Blogger

MLSys-Learner-Resources/Awesome-MLSys-Blogger
3.6

The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)

Fine-tuning Tools
3409HTML1y ago