STACKQUADRANT

SPTAG

microsoft/SPTAG
6.6

A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search scenario.

Vector Databases
5.0k619C++MITtoday

Olares

beclab/Olares
7.0

Olares: An Open-Source Personal Cloud to Reclaim Your Data

Model Serving
5.0k302GoAGPL-3.0today

h2o-llmstudio

h2oai/h2o-llmstudio
7.6

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

Fine-tuning Tools
5.0k532PythonApache-2.02d ago

eko

FellouAI/eko
7.0

Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai

Inference Engines
4.9k439TypeScriptMIT3mo ago

AutoRAG

Marker-Inc-Korea/AutoRAG
7.1

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

RAG Libraries
4.8k404PythonApache-2.05d ago

awesome-vibe-coding

filipecalegario/awesome-vibe-coding
5.8

A curated list of vibe coding references, collaborating with AI to write code.

Agent Frameworks
4.8k573CC0-1.02mo ago

ag2

ag2ai/ag2
8.1

AG2 (formerly AutoGen): The Open-Source AgentOS. Join us at: https://discord.gg/sNGSwQME3x

Agent Frameworks
4.7k660PythonApache-2.0today

lemonade

lemonade-sdk/lemonade
7.2

Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Zk

Inference Engines
4.7k371C++Apache-2.0today

semantic-router

vllm-project/semantic-router
7.7

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Fine-tuning Tools
4.6k721GoApache-2.0today

Integuru

Integuru-AI/Integuru
6.4

The first AI agent that builds permissionless integrations through reverse engineering platforms' internal APIs.

Agent Frameworks
4.6k361PythonAGPL-3.04d ago

infinity

infiniflow/infinity
7.4

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

Vector Databases
4.6k430C++Apache-2.04d ago

youtu-agent

TencentCloudADP/youtu-agent
6.4

A simple yet powerful agent framework that delivers with open-source models

Agent Frameworks
4.6k468PythonNOASSERTION3mo ago

agent-governance-toolkit

microsoft/agent-governance-toolkit
7.0

AI Agent Governance Toolkit — Policy enforcement, zero-trust identity, execution sandboxing, and reliability engineering for autonomous AI agents. Covers 10/10 OWASP Agentic Top 10.

Agent Frameworks
4.5k651PythonMIT2d ago

m_flow

FlowElement-ai/m_flow
6.6

A bio-inspired cognitive memory engine — a new paradigm for Graph RAG.

Vector Databases
4.5k260PythonApache-2.01mo ago

crate

crate/crate
7.6

CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.

Vector Databases
4.4k601JavaApache-2.0today

cognita

truefoundry/cognita
6.7

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

LLM Frameworks
4.4k391PythonApache-2.03mo ago

Deep-Learning-in-Production

ahkarami/Deep-Learning-in-Production
4.5

In this repository, I will share some useful notes and references about deploying deep learning-based models in production.

Model Serving
4.4k6851y ago

claude-code-guide

zebbern/claude-code-guide
5.8

Claude Code Guide - Setup, Commands, workflows, agents, skills & tips-n-tricks

Agent Frameworks
4.3k441PythonMIT2d ago

tiny-llm

skyzh/tiny-llm
6.9

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

LLM Frameworks
4.3k335PythonApache-2.015d ago

ruvector

ruvnet/ruvector
7.0

RuVector is a High Performance, Real-Time, Self-Learning, Vector Graph Neural Network, and Database built in Rust.

Inference Engines
4.3k568RustMITtoday

RuVector

ruvnet/RuVector
7.3

RuVector is a High Performance, Real-Time, Self-Learning, Vector Graph Neural Network, and Database built in Rust.

Inference Engines
4.3k568RustMITtoday

lmms-eval

EvolvingLMMs-Lab/lmms-eval
7.5

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Evaluation & Testing
4.3k607PythonNOASSERTION4d ago

agenta

Agenta-AI/agenta
7.4

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

Evaluation & Testing
4.2k555TypeScriptNOASSERTIONtoday

memgraph

memgraph/memgraph
6.9

Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.

Agent Frameworks
4.2k241C++NOASSERTIONtoday

USearch

unum-cloud/USearch
7.3

Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

Vector Databases
4.2k326C++Apache-2.01mo ago

AdalFlow

SylphAI-Inc/AdalFlow
6.9

AdalFlow: The library to build & auto-optimize LLM applications.

LLM Frameworks
4.2k376PythonMIT1mo ago

optillm

algorithmicsuperintelligence/optillm
6.5

Optimizing inference proxy for LLMs

Inference Engines
4.2k367PythonApache-2.01mo ago

AI-Infra-from-Zero-to-Hero

HuaizhengZhang/AI-Infra-from-Zero-to-Hero
6.2

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑‍💻 Video Tutorials.

Model Serving
4.2k401MIT11mo ago

LightLLM

ModelTC/LightLLM
6.5

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Model Serving
4.1k335PythonApache-2.0today

bRAG-langchain

bragai/bRAG-langchain
5.4

Everything you need to know to build your own RAG application

RAG Libraries
4.1k497Jupyter NotebookNOASSERTION7mo ago

GenerativeAIExamples

NVIDIA/GenerativeAIExamples
6.6

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

RAG Libraries
4.1k1.1kJupyter NotebookApache-2.01mo ago

FedML

FedML-AI/FedML
6.6

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.

Agent Frameworks
4.0k767PythonApache-2.08mo ago

langroid

langroid/langroid
7.2

Harness LLMs with Multi-Agent Programming

RAG Libraries
4.0k378PythonMIT13d ago

AI-Infra-Guard

Tencent/AI-Infra-Guard
7.4

A full-stack AI Red Teaming platform securing AI ecosystems via OpenClaw Security Scan, Agent Scan, Skills Scan, MCP scan, AI Infra scan and LLM jailbreak evaluation.

Evaluation & Testing
4.0k385PythonApache-2.0today

AIGC-Interview-Book

WeThinkIn/AIGC-Interview-Book
6.1

【三年面试五年模拟】AIGC算法工程师面试秘籍。涵盖AIGC、LLM大模型、传统深度学习、自动驾驶、AI Agent、机器学习、计算机视觉、自然语言处理、强化学习、大数据挖掘、具身智能、元宇宙、AGI等AI行业面试笔试干货经验与核心知识。

Agent Frameworks
4.0k421GPL-3.06d ago

ravendb

ravendb/ravendb
7.7

ACID Document Database

Vector Databases
4.0k859C#NOASSERTION2d ago

telegram-search

groupultra/telegram-search
7.1

🔍 导出并模糊搜索 Telegram 聊天记录 | Export and fuzzy search your Telegram chat history

Agent Frameworks
4.0k259TypeScriptAGPL-3.0today

anything_about_game

killop/anything_about_game
6.2

A wonderful list of Game Development resources.

Agent Frameworks
4.0k511Apache-2.02d ago

LazyLLM

LazyAGI/LazyLLM
7.3

Easiest and laziest way for building multi-agent LLMs applications.

LLM Frameworks
3.8k393PythonApache-2.0today

fast-agent

evalstate/fast-agent
7.4

Code, Build and Evaluate agents - excellent Model and Skills/MCP/ACP Support

Agent Frameworks
3.8k409PythonApache-2.0today

lorax

predibase/lorax
6.8

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Inference Engines
3.8k322PythonApache-2.01mo ago

OpenClawChineseTranslation

1186258278/OpenClawChineseTranslation
6.6

🦞 OpenClaw (Clawdbot/Moltbot) 汉化版 - 开源个人 AI 助手中文版 | Claude/ChatGPT LLM 接入 | WhatsApp/Telegram/Discord 多平台 | 每小时自动同步 | CLI + Dashboard 全中文 | 全流程搭建教程,以及排错指南!

Agent Frameworks
3.8k483JavaScriptNOASSERTIONtoday

jailbreak_llms

verazuo/jailbreak_llms
5.3

[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

LLM Frameworks
3.7k319Jupyter NotebookMIT1y ago

textgrad

zou-group/textgrad
5.9

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature.

Prompt Engineering
3.6k295PythonMIT11mo ago

skillhub

iflytek/skillhub
7.1

Self-hosted, open-source agent skill registry for enterprises. Publish & version skill packages, govern with RBAC and audit logs, deploy on-premise with Docker or Kubernetes.

Agent Frameworks
3.6k535JavaApache-2.0today

Acontext

memodb-io/Acontext
6.7

The Agent Memory Stack

Agent Frameworks
3.6k323JavaScriptApache-2.05d ago

agentic-rag-for-dummies

GiovanniPasq/agentic-rag-for-dummies
6.0

A modular Agentic RAG built with LangGraph — learn Retrieval-Augmented Generation Agents in minutes.

RAG Libraries
3.6k466Jupyter NotebookMIT7d ago

refact

smallcloudai/refact
7.4

AI Agent that handles engineering tasks end-to-end: integrates with developers’ tools, plans, executes, and iterates until it achieves a successful result.

Agent Frameworks
3.5k318RustBSD-3-Clause29d ago