STACKQUADRANT

optillm

algorithmicsuperintelligence/optillm
6.5

Optimizing inference proxy for LLMs

Inference Engines
3.4k · 267 · Python · Apache-2.0 · 28d ago

chitu

thu-pacman/chitu
6.9

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Model Serving
3.4k · 354 · Python · Apache-2.0 · today

MetaClaw

aiming-lab/MetaClaw
6.6

Just talk to your agent — it learns and EVOLVES.

Fine-tuning Tools
3.4k · 409 · Python · MIT · 5d ago

vault-ai

pashpashpash/vault-ai
5.6

OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, epub, etc) using a simple React frontend.

Vector Databases
3.4k · 300 · JavaScript · MIT · 9mo ago

wanwu

UnicomAI/wanwu
6.9

China Unicom's Yuanjing Wanwu Agent Platform is an enterprise-grade, multi-tenant AI agent development platform. It helps users build applications such as intelligent agents, workflows, and RAG, and also supports model management. The platform features a developer-friendly license, and we welcome all developers to build upon the platform.

Agent Frameworks
3.4k · 89 · Go · Apache-2.0 · 6d ago

ii-agent

Intelligent-Internet/ii-agent
6.6

II-Agent: a new open-source framework to build and deploy intelligent agents

Agent Frameworks
3.3k · 498 · Python · Apache-2.0 · 3d ago

Acontext

memodb-io/Acontext
6.8

The Agent Memory Stack

Agent Frameworks
3.3k · 312 · TypeScript · Apache-2.0 · 1d ago

trulens

truera/trulens
7.3

Evaluation and Tracking for LLM Experiments and AI Agents

Evaluation & Testing
3.3k · 263 · Python · MIT · 1d ago

beeai-framework

i-am-bee/beeai-framework
7.4

Build production-ready AI agents in both Python and TypeScript.

LLM Frameworks
3.2k · 425 · Python · Apache-2.0 · 8d ago

deepsparse

neuralmagic/deepsparse
6.1

Sparsity-aware deep learning inference runtime for CPUs

Inference Engines
3.2k · 191 · Python · NOASSERTION · 10mo ago

agentic-rag-for-dummies

GiovanniPasq/agentic-rag-for-dummies
6.0

A modular Agentic RAG built with LangGraph — learn Retrieval-Augmented Generation Agents in minutes.

RAG Libraries
3.1k · 419 · Jupyter Notebook · MIT · 15d ago

matmulfreellm

ridgerchu/matmulfreellm
5.6

Implementation for MatMul-free LM.

LLM Frameworks
3.1k · 201 · Python · Apache-2.0 · 4mo ago

BotSharp

SciSharp/BotSharp
7.4

AI Multi-Agent Framework in .NET

Agent Frameworks
3.0k · 623 · C# · Apache-2.0 · today

prompttools

hegelai/prompttools
6.4

Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).

Vector Databases
3.0k · 256 · Python · Apache-2.0 · 2mo ago

core

cheshire-cat-ai/core
6.8

AI agent microservice

LLM Frameworks
3.0k · 396 · Python · GPL-3.0 · 1mo ago

swirl-search

swirlai/swirl-search
7.3

AI Search & RAG Without Moving Your Data. Get instant answers from your company's knowledge across 100+ apps while keeping data secure. Deploy in minutes, not months.

RAG Libraries
3.0k · 282 · Python · Apache-2.0 · 2d ago

MiroFlow

MiroMindAI/MiroFlow
6.6

🏆 Top-1 on 5+ benchmarks | Web UI | Supports MiroThinker, Claude, Kimi, OpenAI

Agent Frameworks
2.9k · 304 · Python · Apache-2.0 · 7d ago

miroflow

MiroMindAI/miroflow
6.7

MiroFlow is an agent framework that enables tool-use agent tasks, featuring a reproducible GAIA score of 82.4%.

Agent Frameworks
2.9k · 304 · Python · Apache-2.0 · 7d ago

InternLM-XComposer

InternLM/InternLM-XComposer
5.1

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

LLM Frameworks
2.9k · 176 · Python · Apache-2.0 · 10mo ago

LLM-Finetuning

ashishpatel26/LLM-Finetuning
4.4

LLM Finetuning with peft

Fine-tuning Tools
2.9k · 766 · Jupyter Notebook · 8mo ago

distributed-llama

b4rtaz/distributed-llama
6.3

Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.

Inference Engines
2.9k · 225 · C++ · MIT · 2d ago

VideoRAG

HKUDS/VideoRAG
6.1

[KDD'2026] "VideoRAG: Chat with Your Videos"

RAG Libraries
2.9k · 409 · Python · NOASSERTION · 29d ago

spiceai

spiceai/spiceai
6.9

A portable accelerated SQL query, search, and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

Inference Engines
2.9k · 186 · Rust · Apache-2.0 · today

fastembed

qdrant/fastembed
6.9

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

RAG Libraries
2.9k · 194 · Python · Apache-2.0 · 1d ago

gerev

GerevAI/gerev
5.3

🧠 AI-powered enterprise search engine 🔎

Vector Databases
2.8k · 179 · Python · MIT · 2y ago

lmnr

lmnr-ai/lmnr
6.9

Laminar - open-source observability platform purpose-built for AI agents. YC S24.

Evaluation & Testing
2.8k · 191 · TypeScript · Apache-2.0 · today

ramalama

containers/ramalama
7.4

RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of containers.

Model Serving
2.7k · 330 · Python · MIT · today

Medusa

FasterDecoding/Medusa
5.4

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Inference Engines
2.7k · 197 · Jupyter Notebook · Apache-2.0 · 1y ago

gitagent

open-gitagent/gitagent
6.7

A framework-agnostic, git-native standard for defining AI agents

Agent Frameworks
2.7k · 321 · TypeScript · MIT · 3d ago

xTuring

stochasticai/xTuring
6.8

Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

Fine-tuning Tools
2.7k · 212 · Python · Apache-2.0 · 1mo ago

hora

hora-search/hora
6.2

🚀 Efficient approximate nearest neighbor search algorithm collections library written in Rust 🦀.

Vector Databases
2.7k · 76 · Rust · Apache-2.0 · 1mo ago

second-brain-ai-assistant-course

decodingai-magazine/second-brain-ai-assistant-course
6.4

Learn to build your Second Brain AI assistant with LLMs, agents, RAG, fine-tuning, LLMOps and AI systems techniques.

Fine-tuning Tools
2.6k · 471 · Jupyter Notebook · MIT · 10d ago

trieve

devflowinc/trieve
6.9

All-in-one platform for search, recommendations, RAG, and analytics offered via API

RAG Libraries
2.6k · 241 · Rust · MIT · 2mo ago

LISA

JIA-Lab-research/LISA
4.8

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

LLM Frameworks
2.6k · 205 · Python · Apache-2.0 · 1y ago

Awesome-LLM-KG

RManLuo/Awesome-LLM-KG
4.7

Awesome papers about unifying LLMs and KGs

LLM Frameworks
2.6k · 179 · 11mo ago

inference

roboflow/inference
7.2

Turn any computer or edge device into a command center for your computer vision projects.

Model Serving
2.3k · 253 · Python · NOASSERTION · today

maxtext

AI-Hypercomputer/maxtext
7.1

A simple, performant and scalable Jax LLM!

Fine-tuning Tools
2.2k · 507 · Python · Apache-2.0 · today

prompt-in-context-learning

EgoAlpha/prompt-in-context-learning
6.3

Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.

LLM Frameworks
2.2k · 191 · Jupyter Notebook · MIT · 14d ago

awesome-llm-powered-agent

hyp1231/awesome-llm-powered-agent
5.2

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

LLM Frameworks
2.2k · 208 · MIT · 11mo ago

generative-ai

genieincodebottle/generative-ai
6.3

Comprehensive resources on Generative AI, including a detailed roadmap, projects, use cases, interview preparation, and coding preparation.

LLM Frameworks
2.2k · 543 · Jupyter Notebook · MIT · 5d ago

envd

tensorchord/envd
7.0

🏕️ Reproducible development environment for humans and agents

Model Serving
2.2k · 167 · Go · Apache-2.0 · 6d ago

AI-Engineering.academy

adithya-s-k/AI-Engineering.academy
6.1

Mastering Applied AI, One Concept at a Time

Fine-tuning Tools
2.2k · 250 · Jupyter Notebook · MIT · 1mo ago

intel-extension-for-transformers

intel/intel-extension-for-transformers
5.6

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

LLM Frameworks
2.2k · 217 · Python · Apache-2.0 · 1y ago

nestia

samchon/nestia
7.0

NestJS Helper + AI Chatbot Development

LLM Frameworks
2.1k · 123 · TypeScript · MIT · today

YiVal

YiVal/YiVal
6.1

Your Automatic Prompt Engineering Assistant for GenAI Applications

LLM Frameworks
2.1k · 328 · Python · Apache-2.0 · 1y ago

MoBA

MoonshotAI/MoBA
4.6

MoBA: Mixture of Block Attention for Long-Context LLMs

Fine-tuning Tools
2.1k · 141 · Python · MIT · 1y ago

trainer

kubeflow/trainer
7.5

Distributed AI Model Training and LLM Fine-Tuning on Kubernetes

Fine-tuning Tools
2.1k · 945 · Go · Apache-2.0 · 1d ago

llama_deploy

run-llama/llama_deploy
6.9

Deploy your agentic workflows to production

LLM Frameworks
2.1k · 229 · Python · MIT · 10d ago