Top RAG repositories on GitHub
Retrieval-augmented generation pipelines, embeddings, and grounding tooling.
Ranked by stars across 1,261 repositories tagged rag. Refreshed daily.
- 1. langgenius/dify ★ 146,004 · ⑂ 22,962
Production-ready platform for agentic workflow development.
- ai
- gpt
- llm
- openai
- python
- rag
- 2. open-webui/open-webui ★ 142,468 · ⑂ 20,486
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
- ollama
- ollama-webui
- llm
- webui
- self-hosted
- llm-ui
- 3. langchain-ai/langchain ★ 139,782 · ⑂ 23,182
The agent engineering platform.
- ai
- anthropic
- gemini
- langchain
- llm
- openai
- 4. Shubhamsaboo/awesome-llm-apps ★ 115,190 · ⑂ 17,107
100+ AI Agent & RAG apps you can actually run — clone, customize, ship.
- llms
- rag
- python
- agents
- 5. thedotmack/claude-mem ★ 83,460 · ⑂ 7,222
Persistent Context Across Sessions for Every Agent – Captures everything your agent does during sessions, compresses it with AI, and injects relevant context back into future sessions. Works with Claude Code, OpenClaw, Codex, Gemini, Hermes, Copilot, OpenCode + More
- ai
- ai-agents
- ai-memory
- anthropic
- artificial-intelligence
- claude
- 6. infiniflow/ragflow ★ 83,265 · ⑂ 9,638
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
- ai
- ai-agents
- context-engine
- llm-apps
- rag
- retrieval-augmented-generation
- 7. PaddlePaddle/PaddleOCR ★ 83,159 · ⑂ 10,830
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
- ocr
- chineseocr
- pdf2markdown
- pp-ocr
- pp-structure
- document-parsing
- 8. dair-ai/Prompt-Engineering-Guide ★ 75,801 · ⑂ 8,266
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
- deep-learning
- prompt-engineering
- openai
- chatgpt
- language-model
- generative-ai
- 9. safishamsi/graphify ★ 70,019 · ⑂ 7,033
AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a queryable knowledge graph. App code + database schema + infrastructure in one graph.
- claude-code
- graphrag
- knowledge-graph
- codex
- openclaw
- skills
- 10. Mintplex-Labs/anything-llm ★ 61,868 · ⑂ 6,751
Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience
- rag
- localai
- vector-database
- llm
- ai-agents
- multimodal
- 11
- 12. pathwaycom/llm-app ★ 59,289 · ⑂ 1,433
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳 Docker-friendly. ⚡ Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
- chatbot
- hugging-face
- llm
- llm-local
- llm-prompting
- llm-security
- 13
- 14. FlowiseAI/Flowise ★ 53,852 · ⑂ 24,564
Build AI Agents, Visually
- artificial-intelligence
- chatgpt
- large-language-models
- low-code
- no-code
- javascript
- 15. run-llama/llama_index ★ 50,249 · ⑂ 7,597
LlamaIndex is the leading document agent and OCR platform
- agents
- application
- data
- fine-tuning
- framework
- llamaindex
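- 16. jeecgboot/JeecgBoot ★ 46,809 · ⑂ 16,055
AI low-code platform driven by both low-code and zero-code. Low-code generates front-end and back-end code in one click; zero-code builds a system in 5 minutes; AI Skills draws workflows, designs forms, and generates entire systems from a single sentence. Built-in AI chat, knowledge base, workflow orchestration, MCP plugins, and more, compatible with mainstream large models. Pioneers an "AI generation → online configuration → code generation → manual merge → AI modification" development model that eliminates 80% of the repetitive work in Java projects, improving efficiency without sacrificing flexibility.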
- antd
- activiti
- codegenerator
- springcloud
- springboot
- low-code
- 17. milvus-io/milvus ★ 44,862 · ⑂ 4,077
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
- anns
- nearest-neighbor-search
- faiss
- vector-search
- image-search
- hnsw
- 18. chopratejas/headroom ★ 42,491 · ⑂ 2,929
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
- agent
- ai
- anthropic
- compression
- context-engineering
- context-window
- 19. mindsdb/minds ★ 39,319 · ⑂ 6,206
General-purpose AI designed for knowledge workers — creators, strategists, and operators — and individuals seeking AI systems they can truly control to help them get work done, with full flexibility to extend and deploy anywhere (VPC, on-prem, or cloud).
- ai
- artificial-inteligence
- databases
- llms
- rag
- agents
- 20. QuivrHQ/quivr ★ 39,163 · ⑂ 3,723
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.
- ai
- llm
- api
- chatbot
- chatgpt
- database
- 21. chatchat-space/Langchain-Chatchat ★ 38,200 · ⑂ 6,215
Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent app built with Langchain and LLMs such as ChatGLM, Qwen, and Llama.
- chatglm
- langchain
- llm
- knowledge-base
- llama
- chatbot
- 22. HKUDS/LightRAG ★ 36,817 · ⑂ 5,192
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
- knowledge-graph
- large-language-models
- retrieval-augmented-generation
- genai
- graphrag
- llm
- 23. patchy631/ai-engineering-hub ★ 35,905 · ⑂ 5,957
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
- agents
- ai
- llms
- machine-learning
- mcp
- rag
- 24. ItzCrazyKns/Vane ★ 35,383 · ⑂ 3,899
Vane is an AI-powered answering engine.
- ai-search-engine
- search-engine
- open-source-ai-search-engine
- perplexica
- artificial-intelligence
- machine-learning
- 25. langchain-ai/langgraph ★ 35,321 · ⑂ 5,924
Build resilient agents.
- agents
- ai
- ai-agents
- chatgpt
- deepagents
- enterprise
Find engineers shipping RAG
The list above ranks the most-starred public repositories tagged with the RAG topic, drawn from the public GitHub graph. Across 1,261 repositories tagged this way, the maintainers and top contributors are a tight cluster of the people actually building RAG.
Looking for engineers who’ve worked on RAG for real, not just listed it on LinkedIn? The fastest path is the contributor list of these repos. Their commits, issues, and READMEs are public proof of depth.
Refolk turns this list into a search. Ask for “maintainers of top RAG repos who are hiring”, “RAG engineers in San Francisco”, or “founders shipping RAG” and Refolk returns a ranked shortlist with sources.
How this list is built
Last refreshed: Sun, 21 Jun 2026 08:15:48 GMT
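The page does not spell out its method beyond "ranked by stars" and "tagged rag". As a rough sketch only, a comparable ranking could be pulled from GitHub's public repository search endpoint. The helper names (`build_search_url`, `format_ranking`, `fetch_top_repos`) are made up for this illustration, and nothing here is confirmed to be how the list is actually produced.

```python
import json
import urllib.parse
import urllib.request

# GitHub's public repository search endpoint.
API_URL = "https://api.github.com/search/repositories"


def build_search_url(topic: str, per_page: int = 25) -> str:
    """Build a search URL for one topic, sorted by stars, highest first."""
    params = {
        "q": f"topic:{topic}",
        "sort": "stars",
        "order": "desc",
        "per_page": per_page,
    }
    return f"{API_URL}?{urllib.parse.urlencode(params)}"


def format_ranking(payload: dict) -> list[str]:
    """Turn a search response into lines shaped like the entries above."""
    lines = []
    for rank, repo in enumerate(payload.get("items", []), start=1):
        lines.append(
            f"{rank}. {repo['full_name']} ★ {repo['stargazers_count']:,}"
            f" · ⑂ {repo['forks_count']:,}"
        )
    return lines


def fetch_top_repos(topic: str, per_page: int = 25) -> dict:
    """Fetch one page of results; unauthenticated calls are rate limited."""
    request = urllib.request.Request(
        build_search_url(topic, per_page),
        headers={"Accept": "application/vnd.github+json"},
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.load(response)
```

Calling `format_ranking(fetch_top_repos("rag"))` would return one line per repository, and the response's `total_count` field gives the number of repositories carrying the topic. Star and fork counts change constantly, so live results will not match the snapshot above.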