Refolk

Top Rust LLM repositories on GitHub

Large language model frameworks, agents, runtimes, and inference servers. Filtered to projects whose primary language is Rust.

Ranked by stars across 333 Rust repositories tagged llm. Refreshed daily.

  1. 1
    rtk-ai/rtk64,301 · ⑂ 3,962

    CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies

    • agentic-coding
    • ai-coding
    • anthropic
    • claude-code
    • cli
    • command-line-tool
  2. 2
    Hmbown/CodeWhale38,763 · ⑂ 3,338

    Open-source, community-driven agent harness

    • cli
    • deepseek
    • llm
    • rust
    • terminal
    • tui
  3. 3
    AlexsJones/llmfit28,399 · ⑂ 1,742

    Hundreds of models & providers. One command to find what runs on your hardware.

    • llm
    • skill
    • localai
    • gguf
    • mlx
    • unsloth
  4. 4
    screenpipe/screenpipe19,401 · ⑂ 1,849

    YC (S26) | AI that knows what you've seen, said, or heard. Records everything you do, say, hear 24/7, local, private, secure. Connect to OpenClaw, Hermes agent and 100+ apps

    • ai
    • computer-vision
    • llm
    • machine-learning
    • multimodal
    • agents
  5. 5
    RightNow-AI/openfang17,869 · ⑂ 2,268

    Open-source Agent Operating System

    • agent-framework
    • ai-agents
    • llm
    • mcp
    • open-source
    • openclaw
  6. 6
    memvid/memvid15,675 · ⑂ 1,355

    Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval and long-term memory.

    • ai
    • context
    • embedded
    • faiss
    • knowledge-base
    • knowledge-graph
  7. 7
    Zackriya-Solutions/meetily12,827 · ⑂ 1,376

    Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization built on Rust. 100% local processing. no cloud required. Meetily (Meetly Ai - https://meetily.ai) is the #1 Self-hosted, Open-source Ai meeting note taker for macOS & Windows.

    • meeting-minutes
    • meeting-notes
    • llm
    • mac
    • windows
    • rust
  8. 8
    tensorzero/tensorzero11,664 · ⑂ 936

    TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.

    • ai
    • artificial-intelligence
    • deep-learning
    • gpt
    • llm
    • llmops
  9. 9
    cocoindex-io/cocoindex10,436 · ⑂ 811

    Incremental engine for long horizon agents 🌟 Star if you like it!

    • ai
    • change-data-capture
    • data-indexing
    • etl
    • indexing
    • python
  10. 10
    sigoden/aichat10,158 · ⑂ 703

    All-in-one LLM CLI tool featuring Shell Assistant, Chat-REPL, RAG, AI Tools & Agents, with access to OpenAI, Claude, Gemini, Ollama, Groq, and more.

    • claude
    • gemini
    • ollama
    • openai
    • function-calling
    • cli
  11. 11
    BoundaryML/baml8,403 · ⑂ 434

    The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)

    • baml
    • llm
    • boundaryml
    • guardrails
    • llm-playground
    • playground
  12. 12
    qualcomm/nexa-sdk8,113 · ⑂ 1,008

    Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Supporting OpenAI GPT-OSS, IBM Granite-4, Qwen-3-VL, Gemma-3n, Ministral-3, and more.

    • llm
    • on-device-ai
    • sdk
    • stable-diffusion
    • vlm
    • go
  13. 13
    0xPlaygrounds/rig7,695 · ⑂ 856

    ⚙️🦀 Build modular and scalable LLM Applications in Rust

    • ai
    • llm
    • agent
    • artificial-intelligence
    • automation
    • large-language-model
  14. 14
    1jehuang/jcode7,440 · ⑂ 828

    Coding Agent Harness

    • ai
    • claude
    • cli
    • coding-agent
    • llm
    • mcp
  15. 15
    mufeedvh/code2prompt7,427 · ⑂ 426

    A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.

    • ai
    • chatgpt
    • claude
    • cli
    • command-line
    • command-line-tool
  16. 16
    tailcallhq/forgecode7,421 · ⑂ 1,453

    AI enabled pair programmer for Claude, GPT, O Series, Grok, Deepseek, Gemini and 300+ models

    • ai-workflows
    • artifical-intelligense
    • claude-3-7-sonnet
    • command-line
    • multi-agent-reinforcement-learning
    • shell
  17. 17
    EricLBuehler/mistral.rs7,332 · ⑂ 628

    Fast, flexible LLM inference

    • llm
    • rust
    • uqff
  18. 18
    postgresml/postgresml6,802 · ⑂ 362

    Postgres with GPUs for ML/AI apps.

    • ml
    • machine-learning
    • ai
    • ann
    • artificial-intelligence
    • classification
  19. 19
    rustformers/llm6,151 · ⑂ 378

    [Unmaintained, see README] An ecosystem of Rust libraries for working with large language models

    • ai
    • ggml
    • llm
    • ml
    • rust
  20. 20
    winfunc/deepreasoning5,356 · ⑂ 440

    A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.

    • ai
    • anthropic
    • anthropic-claude
    • api
    • chain-of-thought
    • claude
  21. 21

    A blazing fast inference solution for text embeddings models

    • ai
    • embeddings
    • huggingface
    • llm
    • ml
  22. 22
    SilasMarvin/lsp-ai3,186 · ⑂ 114

    LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.

    • ai
    • auto-completion
    • developer-tools
    • ide
    • language-client
    • llama
  23. 23
    keon/browser-control3,112 · ⑂ 208

    A tiny, fast Rust CLI that drives a real browser over the Chrome DevTools Protocol — built for coding agents.

    • rust
    • agent
    • browser-automation
    • llm
  24. 24
    yvgude/lean-ctx2,827 · ⑂ 279

    Control what your AI can see. LeanCTX (Lean Context) is the context intelligence layer for AI agents — one local Rust binary that decides what they read, remembers what they learn, guards what they touch, and proves what they save. 60–90% fewer tokens as the receipt. 76 MCP tools, 30+ agents, local-first.

    • ai
    • cursor
    • llm
    • mcp
    • rust
    • token-optimization
  25. 25
    moltis-org/moltis2,750 · ⑂ 323

    A secure persistent personal agent server in Rust. One binary, sandboxed execution, multi-provider LLMs, voice, memory, Telegram, WhatsApp, Discord, Teams, and MCP tools. Secure by design, runs on your hardware.

    • rust
    • ai-agent
    • ai-assistant
    • llm
    • mcp
    • sandbox

Find Rust engineers shipping LLM

The list above ranks the most-starred public Rust repositories tagged with the LLM topic, drawn from the public GitHub graph. Across 333 matching repositories, the contributors are a tight cluster of engineers with both Rust chops and real LLM experience.

That overlap is rare. Most Rust engineers haven’t shipped LLM, and most LLM maintainers don’t write Rust. The people on this list’s contributor graph are the ones who do both.

Refolk turns this list into a search. Ask for Rust LLM maintainers hiring” or Rust engineers shipping LLM in 2025” and Refolk returns a ranked shortlist with the commits, profiles, and projects behind each name.

How this list is built

Refolk searched GitHub for public Rust repositories tagged with the LLM topic, ranked them by stargazer count, and kept those with at least 25 stars. The list refreshes once a day.

Last refreshed: Sun, 21 Jun 2026 07:12:45 GMT

Need a more specific search?

Refolk runs natural-language searches across GitHub, LinkedIn, and the open web. Try one of these:

Related lists

See all repository lists.

Or zoom out