Refolk

Top Observability repositories on GitHub

Tracing, metrics, logs, and debugging tools for production systems.

Ranked by stars across 589 repositories tagged observability. Refreshed daily.

  1. netdata/netdata · ★ 79,331 · ⑂ 6,488

    The fastest path to AI-powered full stack observability, even for lean teams.

    • monitoring
    • docker
    • kubernetes
    • cncf
    • prometheus
    • netdata
  2. langfuse/langfuse · ★ 29,442 · ⑂ 3,061

    🪢 Open source AI engineering platform: LLM evals, observability, metrics, prompt management, playground, datasets. Integrates with OpenTelemetry, LangChain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

    • analytics
    • llm
    • llmops
    • large-language-models
    • openai
    • self-hosted
  3. SigNoz/signoz · ★ 27,403 · ⑂ 2,231

    SigNoz is an open-source observability platform native to OpenTelemetry with logs, traces and metrics in a single application. An open-source alternative to DataDog, NewRelic, etc. 🔥 🖥. 👉 Open source Application Performance Monitoring (APM) & Observability tool

    • observability
    • application-monitoring
    • opentelemetry
    • distributed-tracing
    • apm
    • go
  4. mlflow/mlflow · ★ 26,645 · ⑂ 5,875

    The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

    • machine-learning
    • ai
    • ml
    • mlflow
    • apache-spark
    • model-management
  5. apache/skywalking · ★ 24,830 · ⑂ 6,639

    APM, Application Performance Monitoring System

    • skywalking
    • observability
    • apm
    • service-mesh
    • dapper
    • distributed-tracing
  6. cilium/cilium · ★ 24,559 · ⑂ 3,841

    eBPF-based Networking, Security, and Observability

    • containers
    • bpf
    • security
    • kubernetes
    • kubernetes-networking
    • cni
  7. jaegertracing/jaeger · ★ 22,903 · ⑂ 2,962

    CNCF Jaeger, a Distributed Tracing Platform

    • distributed-tracing
    • cncf
    • tracing
    • observability
    • jaeger
    • opentelemetry
  8. PrefectHQ/prefect · ★ 22,653 · ⑂ 2,346

    Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

    • python
    • workflow
    • data-engineering
    • data-science
    • workflow-engine
    • prefect
  9. vectordotdev/vector · ★ 22,062 · ⑂ 2,180

    A high-performance observability data pipeline.

    • logs
    • metrics
    • observability
    • forwarder
    • events
    • stream-processing
  10. mikeroyal/Self-Hosting-Guide · ★ 21,549 · ⑂ 1,083

    Self-Hosting Guide. Learn all about locally hosting (on premises & private web servers) and managing software applications by yourself or your organization. Including Cloud, LLMs, WireGuard, Automation, Home Assistant, and Networking.

    • self-hosted
    • selfhosted
    • home-assistant
    • wireguard
    • decentralized
    • self-hosting
  11. elastic/kibana · ★ 21,147 · ⑂ 8,595

    Your window into all of your data

    • kibana
    • elasticsearch
    • visualizations
    • metrics
    • observability
    • dashboards
  12.

    End-to-end, code-first tutorials for building production-grade GenAI agents. From prototype to enterprise deployment.

    • agent
    • agent-framework
    • agents
    • ai-agents
    • genai
    • generative-ai
  13. openobserve/openobserve · ★ 19,387 · ⑂ 861

    Open source observability platform for logs, metrics, traces, frontend monitoring, pipelines and LLM observability. A sophisticated, simple and highly performant alternative to Datadog, Splunk, and Elasticsearch with 140x lower storage costs and single binary deployment.

    • logs
    • metrics
    • traces
    • analytics
    • elasticsearch
    • jaeger
  14. openzipkin/zipkin · ★ 17,430 · ⑂ 3,103

    Zipkin is a distributed tracing system

    • zipkin
    • distributed-tracing
    • tracing
    • openzipkin
    • observability
  15. VictoriaMetrics/VictoriaMetrics · ★ 17,188 · ⑂ 1,668

    VictoriaMetrics: fast, cost-effective monitoring solution and time series database

    • tsdb
    • prometheus
    • promql
    • influxdb
    • graphite
    • opentsdb
  16. kubesphere/kubesphere · ★ 16,974 · ⑂ 2,744

    The container platform tailored for Kubernetes multi-cloud, datacenter, and edge management ⎈ 🖥 ☁️

    • devops
    • cncf
    • cloud-native
    • servicemesh
    • kubesphere
    • kubernetes
  17. apache/doris · ★ 15,516 · ⑂ 3,834

    Apache Doris is an easy-to-use, high performance and unified analytics database.

    • olap
    • database
    • hudi
    • iceberg
    • real-time
    • sql
  18. Effect-TS/effect · ★ 14,659 · ⑂ 596

    Build production-ready applications in TypeScript

    • effect
    • javascript
    • cli
    • opentelemetry
    • platform
    • schema
  19. thanos-io/thanos · ★ 14,115 · ⑂ 2,317

    Highly available Prometheus setup with long term storage capabilities. A CNCF Incubating project.

    • prometheus
    • google-cloud-storage
    • high-availability
    • prometheus-ha-pairs
    • thanos
    • s3
  20. ccfos/nightingale · ★ 13,098 · ⑂ 1,725

    Nightingale is to monitoring and alerting what Grafana is to visualization.

    • monitoring
    • time-series
    • nightingale
    • tsdb
    • open-falcon
    • alerting
  21. kubeshark/kubeshark · ★ 11,961 · ⑂ 539

    eBPF-powered network observability for Kubernetes. Indexes L4/L7 traffic with full K8s context, decrypts TLS without keys. Queryable by AI agents via MCP and humans via dashboard.

    • kubernetes
    • golang
    • rest
    • grpc
    • devops
    • sniffer
  22. grafana/pyroscope · ★ 11,501 · ⑂ 770

    Continuous Profiling Platform. Debug performance issues down to a single line of code

    • continuous-profiling
    • profiling
    • performance
    • golang
    • ruby
    • python
  23. upgundecha/howtheysre · ★ 9,739 · ⑂ 884

    A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)

    • site-reliability-engineering
    • sre
    • chaos-engineering
    • dev-ops
    • devops
    • monitoring
  24. VoltAgent/voltagent · ★ 9,710 · ⑂ 1,006

    AI Agent Engineering Platform built on an Open Source TypeScript AI Agent Framework

    • agents
    • ai
    • chatbots
    • llm
    • mcp
    • nodejs
  25. hyperdxio/hyperdx · ★ 9,612 · ⑂ 413

    Resolve production issues, fast. An open source observability platform unifying session replays, logs, metrics, traces and errors powered by ClickHouse and OpenTelemetry.

    • analytics
    • application-monitoring
    • log-management
    • logs
    • metrics
    • monitoring

Find engineers shipping Observability

The list above ranks the most-starred public repositories tagged with the Observability topic, drawn from the public GitHub graph. The maintainers and top contributors of the 589 repositories tagged this way form a tight cluster of the people actually building Observability.

Looking for engineers who’ve worked on Observability for real, not just listed it on LinkedIn? The fastest path is the contributor list of these repos. Their commits, issues, and READMEs are public proof of depth.

Refolk turns this list into a search. Ask for “maintainers of top Observability repos who are hiring”, “Observability engineers in San Francisco”, or “founders shipping Observability” and Refolk returns a ranked shortlist with sources.

How this list is built

Refolk searched GitHub for public repositories tagged with the Observability topic, ranked them by stargazer count, and kept those with at least 50 stars. The list refreshes once a day.
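
The steps described above (search by topic, rank by stars, keep repositories with at least 50 stars) can be sketched in a few lines of Python. This is an illustrative sketch, not Refolk's actual pipeline, which the page does not document. It assumes the public GitHub REST search endpoint and its `topic:` and `stars:` query qualifiers; the function names `build_search_url` and `rank_repos` and the 25-item limit are hypothetical choices made for this example.

```python
from urllib.parse import urlencode

# Public GitHub REST endpoint for repository search (assumed data source).
GITHUB_SEARCH_URL = "https://api.github.com/search/repositories"


def build_search_url(topic: str, min_stars: int = 50, per_page: int = 25) -> str:
    """Build a search URL for repositories tagged `topic`, sorted by stars."""
    query = f"topic:{topic} stars:>={min_stars}"
    params = {"q": query, "sort": "stars", "order": "desc", "per_page": per_page}
    return f"{GITHUB_SEARCH_URL}?{urlencode(params)}"


def rank_repos(repos: list[dict], min_stars: int = 50, limit: int = 25) -> list[dict]:
    """Drop repositories below the star threshold, then rank by star count.

    Each repo dict is expected to carry `full_name` and `stargazers_count`,
    the field names used in GitHub search results.
    """
    kept = [r for r in repos if r["stargazers_count"] >= min_stars]
    kept.sort(key=lambda r: r["stargazers_count"], reverse=True)
    return kept[:limit]


if __name__ == "__main__":
    # Hand-written sample; "example/tiny" is a made-up repo below the threshold.
    sample = [
        {"full_name": "langfuse/langfuse", "stargazers_count": 29442},
        {"full_name": "example/tiny", "stargazers_count": 12},
        {"full_name": "netdata/netdata", "stargazers_count": 79331},
    ]
    print(build_search_url("observability"))
    for rank, repo in enumerate(rank_repos(sample), start=1):
        print(rank, repo["full_name"], repo["stargazers_count"])
```

Filtering locally in `rank_repos` as well as in the query keeps the threshold enforced even if the input comes from a cached or differently filtered source.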

Last refreshed: Sun, 21 Jun 2026 07:10:38 GMT

