Refolk

Top Scala repositories on GitHub

JVM language blending object-oriented and functional paradigms.

Ranked by stars across 1,723 Scala repositories on GitHub. Refreshed daily.

  1. 1
    twitter/the-algorithm73,113 · ⑂ 13,259

    Source code for the X Recommendation Algorithm

  2. 2
    apache/spark43,240 · ⑂ 29,170

    Apache Spark - A unified analytics engine for large-scale data processing

    • python
    • scala
    • r
    • java
    • big-data
    • jdbc
  3. 3
    lichess-org/lila18,162 · ⑂ 2,651

    ♞ lichess.org: the forever free, adless and open source chess server ♞

    • scala
    • chess
    • play-framework
    • non-profit
    • functional-programming
    • type-safe
  4. 4
    prisma/prisma116,403 · ⑂ 841

    💾 Database Tools incl. ORM, Migrations and Admin UI (Postgres, MySQL & MongoDB) [deprecated]

    • orm
    • database
    • graphql
    • datamapper
    • serverless
    • dao
  5. 5
    scala/scala14,449 · ⑂ 3,088

    Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3

    • scala
    • scala-compiler
    • scala-programming-language
    • scala-library
    • jvm-languages
    • functional-programming
  6. 6
    akka/akka-core13,279 · ⑂ 3,550

    A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.

    • reactive
    • distributed-systems
    • concurrency
    • high-performance
    • akka
    • actor-model
  7. 7
    playframework/playframework12,622 · ⑂ 4,036

    The Community Maintained High Velocity Web Framework For Java and Scala.

    • scala
    • java
    • reactive
    • web-framework
    • restful
    • play
  8. 8
    apache/predictionio12,529 · ⑂ 1,908

    PredictionIO, a machine learning server for developers and ML engineers.

    • scala
    • big-data
    • predictionio
  9. 9
    rtyley/bfg-repo-cleaner12,102 · ⑂ 580

    Removes large or troublesome blobs like git-filter-branch does, but faster. And written in Scala

    • git
  10. 10
    yahoo/CMAK11,934 · ⑂ 2,488

    CMAK is a tool for managing Apache Kafka clusters

    • kafka
    • scala
    • cluster-management
    • big-data
  11. 11
    gitbucket/gitbucket9,372 · ⑂ 1,262

    A Git platform powered by Scala with easy installation, high extensibility & GitHub API compatibility

    • scala
    • gitbucket
    • git
    • scalatra
  12. 12
    twitter/finagle8,869 · ⑂ 1,437

    A fault tolerant, protocol-agnostic RPC system

    • rpc
    • distributed-systems
    • finagle
    • http
    • http2
    • thrift
  13. 13
    delta-io/delta8,772 · ⑂ 2,087

    An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

    • spark
    • acid
    • big-data
    • analytics
    • delta-lake
  14. 14
    twitter-archive/snowflake7,771 · ⑂ 1,118

    Snowflake is a network service for generating unique ID numbers at high scale with some simple guarantees.

  15. 15
    snowplow/snowplow7,014 · ⑂ 1,177

    The leader in Customer Data Infrastructure

    • snowplow
    • analytics
    • data
    • data-pipeline
    • data-collection
    • product-analytics
  16. 16
    OpenXiangShan/XiangShan7,000 · ⑂ 901

    Open-source high-performance RISC-V processor

    • risc-v
    • microarchitecture
    • chisel
  17. 17
    gatling/gatling6,898 · ⑂ 1,209

    Modern Load Testing as Code

    • netty
    • scala
    • loadtesting
    • automation
    • gatling
    • load-testing
  18. 18
    lhartikk/ArnoldC6,870 · ⑂ 295

    Arnold Schwarzenegger based programming language

  19. 19
    apache/openwhisk6,771 · ⑂ 1,174

    Apache OpenWhisk is an open source serverless cloud platform

    • openwhisk
    • apache
    • serverless
    • cloud
    • faas
    • functions-as-a-service
  20. 20
    scala/scala36,241 · ⑂ 1,158

    The Scala 3 compiler, also known as Dotty.

    • scala
    • scala3
    • epfl
    • compiler
    • dotty
  21. 21
    guardian/frontend5,887 · ⑂ 570

    The Guardian DotCom.

    • production
  22. 22
    fpinscala/fpinscala5,822 · ⑂ 3,030

    Code, exercises, answers, and hints to go along with the book "Functional Programming in Scala"

  23. 23
    typelevel/cats5,435 · ⑂ 1,239

    Lightweight, modular, and extensible library for functional programming.

  24. 24
    linkerd/linkerd5,324 · ⑂ 496

    Old repo for Linkerd 1.x. See the linkerd2 repo for Linkerd 2.x.

    • cloud-native
    • service-mesh
    • linkerd
    • service-discovery
  25. 25
    microsoft/SynapseML5,226 · ⑂ 861

    Simple and Distributed Machine Learning

    • spark
    • pyspark
    • azure
    • scala
    • microsoft
    • ml

Find Scala engineers and maintainers

The list above ranks the most-starred public Scala repositories, drawn from the public GitHub graph. Across 1,723 Scala repositories in the public graph, the maintainers, top contributors, and recurring committers are a powerful signal for where Scala expertise lives.

Behind every popular Scalaproject is a small group of people who actually shipped it. They’re the Scalaengineers, library authors, and infrastructure builders worth knowing — whether you’re hiring, partnering, or doing technical research.

Refolk turns this list into a search. Ask for Scala maintainers hiring”, “contributors to Scala repos based in Europe”, or “companies whose engineers ship Scala” and Refolk returns a ranked shortlist with the commits, repos, and profiles that earned each person a spot.

How this list is built

Refolk searched GitHub for public repositories whose primary language is Scala, ranked them by stargazer count, and kept those with at least 100 stars. The list refreshes once a day.

Last refreshed: Thu, 07 May 2026 05:55:37 GMT

Need a list like this for any search?

Refolk runs natural-language searches across GitHub, LinkedIn, and the open web. Try one of these to see how it works:

Top repositories in other languages

See all language lists.

Popular Scala sub-categories