Refolk

Top Python machine learning repositories on GitHub

Libraries, models, and end-to-end systems for traditional and modern ML. Filtered to projects whose primary language is Python.

Ranked by stars across 6,322 Python repositories tagged machine-learning. Refreshed daily.

  1. huggingface/transformers · ★ 160,327 · ⑂ 33,126

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    • nlp
    • natural-language-processing
    • pytorch
    • pytorch-transformers
    • transformer
    • model-hub
  2. pytorch/pytorch · ★ 99,711 · ⑂ 27,695

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    • neural-network
    • autograd
    • gpu
    • numpy
    • deep-learning
    • tensor
  3. d2l-ai/d2l-zh · ★ 77,696 · ⑂ 12,281

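    Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.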

    • deep-learning
    • book
    • notebook
    • natural-language-processing
    • computer-vision
    • machine-learning
  4. OpenBB-finance/OpenBB · ★ 67,123 · ⑂ 6,726

    Financial data platform for analysts, quants and AI agents.

    • python
    • machine-learning
    • finance
    • stocks
    • quantitative-finance
    • economics
  5.

    🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

    • deep-learning
    • deep-learning-tutorial
    • pytorch
    • gan
    • transformers
    • reinforcement-learning
  6. scikit-learn/scikit-learn · ★ 65,988 · ⑂ 26,991

    scikit-learn: machine learning in Python

    • machine-learning
    • python
    • statistics
    • data-science
    • data-analysis
  7. keras-team/keras · ★ 64,060 · ⑂ 19,765

    Deep Learning for humans

    • deep-learning
    • tensorflow
    • neural-networks
    • machine-learning
    • data-science
    • python
  8. ultralytics/yolov5 · ★ 57,335 · ⑂ 17,482

    YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

    • yolov3
    • yolov5
    • object-detection
    • pytorch
    • onnx
    • coreml
  9. ultralytics/ultralytics · ★ 56,820 · ⑂ 10,921

    Ultralytics YOLO 🚀

    • hub
    • ultralytics
    • yolov8
    • yolo
    • deep-learning
    • image-classification
  10. ageitgey/face_recognition · ★ 56,391 · ⑂ 13,721

    The world's simplest facial recognition api for Python and the command line

    • machine-learning
    • face-detection
    • face-recognition
    • python
  11. deepfakes/faceswap · ★ 55,214 · ⑂ 13,403

    Deepfakes Software For All

    • faceswap
    • face-swap
    • deep-learning
    • deeplearning
    • deep-neural-networks
    • deepfakes
  12. apache/airflow · ★ 45,307 · ⑂ 17,010

    Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

    • airflow
    • apache
    • apache-airflow
    • python
    • scheduler
    • workflow
  13. streamlit/streamlit · ★ 44,475 · ⑂ 4,229

    Streamlit — A faster way to build and share data apps.

    • python
    • machine-learning
    • data-science
    • deep-learning
    • data-visualization
    • streamlit
  14. gradio-app/gradio · ★ 42,519 · ⑂ 3,434

    Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

    • machine-learning
    • models
    • ui
    • ui-components
    • interface
    • python
  15. ray-project/ray · ★ 42,442 · ⑂ 7,531

    Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

    • ray
    • distributed
    • parallel
    • machine-learning
    • reinforcement-learning
    • deep-learning
  16. deepspeedai/DeepSpeed · ★ 42,269 · ⑂ 4,820

    DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

    • deep-learning
    • pytorch
    • gpu
    • machine-learning
    • billion-parameters
    • data-parallelism
  17. microsoft/qlib · ★ 42,122 · ⑂ 6,630

    Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.

    • quantitative-finance
    • machine-learning
    • stock-data
    • platform
    • finance
    • algorithmic-trading
  18. paperless-ngx/paperless-ngx · ★ 40,302 · ⑂ 2,642

    A community-supported supercharged document management system: scan, index and archive all your documents

    • angular
    • archiving
    • django
    • dms
    • document-management
    • document-management-system
  19. roboflow/supervision · ★ 38,361 · ⑂ 3,414

    We write your reusable computer vision tools. 💜

    • computer-vision
    • image-processing
    • python
    • yolo
    • instance-segmentation
    • object-detection
  20. LAION-AI/Open-Assistant · ★ 37,410 · ⑂ 3,296

    OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

    • chatgpt
    • language-model
    • rlhf
    • ai
    • assistant
    • discord-bot
  21. explosion/spaCy · ★ 33,546 · ⑂ 4,679

    💫 Industrial-strength Natural Language Processing (NLP) in Python

    • natural-language-processing
    • data-science
    • machine-learning
    • python
    • cython
    • nlp
  22. eriklindernoren/ML-From-Scratch · ★ 31,421 · ⑂ 5,265

    Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.

    • machine-learning
    • deep-learning
    • deep-reinforcement-learning
    • machine-learning-from-scratch
    • data-science
    • data-mining
  23. Lightning-AI/pytorch-lightning · ★ 31,116 · ⑂ 3,720

    Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

    • python
    • deep-learning
    • artificial-intelligence
    • ai
    • pytorch
    • data-science
  24. iperov/DeepFaceLive · ★ 30,809 · ⑂ 1,229

    Real-time face swap for PC streaming or video calls

    • deepfake
    • real-time
    • faceswap
    • webcam
    • streaming
    • videocall
  25. JaidedAI/EasyOCR · ★ 29,409 · ⑂ 3,568

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.

    • ocr
    • deep-learning
    • crnn
    • pytorch
    • lstm
    • machine-learning

Find Python engineers shipping machine learning

The list above ranks the most-starred public Python repositories tagged with the machine-learning topic, drawn from the public GitHub graph. Across the 6,322 matching repositories, the contributors form a tight cluster of engineers with both strong Python skills and real machine learning experience.

That overlap is narrower than it looks. Many Python engineers have never shipped machine learning, and not every machine learning maintainer works in Python. The people on this list's contributor graph are the ones who do both.

Refolk turns this list into a search. Ask for “Python machine learning maintainers hiring” or “Python engineers shipping machine learning in 2025” and Refolk returns a ranked shortlist with the commits, profiles, and projects behind each name.

How this list is built

Refolk searched GitHub for public Python repositories tagged with the machine-learning topic, kept those with at least 25 stars, and ranked them by stargazer count. The list refreshes once a day.

Last refreshed: Thu, 07 May 2026 05:54:26 GMT
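
The build steps described above can be approximated with GitHub's public repository search. The sketch below is not Refolk's actual pipeline, which this page does not describe; it assumes the GitHub REST search endpoint and its `topic:`, `language:`, and `stars:` qualifiers. The function names `build_search_url`, `rank_repositories`, and `fetch_top_repositories` are hypothetical and chosen for illustration only.

```python
import json
import urllib.request
from urllib.parse import urlencode

# GitHub REST endpoint for repository search.
GITHUB_SEARCH_URL = "https://api.github.com/search/repositories"


def build_search_url(topic="machine-learning", language="python",
                     min_stars=25, per_page=25):
    """Build a search URL for repositories with a topic, language, and star floor."""
    query = f"topic:{topic} language:{language} stars:>={min_stars}"
    params = {"q": query, "sort": "stars", "order": "desc", "per_page": per_page}
    return f"{GITHUB_SEARCH_URL}?{urlencode(params)}"


def rank_repositories(repos, min_stars=25, limit=25):
    """Drop repositories below the star floor and sort the rest by stars, descending."""
    kept = [r for r in repos if r["stargazers_count"] >= min_stars]
    kept.sort(key=lambda r: r["stargazers_count"], reverse=True)
    return kept[:limit]


def fetch_top_repositories(limit=25):
    """Query GitHub and return the ranked items (assumption: unauthenticated
    access, which is rate limited; a real job would send a token)."""
    request = urllib.request.Request(
        build_search_url(per_page=limit),
        headers={"Accept": "application/vnd.github+json"},
    )
    with urllib.request.urlopen(request) as response:
        items = json.load(response)["items"]
    return rank_repositories(items, limit=limit)
```

Calling `fetch_top_repositories()` makes a network request, so it is left uncalled here. `rank_repositories` repeats the star filter locally so the ranking step can be checked against any list of dictionaries that carry a `stargazers_count` field.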
