🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
This curated list contains 920 awesome open-source projects with a total of 4.9M stars grouped into 34 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!
🧙♂️ Discover other best-of lists or create your own.
📫 Subscribe to our newsletter for updates and trending projects.
- Machine Learning Frameworks 63 projects
- Data Visualization 55 projects
- Text Data & NLP 103 projects
- Image Data 64 projects
- Graph Data 36 projects
- Audio Data 29 projects
- Geospatial Data 22 projects
- Financial Data 25 projects
- Time Series Data 29 projects
- Medical Data 19 projects
- Tabular Data 5 projects
- Optical Character Recognition 12 projects
- Data Containers & Structures 1 projects
- Data Loading & Extraction 1 projects
- Web Scraping & Crawling 1 projects
- Data Pipelines & Streaming 2 projects
- Distributed Machine Learning 36 projects
- Hyperparameter Optimization & AutoML 52 projects
- Reinforcement Learning 23 projects
- Recommender Systems 17 projects
- Privacy Machine Learning 7 projects
- Workflow & Experiment Tracking 40 projects
- Model Serialization & Deployment 20 projects
- Model Interpretability 55 projects
- Vector Similarity Search (ANN) 13 projects
- Probabilistics & Statistics 24 projects
- Adversarial Robustness 9 projects
- GPU & Accelerator Utilities 20 projects
- Tensorflow Utilities 16 projects
- Jax Utilities 3 projects
- Sklearn Utilities 19 projects
- Pytorch Utilities 32 projects
- Database Clients 1 projects
- Others 66 projects
- 🥇🥈🥉 Combined project-quality score
- ⭐️ Star count from GitHub
- 🐣 New project (less than 6 months old)
- 💤 Inactive project (6 months no activity)
- 💀 Dead project (12 months no activity)
- 📈📉 Project is trending up or down
- ➕ Project was recently added
- ❗️ Warning (e.g. missing/risky license)
- 👨💻 Contributors count from GitHub
- 🔀 Fork count from GitHub
- 📋 Issue count from GitHub
- ⏱️ Last update timestamp on package manager
- 📥 Download count from package manager
- 📦 Number of dependent projects
Tensorflow related project
Sklearn related project
PyTorch related project
MxNet related project
Apache Spark related project
Jupyter related project
PaddlePaddle related project
Pandas related project
Jax related project
General-purpose machine learning and deep learning frameworks.
Tensorflow (🥇56 · ⭐ 190K) - An Open Source Machine Learning Framework for Everyone. Apache-2

-
GitHub (👨💻 4.8K · 🔀 75K · 📦 490K · 📋 47K - 15% open · ⏱️ 03.04.2025):
git clone https://github.com/tensorflow/tensorflow
-
PyPi (📥 21M / month · 📦 8.7K · ⏱️ 12.03.2025):
pip install tensorflow
-
Conda (📥 5.4M · ⏱️ 25.03.2025):
conda install -c conda-forge tensorflow
-
Docker Hub (📥 79M · ⭐ 2.7K · ⏱️ 03.04.2025):
docker pull tensorflow/tensorflow
scikit-learn (🥇53 · ⭐ 62K) - scikit-learn: machine learning in Python. BSD-3

XGBoost (🥇46 · ⭐ 27K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or.. Apache-2
PaddlePaddle (🥇45 · ⭐ 23K) - PArallel Distributed Deep LEarning: Machine Learning.. Apache-2

pytorch-lightning (🥈44 · ⭐ 29K) - Pretrain, finetune ANY AI model of ANY size on.. Apache-2

StatsModels (🥈44 · ⭐ 11K · 📉) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
LightGBM (🥈42 · ⭐ 17K) - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT,.. MIT
Catboost (🥈41 · ⭐ 8.3K) - A fast, scalable, high performance Gradient Boosting on Decision.. Apache-2
einops (🥈36 · ⭐ 8.8K) - Flexible and powerful tensor operations for readable and reliable code.. MIT
Jina (🥈35 · ⭐ 22K) - Build multimodal AI applications with cloud-native stack. Apache-2
-
GitHub (👨💻 180 · 🔀 2.2K · 📋 1.9K - 0% open · ⏱️ 24.03.2025):
git clone https://github.com/jina-ai/jina
-
PyPi (📥 100K / month · 📦 29 · ⏱️ 24.03.2025):
pip install jina
-
Conda (📥 90K · ⏱️ 25.03.2025):
conda install -c conda-forge jina-core
-
Docker Hub (📥 1.8M · ⭐ 8 · ⏱️ 24.03.2025):
docker pull jinaai/jina
Vowpal Wabbit (🥈34 · ⭐ 8.5K · 💤) - Vowpal Wabbit is a machine learning system which pushes the.. BSD-3
Thinc (🥈34 · ⭐ 2.8K) - A refreshing functional take on deep learning, compatible with your favorite.. MIT
tensorflow-upstream (🥉31 · ⭐ 690) - TensorFlow ROCm port. Apache-2

Determined (🥉30 · ⭐ 3.1K) - Determined is an open-source machine learning platform.. Apache-2


Neural Network Libraries (🥉26 · ⭐ 2.7K) - Neural Network Libraries. Apache-2
Runhouse (🥉23 · ⭐ 1K) - Distribute and run AI workloads magically in Python, like PyTorch for.. Apache-2
ThunderSVM (🥉22 · ⭐ 1.6K · 💤) - ThunderSVM: A Fast SVM Library on GPUs and CPUs. Apache-2
chefboost (🥉20 · ⭐ 470) - A Lightweight Decision Tree Framework supporting regular algorithms:.. MIT
NeoML (🥉19 · ⭐ 770 · 💤) - Machine learning framework for both deep learning and traditional.. Apache-2
ThunderGBM (🥉18 · ⭐ 700) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
Show 23 hidden projects...
- dlib (🥈40 · ⭐ 14K) - A toolkit for making real world machine learning and data analysis..
❗️BSL-1.0
- MXNet (🥈39 · ⭐ 21K · 💀) - Lightweight, Portable, Flexible Distributed/Mobile Deep..
Apache-2
- Theano (🥈38 · ⭐ 9.9K · 💀) - Theano was a Python library that allows you to define, optimize, and..
BSD-3
- Chainer (🥈34 · ⭐ 5.9K · 💀) - A flexible framework of neural networks for deep learning.
MIT
- MindsDB (🥉33 · ⭐ 28K) - AIs query engine - Platform for building AI that can learn and..
❗️ICU
- tensorpack (🥉33 · ⭐ 6.3K · 💀) - A Neural Net Training Interface on TensorFlow, with..
Apache-2
- Turi Create (🥉32 · ⭐ 11K · 💀) - Turi Create simplifies the development of custom machine..
BSD-3
- TFlearn (🥉31 · ⭐ 9.6K · 💀) - Deep learning library featuring a higher-level API for TensorFlow.
MIT
- dyNET (🥉31 · ⭐ 3.4K · 💀) - DyNet: The Dynamic Neural Network Toolkit.
Apache-2
- CNTK (🥉29 · ⭐ 18K · 💀) - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit.
MIT
- Lasagne (🥉28 · ⭐ 3.9K · 💀) - Lightweight library to build and train neural networks in Theano.
MIT
- SHOGUN (🥉27 · ⭐ 3K · 💀) - Unified and efficient Machine Learning.
BSD-3
- EvaDB (🥉27 · ⭐ 2.7K · 💀) - Database system for AI-powered apps.
Apache-2
- neon (🥉24 · ⭐ 3.9K · 💀) - Intel Nervana reference deep learning framework committed to best..
Apache-2
- xLearn (🥉24 · ⭐ 3.1K · 💀) - High performance, easy-to-use, and scalable machine learning (ML)..
Apache-2
- Neural Tangents (🥉24 · ⭐ 2.3K · 💀) - Fast and Easy Infinite Neural Networks in Python.
Apache-2
- NeuPy (🥉24 · ⭐ 740 · 💀) - NeuPy is a Tensorflow based python library for prototyping and building..
MIT
- Torchbearer (🥉22 · ⭐ 640 · 💀) - torchbearer: A model fitting library for PyTorch.
MIT
- mace (🥉21 · ⭐ 5K · 💀) - MACE is a deep learning inference framework optimized for mobile..
Apache-2
- elegy (🥉20 · ⭐ 470 · 💀) - A High Level API for Deep Learning in JAX.
MIT
- Objax (🥉19 · ⭐ 770 · 💀) - Objax is a machine learning framework that provides an Object..
Apache-2
- StarSpace (🥉16 · ⭐ 4K · 💀) - Learning embeddings for classification, retrieval and ranking.
MIT
- nanodl (🥉14 · ⭐ 280 · 💤) - A Jax-based library for designing and training transformer models..
MIT
General-purpose and task-specific data visualization libraries.
Matplotlib (🥇49 · ⭐ 21K) - matplotlib: plotting with Python. ❗Unlicensed
Plotly (🥇46 · ⭐ 17K) - The interactive graphing library for Python. MIT
-
GitHub (👨💻 280 · 🔀 2.6K · 📥 120 · 📦 390K · 📋 3.2K - 20% open · ⏱️ 28.03.2025):
git clone https://github.com/plotly/plotly.py
-
PyPi (📥 22M / month · 📦 7.8K · ⏱️ 31.03.2025):
pip install plotly
-
Conda (📥 8.9M · ⏱️ 25.03.2025):
conda install -c conda-forge plotly
-
npm (📥 63K / month · 📦 9 · ⏱️ 12.01.2021):
npm install plotlywidget
HoloViews (🥈39 · ⭐ 2.8K) - With Holoviews, your data visualizes itself. BSD-3

-
GitHub (👨💻 150 · 🔀 410 · 📦 15K · 📋 3.4K - 32% open · ⏱️ 31.03.2025):
git clone https://github.com/holoviz/holoviews
-
PyPi (📥 500K / month · 📦 430 · ⏱️ 31.03.2025):
pip install holoviews
-
Conda (📥 2M · ⏱️ 25.03.2025):
conda install -c conda-forge holoviews
-
npm (📥 200 / month · 📦 5 · ⏱️ 14.01.2025):
npm install @pyviz/jupyterlab_pyviz
pandas-profiling (🥈38 · ⭐ 13K) - 1 Line of code data quality profiling & exploratory.. MIT


Perspective (🥈34 · ⭐ 9K) - A data visualization and analytics component, especially.. Apache-2

-
GitHub (👨💻 98 · 🔀 1.2K · 📥 11K · 📦 170 · 📋 880 - 12% open · ⏱️ 25.03.2025):
git clone https://github.com/finos/perspective
-
PyPi (📥 17K / month · 📦 30 · ⏱️ 20.03.2025):
pip install perspective-python
-
Conda (📥 1.9M · ⏱️ 25.03.2025):
conda install -c conda-forge perspective
-
npm (📥 1.1K / month · 📦 6 · ⏱️ 20.03.2025):
npm install @finos/perspective-jupyterlab
VisPy (🥈34 · ⭐ 3.4K) - High-performance interactive 2D/3D data visualization library. BSD-3

-
GitHub (👨💻 200 · 🔀 620 · 📦 1.9K · 📋 1.5K - 25% open · ⏱️ 31.03.2025):
git clone https://github.com/vispy/vispy
-
PyPi (📥 150K / month · 📦 170 · ⏱️ 17.06.2024):
pip install vispy
-
Conda (📥 770K · ⏱️ 25.03.2025):
conda install -c conda-forge vispy
-
npm (📥 18 / month · 📦 3 · ⏱️ 15.03.2020):
npm install vispy
datashader (🥈34 · ⭐ 3.4K) - Quickly and accurately render even the largest data. BSD-3
hvPlot (🥈32 · ⭐ 1.2K) - A high-level plotting API for pandas, dask, xarray, and networkx built.. BSD-3
mpld3 (🥉31 · ⭐ 2.4K) - An interactive data visualization tool which brings matplotlib graphics to.. BSD-3
-
GitHub (👨💻 53 · 🔀 360 · 📦 7.2K · 📋 370 - 59% open · ⏱️ 30.10.2024):
git clone https://github.com/mpld3/mpld3
-
PyPi (📥 360K / month · 📦 150 · ⏱️ 23.12.2023):
pip install mpld3
-
Conda (📥 230K · ⏱️ 25.03.2025):
conda install -c conda-forge mpld3
-
npm (📥 1.4K / month · 📦 9 · ⏱️ 23.12.2023):
npm install mpld3
D-Tale (🥉30 · ⭐ 4.9K) - Visualizer for pandas data structures. ❗️LGPL-2.1


bqplot (🥉30 · ⭐ 3.7K) - Plotting library for IPython/Jupyter notebooks. Apache-2

-
GitHub (👨💻 65 · 🔀 470 · 📦 61 · 📋 640 - 42% open · ⏱️ 22.10.2024):
git clone https://github.com/bqplot/bqplot
-
PyPi (📥 210K / month · 📦 110 · ⏱️ 24.12.2024):
pip install bqplot
-
Conda (📥 1.5M · ⏱️ 25.03.2025):
conda install -c conda-forge bqplot
-
npm (📥 1.8K / month · 📦 21 · ⏱️ 24.12.2024):
npm install bqplot
AutoViz (🥉27 · ⭐ 1.8K · 💤) - Automatically Visualize any dataset, any size with a single line.. Apache-2
Plotly-Resampler (🥉27 · ⭐ 1.1K) - Visualize large time series data with plotly.py. MIT
Chartify (🥉25 · ⭐ 3.6K) - Python library that makes it easy for data scientists to create.. Apache-2
data-validation (🥉25 · ⭐ 770) - Library for exploring and validating machine learning.. Apache-2


python-ternary (🥉25 · ⭐ 760 · 💤) - Ternary plotting library for python with matplotlib. MIT
vegafusion (🥉20 · ⭐ 350) - Serverside scaling for Vega and Altair visualizations. BSD-3
-
GitHub (👨💻 6 · 🔀 20 · 📥 12K · 📋 140 - 36% open · ⏱️ 23.02.2025):
git clone https://github.com/vegafusion/vegafusion
-
PyPi (📥 1.5K / month · 📦 2 · ⏱️ 09.05.2024):
pip install vegafusion-jupyter
-
Conda (📥 410K · ⏱️ 25.03.2025):
conda install -c conda-forge vegafusion-python-embed
-
npm (📥 230 / month · 📦 3 · ⏱️ 09.05.2024):
npm install vegafusion-jupyter
animatplot (🥉19 · ⭐ 420 · 💤) - A python package for animating plots build on matplotlib. MIT
Show 18 hidden projects...
- missingno (🥉30 · ⭐ 4.1K · 💀) - Missing data visualization module for Python.
MIT
- Cufflinks (🥉28 · ⭐ 3.1K · 💀) - Productivity Tools for Plotly + Pandas.
MIT
- pythreejs (🥉28 · ⭐ 960 · 💀) - A Jupyter - Three.js bridge.
BSD-3
- Facets Overview (🥉27 · ⭐ 7.4K · 💀) - Visualizations for machine learning datasets.
Apache-2
- Sweetviz (🥉27 · ⭐ 3K · 💀) - Visualize and compare datasets, target values and associations, with..
MIT
- HiPlot (🥉25 · ⭐ 2.8K · 💀) - HiPlot makes understanding high dimensional data easy.
MIT
- HyperTools (🥉25 · ⭐ 1.8K · 💀) - A Python toolbox for gaining geometric insights into high-..
MIT
- PandasGUI (🥉24 · ⭐ 3.2K · 💀) - A GUI for Pandas DataFrames.
❗️MIT-0
- Multicore-TSNE (🥉24 · ⭐ 1.9K · 💀) - Parallel t-SNE implementation with Python and Torch..
BSD-3
- ridgeplot (🥉24 · ⭐ 220) - Beautiful ridgeline plots in Python.
MIT
- Pandas-Bokeh (🥉22 · ⭐ 880 · 💀) - Bokeh Plotting Backend for Pandas and GeoPandas.
MIT
- pivottablejs (🥉22 · ⭐ 700 · 💀) - Dragndrop Pivot Tables and Charts for Jupyter/IPython..
MIT
- joypy (🥉22 · ⭐ 580 · 💀) - Joyplots in Python with matplotlib & pandas.
MIT
- PDPbox (🥉21 · ⭐ 850 · 💀) - python partial dependence plot toolbox.
MIT
- pdvega (🥉16 · ⭐ 340 · 💀) - Interactive plotting for Pandas using Vega-Lite.
MIT
- data-describe (🥉15 · ⭐ 300 · 💀) - datadescribe: Pythonic EDA Accelerator for Data Science.
Apache-2
- nx-altair (🥉15 · ⭐ 220 · 💀) - Draw interactive NetworkX graphs with Altair.
MIT
- nptsne (🥉13 · ⭐ 33 · 💀) - nptsne is a numpy compatible python binary package that offers a..
Apache-2
Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.
transformers (🥇52 · ⭐ 140K) - Transformers: State-of-the-art Machine Learning for.. Apache-2


nltk (🥇45 · ⭐ 14K) - Suite of libraries and programs for symbolic and statistical natural.. Apache-2
sentence-transformers (🥇44 · ⭐ 16K · 📈) - State-of-the-Art Text Embeddings. Apache-2

flair (🥇40 · ⭐ 14K) - A very simple framework for state-of-the-art Natural Language Processing.. MIT

Tokenizers (🥇40 · ⭐ 9.6K) - Fast State-of-the-Art Tokenizers optimized for Research and.. Apache-2
gensim (🥇38 · ⭐ 16K) - Topic Modelling for Humans. ❗️LGPL-2.1
ChatterBot (🥇38 · ⭐ 14K) - ChatterBot is a machine learning, conversational dialog engine for.. BSD-3
TextBlob (🥇38 · ⭐ 9.3K) - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech.. MIT
haystack (🥈37 · ⭐ 20K · 📈) - AI orchestration framework to build customizable, production-.. Apache-2
sentencepiece (🥈37 · ⭐ 11K) - Unsupervised text tokenizer for Neural Network-based text.. Apache-2
TensorFlow Text (🥈36 · ⭐ 1.3K) - Making text a first-class citizen in TensorFlow. Apache-2

qdrant (🥈34 · ⭐ 23K) - Qdrant - High-performance, massive-scale Vector Database and Vector.. Apache-2
-
GitHub (👨💻 130 · 🔀 1.6K · 📥 370K · 📦 120 · 📋 1.5K - 23% open · ⏱️ 31.03.2025):
git clone https://github.com/qdrant/qdrant
stanza (🥈33 · ⭐ 7.4K) - Stanford NLP Python library for tokenization, sentence segmentation,.. Apache-2
jellyfish (🥈33 · ⭐ 2.1K) - a python library for doing approximate and phonetic matching of strings. MIT
rubrix (🥈32 · ⭐ 4.4K) - Argilla is a collaboration tool for AI engineers and domain experts.. Apache-2
snowballstemmer (🥈32 · ⭐ 780) - Snowball compiler and stemming algorithms. BSD-3
DeepPavlov (🥈31 · ⭐ 6.8K) - An open source library for deep learning end-to-end dialog.. Apache-2

Opik (🥈31 · ⭐ 6.1K) - Debug, evaluate, and monitor your LLM applications, RAG systems, and.. Apache-2
Sumy (🥈29 · ⭐ 3.6K · 💤) - Module for automatic summarization of text documents and HTML pages. Apache-2
TextDistance (🥈29 · ⭐ 3.5K · 💤) - Compute distance between sequences. 30+ algorithms, pure.. MIT
SciSpacy (🥈29 · ⭐ 1.8K) - A full spaCy pipeline and models for scientific/biomedical documents. Apache-2
spacy-transformers (🥈29 · ⭐ 1.4K) - Use pretrained transformers like BERT, XLNet and GPT-2.. MIT
spacy
english-words (🥉26 · ⭐ 11K) - A text file containing 479k English words for all your.. Unlicense
scattertext (🥉26 · ⭐ 2.3K · 💤) - Beautiful visualizations of how language differs among.. Apache-2
PyTextRank (🥉26 · ⭐ 2.2K · 💤) - Python implementation of TextRank algorithms (textgraphs) for.. MIT
detoxify (🥉25 · ⭐ 1K) - Trained models & code to predict toxic comments on all 3 Jigsaw Toxic.. Apache-2
happy-transformer (🥉22 · ⭐ 530) - Happy Transformer makes it easy to fine-tune and.. Apache-2
huggingface
small-text (🥉21 · ⭐ 610) - Active Learning for Text Classification in Python. MIT


Show 57 hidden projects...
- AllenNLP (🥈37 · ⭐ 12K · 💀) - An open-source NLP research library, built on PyTorch.
Apache-2
- fastText (🥈35 · ⭐ 26K · 💀) - Library for fast text representation and classification.
MIT
- ParlAI (🥈32 · ⭐ 11K · 💀) - A framework for training and evaluating AI models on a variety of..
MIT
- fuzzywuzzy (🥈32 · ⭐ 9.3K · 💀) - Fuzzy String Matching in Python.
❗️GPL-2.0
- nlpaug (🥈30 · ⭐ 4.5K · 💀) - Data augmentation for NLP.
MIT
- GluonNLP (🥈29 · ⭐ 2.6K · 💀) - Toolkit that enables easy text preprocessing, datasets..
Apache-2
- langid (🥈29 · ⭐ 2.4K · 💀) - Stand-alone language identification system.
BSD-3
- Ciphey (🥈28 · ⭐ 19K · 💀) - Automatically decrypt encryptions without knowing the key or cipher,..
MIT
- vaderSentiment (🥈28 · ⭐ 4.7K · 💀) - VADER Sentiment Analysis. VADER (Valence Aware Dictionary..
MIT
- fastNLP (🥈28 · ⭐ 3.1K · 💀) - fastNLP: A Modularized and Extensible NLP Framework. Currently..
Apache-2
- textacy (🥈28 · ⭐ 2.2K · 💀) - NLP, before and after spaCy.
❗Unlicensed
- flashtext (🥉27 · ⭐ 5.6K · 💀) - Extract Keywords from sentence or Replace keywords in sentences.
MIT
- FARM (🥉27 · ⭐ 1.8K · 💀) - Fast & easy transfer learning for NLP. Harvesting language..
Apache-2
- pySBD (🥉27 · ⭐ 840 · 💀) - pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence..
MIT
- neuralcoref (🥉26 · ⭐ 2.9K · 💀) - Fast Coreference Resolution in spaCy with Neural Networks.
MIT
- polyglot (🥉26 · ⭐ 2.3K · 💀) - Multilingual text (NLP) processing toolkit.
❗️GPL-3.0
- underthesea (🥉26 · ⭐ 1.5K) - Underthesea - Vietnamese NLP Toolkit.
❗️GPL-3.0
- PyText (🥉25 · ⭐ 6.3K · 💀) - A natural language modeling framework based on PyTorch.
BSD-3
- OpenPrompt (🥉25 · ⭐ 4.5K · 💀) - An Open-Source Framework for Prompt-Learning.
Apache-2
- Snips NLU (🥉25 · ⭐ 3.9K · 💀) - Snips Python library to extract meaning from text.
Apache-2
- pytorch-nlp (🥉25 · ⭐ 2.2K · 💀) - Basic Utilities for PyTorch Natural Language Processing..
BSD-3
- sense2vec (🥉25 · ⭐ 1.6K · 💀) - Contextually-keyed word vectors.
MIT
- whoosh (🥉25 · ⭐ 620 · 💀) - Pure-Python full-text search library.
❗️BSD-1-Clause
- textgenrnn (🥉24 · ⭐ 4.9K · 💀) - Easily train your own text-generating neural network of any..
MIT
- MatchZoo (🥉24 · ⭐ 3.9K · 💀) - Facilitating the design, comparison and sharing of deep..
Apache-2
- promptsource (🥉24 · ⭐ 2.8K · 💀) - Toolkit for creating, sharing and using natural language..
Apache-2
- Kashgari (🥉24 · ⭐ 2.4K · 💀) - Kashgari is a production-level NLP Transfer learning..
Apache-2
- YouTokenToMe (🥉24 · ⭐ 970 · 💀) - Unsupervised text tokenizer focused on computational efficiency.
MIT
- Texar (🥉23 · ⭐ 2.4K · 💀) - Toolkit for Machine Learning, Natural Language Processing, and..
Apache-2
- jiant (🥉23 · ⭐ 1.7K · 💀) - jiant is an nlp toolkit.
MIT
- gpt-2-simple (🥉22 · ⭐ 3.4K · 💀) - Python package to easily retrain OpenAIs GPT-2 text-..
MIT
- NLP Architect (🥉22 · ⭐ 2.9K · 💀) - A model library for exploring state-of-the-art deep..
Apache-2
- Texthero (🥉22 · ⭐ 2.9K · 💀) - Text preprocessing, representation and visualization from zero to..
MIT
- stop-words (🥉22 · ⭐ 160 · 💀) - Get list of common stop words in various languages in Python.
BSD-3
- DELTA (🥉21 · ⭐ 1.6K · 💀) - DELTA is a deep learning based natural language and speech..
Apache-2
- anaGo (🥉21 · ⭐ 1.5K · 💀) - Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition,..
MIT
- DeepMatcher (🥉20 · ⭐ 5.2K · 💀) - Python package for performing Entity and Text Matching using..
BSD-3
- lightseq (🥉20 · ⭐ 3.3K · 💀) - LightSeq: A High Performance Library for Sequence Processing..
Apache-2
- textpipe (🥉20 · ⭐ 300 · 💀) - Textpipe: clean and extract metadata from text.
MIT
- pyfasttext (🥉20 · ⭐ 230 · 💀) - Yet another Python binding for fastText.
❗️GPL-3.0
- Camphr (🥉19 · ⭐ 340 · 💀) - Camphr - NLP libary for creating pipeline components.
Apache-2
spacy
- numerizer (🥉19 · ⭐ 230 · 💤) - A Python module to convert natural language numerics into ints and..
MIT
- NeuroNER (🥉18 · ⭐ 1.7K · 💀) - Named-entity recognition using neural networks. Easy-to-use and..
MIT
- nboost (🥉18 · ⭐ 680 · 💀) - NBoost is a scalable, search-api-boosting platform for deploying..
Apache-2
- fastT5 (🥉18 · ⭐ 580 · 💀) - boost inference speed of T5 models by 5x & reduce the model size..
Apache-2
- textaugment (🥉18 · ⭐ 420 · 💀) - TextAugment: Text Augmentation Library.
MIT
- OpenNRE (🥉16 · ⭐ 4.4K · 💀) - An Open-Source Package for Neural Relation Extraction (NRE).
MIT
- Translate (🥉16 · ⭐ 830 · 💀) - Translate - a PyTorch Language Library.
BSD-3
- skift (🥉16 · ⭐ 240 · 💀) - scikit-learn wrappers for Python fastText.
MIT
- BLINK (🥉15 · ⭐ 1.2K · 💀) - Entity Linker solution.
MIT
- TextBox (🥉15 · ⭐ 1.1K · 💀) - TextBox 2.0 is a text generation library with pre-trained language..
MIT
- ONNX-T5 (🥉15 · ⭐ 250 · 💀) - Summarization, translation, sentiment-analysis, text-generation..
Apache-2
- NeuralQA (🥉15 · ⭐ 230 · 💀) - NeuralQA: A Usable Library for Question Answering on Large Datasets..
MIT
- Headliner (🥉15 · ⭐ 230 · 💀) - Easy training and deployment of seq2seq models.
MIT
- TransferNLP (🥉14 · ⭐ 290 · 💀) - NLP library designed for reproducible experimentation..
MIT
- textvec (🥉13 · ⭐ 190 · 💀) - Text vectorization tool to outperform TFIDF for classification..
MIT
- spacy-dbpedia-spotlight (🥉12 · ⭐ 110 · 💀) - A spaCy wrapper for DBpedia Spotlight.
MIT
spacy
Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, and classification.
PyTorch Image Models (🥇42 · ⭐ 34K) - The largest collection of PyTorch image encoders /.. Apache-2

torchvision (🥇42 · ⭐ 17K) - Datasets, Transforms and Models specific to Computer Vision. BSD-3

Albumentations (🥇41 · ⭐ 15K) - Fast and flexible image augmentation library. Paper about.. MIT

InsightFace (🥈38 · ⭐ 25K · 📈) - State-of-the-art 2D and 3D Face Analysis Project. MIT

opencv-python (🥈36 · ⭐ 4.8K) - Automated CI toolchain to produce precompiled opencv-python,.. MIT
detectron2 (🥈33 · ⭐ 32K) - Detectron2 is a platform for object detection, segmentation.. Apache-2

vit-pytorch (🥈31 · ⭐ 22K) - Implementation of Vision Transformer, a simple way to achieve.. MIT

PaddleDetection (🥈31 · ⭐ 13K) - Object Detection toolkit based on PaddlePaddle. It.. Apache-2

sahi (🥈30 · ⭐ 4.4K) - Framework agnostic sliced/tiled inference + interactive ui + error analysis.. MIT
Face Alignment (🥉28 · ⭐ 7.3K · 💤) - 2D and 3D Face alignment library build using pytorch. BSD-3

facenet-pytorch (🥉27 · ⭐ 4.8K · 💤) - Pretrained Pytorch face detection (MTCNN) and facial.. MIT

CellProfiler (🥉27 · ⭐ 970) - An open-source application for biological image analysis. BSD-3
Image Deduplicator (🥉24 · ⭐ 5.3K) - Finding duplicate images made easy!. Apache-2

segmentation_models (🥉24 · ⭐ 4.8K · 💤) - Segmentation models with pretrained backbones. Keras.. MIT

pytorchvideo (🥉24 · ⭐ 3.4K · 📉) - A deep learning library for video understanding.. Apache-2

tensorflow-graphics (🥉24 · ⭐ 2.8K) - TensorFlow Graphics: Differentiable Graphics Layers.. Apache-2

icevision (🥉22 · ⭐ 860) - An Agnostic Computer Vision Framework - Pluggable to any Training.. Apache-2
PySlowFast (🥉21 · ⭐ 6.9K) - PySlowFast: video understanding codebase from FAIR for.. Apache-2

Image Super-Resolution (🥉21 · ⭐ 4.7K) - Super-scale your images and run experiments with.. Apache-2

-
GitHub (👨💻 11 · 🔀 750 · 📋 220 - 48% open · ⏱️ 18.12.2024):
git clone https://github.com/idealo/image-super-resolution
-
PyPi (📥 6.7K / month · 📦 5 · ⏱️ 08.01.2020):
pip install ISR
-
Docker Hub (📥 270 · ⭐ 1 · ⏱️ 01.04.2019):
docker pull idealo/image-super-resolution-gpu
scenic (🥉18 · ⭐ 3.5K) - Scenic: A Jax Library for Computer Vision Research and Beyond. Apache-2

-
GitHub (👨💻 91 · 🔀 450 · 📋 270 - 56% open · ⏱️ 29.03.2025):
git clone https://github.com/google-research/scenic
Show 26 hidden projects...
- scikit-image (🥇42 · ⭐ 6.2K) - Image processing in Python.
❗Unlicensed
- imgaug (🥈37 · ⭐ 15K · 💀) - Image augmentation for machine learning experiments.
MIT
- Face Recognition (🥈36 · ⭐ 54K · 💀) - The worlds simplest facial recognition api for Python..
MIT
- MMDetection (🥈36 · ⭐ 31K · 💀) - OpenMMLab Detection Toolbox and Benchmark.
Apache-2
- glfw (🥈36 · ⭐ 14K) - A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and input.
❗️Zlib
- PyTorch3D (🥈33 · ⭐ 9.2K) - PyTorch3D is FAIRs library of reusable components for..
❗Unlicensed
- imutils (🥈32 · ⭐ 4.6K · 💀) - A series of convenience functions to make basic image processing..
MIT
- imageai (🥈31 · ⭐ 8.8K · 💀) - A python library built to empower developers to build applications..
MIT
- GluonCV (🥉29 · ⭐ 5.9K · 💀) - Gluon CV Toolkit.
Apache-2
- layout-parser (🥉28 · ⭐ 5.2K · 💀) - A Unified Toolkit for Deep Learning Based Document Image..
Apache-2
- Augmentor (🥉27 · ⭐ 5.1K · 💀) - Image augmentation library in Python for machine learning.
MIT
- chainercv (🥉27 · ⭐ 1.5K · 💀) - ChainerCV: a Library for Deep Learning in Computer Vision.
MIT
- Pillow-SIMD (🥉25 · ⭐ 2.2K · 💤) - The friendly PIL fork.
❗️PIL
- vissl (🥉23 · ⭐ 3.3K · 💀) - VISSL is FAIRs library of extensible, modular and scalable..
MIT
- Luminoth (🥉23 · ⭐ 2.4K · 💀) - Deep Learning toolkit for Computer Vision.
BSD-3
- Classy Vision (🥉23 · ⭐ 1.6K · 💀) - An end-to-end PyTorch framework for image and video..
MIT
- deep-daze (🥉22 · ⭐ 4.4K · 💀) - Simple command line tool for text to image generation using..
MIT
- detecto (🥉22 · ⭐ 620 · 💀) - Build fully-functioning computer vision models with PyTorch.
MIT
- DE⫶TR (🥉21 · ⭐ 14K · 💀) - End-to-End Object Detection with Transformers.
Apache-2
- image-match (🥉21 · ⭐ 3K · 💀) - Quickly search over billions of images.
Apache-2
- nude.py (🥉20 · ⭐ 930 · 💀) - Nudity detection with Python.
MIT
- pycls (🥉18 · ⭐ 2.2K · 💀) - Codebase for Image Classification Research, written in PyTorch.
MIT
- solt (🥉18 · ⭐ 270) - Streaming over lightweight data transformations.
MIT
- Caer (🥉17 · ⭐ 790 · 💀) - A lightweight Computer Vision library. Scale your models, not boilerplate.
MIT
- Torch Points 3D (🥉17 · ⭐ 230 · 💀) - Pytorch framework for doing deep learning on point..
BSD-3
- HugsVision (🥉15 · ⭐ 200 · 💀) - HugsVision is a easy to use huggingface wrapper for state-of-..
MIT
huggingface
Libraries for graph processing, clustering, embedding, and machine learning tasks.
PyTorch Geometric (🥇40 · ⭐ 22K) - Graph Neural Network Library for PyTorch. MIT

dgl (🥇37 · ⭐ 14K) - Python package built to ease deep learning on graph, on top of existing DL.. Apache-2
pygraphistry (🥈31 · ⭐ 2.2K) - PyGraphistry is a Python library to quickly load, shape,.. BSD-3

PyKEEN (🥈30 · ⭐ 1.8K) - A Python library for learning and evaluating knowledge graph embeddings. MIT
pytorch_geometric_temporal (🥈28 · ⭐ 2.8K) - PyTorch Geometric Temporal: Spatiotemporal Signal.. MIT

torch-cluster (🥈24 · ⭐ 860) - PyTorch Extension Library of Optimized Graph Cluster.. MIT

Show 26 hidden projects...
- igraph (🥇32 · ⭐ 1.4K) - Python interface for igraph.
❗️GPL-2.0
- Spektral (🥈28 · ⭐ 2.4K · 💀) - Graph Neural Networks with Keras and Tensorflow 2.
MIT
- StellarGraph (🥈27 · ⭐ 3K · 💀) - StellarGraph - Machine Learning on Graphs.
Apache-2
- pygal (🥈27 · ⭐ 2.7K · 💤) - PYthon svg GrAph plotting Library.
❗️LGPL-3.0
- AmpliGraph (🥈26 · ⭐ 2.2K · 💀) - Python library for Representation Learning on Knowledge..
Apache-2
- Paddle Graph Learning (🥈26 · ⭐ 1.6K · 💀) - Paddle Graph Learning (PGL) is an efficient and..
Apache-2
- Karate Club (🥈24 · ⭐ 2.2K · 💤) - Karate Club: An API Oriented Open-source Python Framework..
❗️GPL-3.0
- PyTorch-BigGraph (🥉23 · ⭐ 3.4K · 💀) - Generate embeddings from large-scale graph-structured..
BSD-3
- jraph (🥉23 · ⭐ 1.4K · 💀) - A Graph Neural Network Library in Jax.
Apache-2
- graph4nlp (🥉22 · ⭐ 1.7K · 💀) - Graph4nlp is the library for the easy use of Graph..
Apache-2
- graph-nets (🥉21 · ⭐ 5.4K · 💀) - Build Graph Nets in Tensorflow.
Apache-2
- pyRDF2Vec (🥉21 · ⭐ 260 · 💀) - Python Implementation and Extension of RDF2Vec.
MIT
- DeepWalk (🥉20 · ⭐ 2.7K · 💀) - DeepWalk - Deep Learning for Graphs.
❗️GPL-3.0
- DIG (🥉20 · ⭐ 1.9K · 💀) - A library for graph deep learning research.
❗️GPL-3.0
- GraphGym (🥉20 · ⭐ 1.8K · 💀) - Platform for designing and evaluating Graph Neural Networks (GNN).
MIT
- deepsnap (🥉20 · ⭐ 560 · 💀) - Python library assists deep learning on graphs.
MIT
- DeepGraph (🥉18 · ⭐ 290) - Analyze Data with Pandas-based Networks. Documentation:.
BSD-3
- kglib (🥉17 · ⭐ 550 · 💀) - TypeDB-ML is the Machine Learning integrations library for TypeDB.
Apache-2
- Sematch (🥉17 · ⭐ 440 · 💀) - semantic similarity framework for knowledge graph.
Apache-2
- Euler (🥉16 · ⭐ 2.9K · 💀) - A distributed graph deep learning framework.
Apache-2
- AutoGL (🥉15 · ⭐ 1.1K · 💀) - An autoML framework & toolkit for machine learning on graphs.
Apache-2
- ptgnn (🥉15 · ⭐ 380 · 💀) - A PyTorch Graph Neural Network Library.
MIT
- GraphEmbedding (🥉14 · ⭐ 3.8K · 💀) - Implementation and experiments of graph embedding..
MIT
- GraphSAGE (🥉14 · ⭐ 3.5K · 💀) - Representation learning on large graphs using stochastic..
MIT
- OpenNE (🥉14 · ⭐ 1.7K · 💀) - An Open-Source Package for Network Embedding (NE).
MIT
- OpenKE (🥉13 · ⭐ 3.9K · 💀) - An Open-Source Package for Knowledge Embedding (KE).
❗Unlicensed
Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.
speechbrain (🥇38 · ⭐ 9.6K) - A PyTorch-based Speech Toolkit. Apache-2

SpeechRecognition (🥈35 · ⭐ 8.7K) - Speech recognition module for Python, supporting several.. BSD-3
torchaudio (🥈35 · ⭐ 2.6K) - Data manipulation and transformation for audio signal.. BSD-2

audiomentations (🥈31 · ⭐ 2K) - A Python library for audio data augmentation. Useful for making.. MIT
python-soundfile (🥈30 · ⭐ 750 · 📉) - SoundFile is an audio library based on libsndfile, CFFI,.. BSD-3
pyAudioAnalysis (🥉28 · ⭐ 6K) - Python Audio Analysis Library: Feature Extraction,.. Apache-2
DeepSpeech (🥉23 · ⭐ 26K) - DeepSpeech is an open source embedded (offline, on-device).. MPL-2.0

Show 13 hidden projects...
- Coqui TTS (🥇36 · ⭐ 39K · 💀) - - a deep learning toolkit for Text-to-Speech, battle-..
MPL-2.0
- Pydub (🥇36 · ⭐ 9.3K · 💀) - Manipulate audio with a simple and easy high level interface.
MIT
- audioread (🥈30 · ⭐ 510 · 💀) - cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio..
MIT
- aubio (🥉28 · ⭐ 3.4K · 💀) - a library for audio and music analysis.
❗️GPL-3.0
- Essentia (🥉28 · ⭐ 3K) - C++ library for audio and music analysis, description and..
❗️AGPL-3.0
- TTS (🥉26 · ⭐ 9.8K · 💀) - Deep learning for Text to Speech (Discussion forum:..
MPL-2.0
- python_speech_features (🥉25 · ⭐ 2.4K · 💀) - This library provides common speech features for ASR..
MIT
- Dejavu (🥉23 · ⭐ 6.5K · 💀) - Audio fingerprinting and recognition in Python.
MIT
- nnAudio (🥉23 · ⭐ 1.1K · 💀) - Audio processing by using pytorch 1D convolution network.
MIT
- kapre (🥉22 · ⭐ 930 · 💀) - kapre: Keras Audio Preprocessors.
MIT
- TimeSide (🥉21 · ⭐ 380 · 📈) - scalable audio processing framework and server written in..
❗️AGPL-3.0
- Muda (🥉18 · ⭐ 230 · 💀) - A library for augmenting annotated audio data.
ISC
- textlesslib (🥉10 · ⭐ 540 · 💀) - Library for Textless Spoken Language Processing.
MIT
Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.
pydeck (🥇43 · ⭐ 13K) - WebGL2 powered visualization framework. MIT

-
GitHub (👨💻 280 · 🔀 2.1K · 📦 8.9K · 📋 3.2K - 12% open · ⏱️ 02.04.2025):
git clone https://github.com/visgl/deck.gl
-
PyPi (📥 7.9M / month · 📦 160 · ⏱️ 21.03.2025):
pip install pydeck
-
Conda (📥 720K · ⏱️ 25.03.2025):
conda install -c conda-forge pydeck
-
npm (📥 690K / month · 📦 340 · ⏱️ 24.03.2025):
npm install deck.gl
ArcGIS API (🥈35 · ⭐ 2K) - Documentation and samples for ArcGIS API for Python. Apache-2
ipyleaflet (🥉32 · ⭐ 1.5K) - A Jupyter - Leaflet.js bridge. MIT

-
GitHub (👨💻 92 · 🔀 360 · 📦 16K · 📋 660 - 45% open · ⏱️ 05.12.2024):
git clone https://github.com/jupyter-widgets/ipyleaflet
-
PyPi (📥 220K / month · 📦 280 · ⏱️ 22.07.2024):
pip install ipyleaflet
-
Conda (📥 1.4M · ⏱️ 25.03.2025):
conda install -c conda-forge ipyleaflet
-
npm (📥 3.2K / month · 📦 9 · ⏱️ 22.07.2024):
npm install jupyter-leaflet
pymap3d (🥉25 · ⭐ 410) - pure-Python (Numpy optional) 3D coordinate conversions for geospace ecef.. BSD-2
Show 8 hidden projects...
- geopy (🥈33 · ⭐ 4.6K · 💀) - Geocoding library for Python.
MIT
- Geocoder (🥈33 · ⭐ 1.6K · 💀) - Python Geocoder.
MIT
- Satpy (🥈33 · ⭐ 1.1K) - Python package for earth-observing satellite data processing.
❗️GPL-3.0
- Sentinelsat (🥉27 · ⭐ 1K · 💀) - Search and download Copernicus Sentinel satellite images.
❗️GPL-3.0
- EarthPy (🥉27 · ⭐ 510 · 💀) - A package built to support working with spatial data using open..
BSD-3
- prettymaps (🥉26 · ⭐ 12K) - Draw pretty maps from OpenStreetMap data! Built with osmnx..
❗️AGPL-3.0
- gmaps (🥉23 · ⭐ 760 · 💀) - Google maps for Jupyter notebooks.
BSD-3
- geoplotlib (🥉22 · ⭐ 1K · 💀) - python toolbox for visualizing geographical data and making maps.
MIT
Libraries for algorithmic stock/crypto trading, risk analytics, backtesting, technical analysis, and other tasks on financial data.
Alpha Vantage (🥈28 · ⭐ 4.5K · 💤) - A python wrapper for Alpha Vantage API for financial data. MIT
TensorTrade (🥉27 · ⭐ 4.9K · 💤) - An open source reinforcement learning framework for.. Apache-2
stockstats (🥉26 · ⭐ 1.4K) - Supply a wrapper ``StockDataFrame`` based on the.. BSD-3
finmarketpy (🥉25 · ⭐ 3.6K) - Python library for backtesting trading strategies & analyzing.. Apache-2
tf-quant-finance (🥉22 · ⭐ 4.8K) - High-performance TensorFlow library for quantitative.. Apache-2

Show 16 hidden projects...
- zipline (🥇33 · ⭐ 18K · 💀) - Zipline, a Pythonic Algorithmic Trading Library.
Apache-2
- pyfolio (🥇32 · ⭐ 5.9K · 💀) - Portfolio and risk analytics in Python.
Apache-2
- arch (🥇32 · ⭐ 1.4K) - ARCH models in Python.
❗Unlicensed
- ta (🥈31 · ⭐ 4.6K · 💀) - Technical Analysis Library using Pandas and Numpy.
MIT
- backtrader (🥈29 · ⭐ 17K · 💀) - Python Backtesting library for trading strategies.
❗️GPL-3.0
- Backtesting.py (🥈28 · ⭐ 6.2K) - Backtest trading strategies in Python.
❗️AGPL-3.0
- Alphalens (🥈28 · ⭐ 3.6K · 💀) - Performance analysis of predictive (alpha) stock factors.
Apache-2
- IB-insync (🥈28 · ⭐ 3K · 💀) - Python sync/async framework for Interactive Brokers API.
BSD-2
- empyrical (🥈28 · ⭐ 1.4K · 💀) - Common financial risk and performance metrics. Used by..
Apache-2
- Enigma Catalyst (🥉26 · ⭐ 2.5K · 💀) - An Algorithmic Trading Library for Crypto-Assets in..
Apache-2
- PyAlgoTrade (🥉24 · ⭐ 4.5K · 💀) - Python Algorithmic Trading Library.
Apache-2
- FinTA (🥉24 · ⭐ 2.2K · 💀) - Common financial technical indicators implemented in Pandas.
❗️LGPL-3.0
- Crypto Signals (🥉22 · ⭐ 5.1K · 💀) - Github.com/CryptoSignal - Trading & Technical Analysis Bot -..
MIT
- FinQuant (🥉22 · ⭐ 1.5K · 💀) - A program for financial portfolio management, analysis and..
MIT
- surpriver (🥉12 · ⭐ 1.8K · 💀) - Find big moving stocks before they move using machine..
❗️GPL-3.0
- pyrtfolio (🥉10 · ⭐ 150 · 💀) - Python package to generate stock portfolios.
❗️GPL-3.0
Libraries for forecasting, anomaly detection, feature extraction, and machine learning on time-series and sequential data.
StatsForecast (🥇34 · ⭐ 4.2K) - Lightning fast forecasting with statistical and econometric.. Apache-2
Prophet (🥇33 · ⭐ 19K) - Tool for producing high quality forecasts for time series data that has.. MIT
Darts (🥇33 · ⭐ 8.5K) - A python library for user-friendly forecasting and anomaly detection.. Apache-2
-
GitHub (👨💻 130 · 🔀 920 · 📋 1.7K - 14% open · ⏱️ 21.03.2025):
git clone https://github.com/unit8co/darts
-
PyPi (📥 78K / month · 📦 10 · ⏱️ 09.03.2025):
pip install u8darts
-
Conda (📥 74K · ⏱️ 25.03.2025):
conda install -c conda-forge u8darts-all
-
Docker Hub (📥 1.3K · ⏱️ 09.03.2025):
docker pull unit8/darts
pytorch-forecasting (🥈32 · ⭐ 4.2K) - Time series forecasting with PyTorch. MIT
STUMPY (🥈32 · ⭐ 3.9K) - STUMPY is a powerful and scalable Python library for modern time series.. BSD-3
NeuralForecast (🥈32 · ⭐ 3.4K) - Scalable and user friendly neural forecasting algorithms. Apache-2
pmdarima (🥈32 · ⭐ 1.6K) - A statistical library designed to fill the void in Pythons time series.. MIT
skforecast (🥈31 · ⭐ 1.3K) - Time series forecasting with machine learning models. BSD-3

NeuralProphet (🥉26 · ⭐ 4K · 💤) - NeuralProphet: A simple forecasting package. MIT

Auto TS (🥉18 · ⭐ 750 · 💤) - Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost.. Apache-2
Show 10 hidden projects...
- pyts (🥉28 · ⭐ 1.8K · 💀) - A Python package for time series classification.
BSD-3
- PyFlux (🥉25 · ⭐ 2.1K · 💀) - Open source time series library for Python.
BSD-3
- ADTK (🥉23 · ⭐ 1.1K · 💀) - A Python toolkit for rule-based/unsupervised anomaly detection in..
MPL-2.0
- luminol (🥉22 · ⭐ 1.2K · 💀) - Anomaly Detection and Correlation library.
Apache-2
- seglearn (🥉21 · ⭐ 570 · 💀) - Python module for machine learning time series:.
BSD-3
- tick (🥉21 · ⭐ 500 · 💀) - Module for statistical learning, with a particular emphasis on time-..
BSD-3
- matrixprofile-ts (🥉19 · ⭐ 740 · 💀) - A Python library for detecting patterns and anomalies..
Apache-2
- atspy (🥉14 · ⭐ 510 · 💀) - AtsPy: Automated Time Series Models in Python (by @firmai).
MIT
- tsaug (🥉14 · ⭐ 350 · 💀) - A Python package for time series augmentation.
Apache-2
- tslumen (🥉8 · ⭐ 69 · 💀) - A library for Time Series EDA (exploratory data analysis).
Apache-2
Libraries for processing and analyzing medical data such as MRIs, EEGs, genomic data, and other medical imaging formats.
MNE (🥇39 · ⭐ 2.9K) - MNE: Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python. BSD-3
DeepVariant (🥉24 · ⭐ 3.4K) - DeepVariant is an analysis pipeline that uses a deep neural.. BSD-3

Brainiak (🥉18 · ⭐ 350) - Brain Imaging Analysis Kit. Apache-2
-
GitHub (👨💻 35 · 🔀 140 · 📋 230 - 38% open · ⏱️ 06.01.2025):
git clone https://github.com/brainiak/brainiak
-
PyPi (📥 2.2K / month · ⏱️ 07.01.2025):
pip install brainiak
-
Docker Hub (📥 1.9K · ⭐ 1 · ⏱️ 07.01.2025):
docker pull brainiak/brainiak
Show 10 hidden projects...
- DIPY (🥈32 · ⭐ 750) - DIPY is the paragon 3D/4D+ medical imaging library in Python...
❗Unlicensed
- NiftyNet (🥉25 · ⭐ 1.4K · 💀) - [unmaintained] An open-source convolutional neural..
Apache-2
- NIPY (🥉24 · ⭐ 390) - Neuroimaging in Python FMRI analysis package.
❗Unlicensed
- MedPy (🥉23 · ⭐ 590 · 💤) - Medical image processing in Python.
❗️GPL-3.0
- DLTK (🥉20 · ⭐ 1.4K · 💀) - Deep Learning Toolkit for Medical Image Analysis.
Apache-2
- Glow (🥉20 · ⭐ 270) - An open-source toolkit for large-scale genomic analysis.
Apache-2
- MedicalTorch (🥉15 · ⭐ 860 · 💀) - A medical imaging framework for Pytorch.
Apache-2
- DeepNeuro (🥉15 · ⭐ 130 · 💀) - A deep learning python package for neuroimaging data. Made by:.
MIT
- Medical Detection Toolkit (🥉14 · ⭐ 1.3K · 💀) - The Medical Detection Toolkit contains 2D + 3D..
Apache-2
- MedicalNet (🥉12 · ⭐ 2K · 💀) - Many studies have shown that the performance on deep learning is..
MIT
Libraries for processing tabular and structured data.
pytorch_tabular (🥇24 · ⭐ 1.5K) - A standard framework for modelling Deep Learning Models.. MIT

miceforest (🥇24 · ⭐ 370 · 💤) - Multiple Imputation with LightGBM in Python. MIT
upgini (🥉22 · ⭐ 330) - Data search & enrichment library for Machine Learning Easily find and add.. BSD-3
Show 2 hidden projects...
- carefree-learn (🥉18 · ⭐ 410 · 💀) - Deep Learning PyTorch.
MIT
- deltapy (🥉13 · ⭐ 540 · 💀) - DeltaPy - Tabular Data Augmentation (by @firmai).
MIT
Libraries for optical character recognition (OCR) and text extraction from images or videos.
OCRmyPDF (🥇38 · ⭐ 26K) - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them.. MPL-2.0
EasyOCR (🥈34 · ⭐ 26K · 💤) - Ready-to-use OCR with 80+ supported languages and all popular.. Apache-2
Tesseract (🥈31 · ⭐ 6.1K) - Python-tesseract is an optical character recognition (OCR) tool.. Apache-2
Show 6 hidden projects...
- keras-ocr (🥉26 · ⭐ 1.4K · 💀) - A packaged and flexible version of the CRAFT text detector..
MIT
- calamari (🥉24 · ⭐ 1.1K) - Line based ATR Engine based on OCRopy.
❗️GPL-3.0
- pdftabextract (🥉21 · ⭐ 2.2K · 💀) - A set of tools for extracting tables from PDF files..
Apache-2
- attention-ocr (🥉21 · ⭐ 1.1K · 💀) - A Tensorflow model for text recognition (CNN + seq2seq..
MIT
- doc2text (🥉20 · ⭐ 1.3K · 💀) - Detect text blocks and OCR poorly scanned PDFs in bulk. Python..
MIT
- Mozart (🥉10 · ⭐ 650 · 💀) - An optical music recognition (OMR) system. Converts sheet..
Apache-2
General-purpose data containers & structures as well as utilities & extensions for pandas.
🔗 best-of-python - Data Containers ( ⭐ 3.9K) - Collection of data-container, dataframe, and pandas-..
Libraries for loading, collecting, and extracting data from a variety of data sources and formats.
🔗 best-of-python - Data Extraction ( ⭐ 3.9K) - Collection of data-loading and -extraction libraries.
Libraries for web scraping, crawling, downloading, and mining as well as libraries.
🔗 best-of-web-python - Web Scraping ( ⭐ 2.5K · 💤) - Collection of web-scraping and crawling libraries.
Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.
🔗 best-of-python - Data Pipelines ( ⭐ 3.9K) - Libraries for data batch- and stream-processing,..
Libraries that provide capabilities to distribute and parallelize machine learning tasks across large-scale compute infrastructure.
DeepSpeed (🥇41 · ⭐ 38K) - DeepSpeed is a deep learning optimization library that makes.. Apache-2

-
GitHub (👨💻 380 · 🔀 4.3K · 📦 12K · 📋 3K - 33% open · ⏱️ 02.04.2025):
git clone https://github.com/microsoft/DeepSpeed
-
PyPi (📥 650K / month · 📦 270 · ⏱️ 27.03.2025):
pip install deepspeed
-
Docker Hub (📥 22K · ⭐ 4 · ⏱️ 02.09.2022):
docker pull deepspeed/deepspeed
dask.distributed (🥇40 · ⭐ 1.6K · 📈) - A distributed task scheduler for Dask. BSD-3
H2O-3 (🥈35 · ⭐ 7.1K) - H2O is an Open Source, Distributed, Fast & Scalable Machine Learning.. Apache-2
ColossalAI (🥈32 · ⭐ 41K) - Making large AI models cheaper, faster and more accessible. Apache-2
-
GitHub (👨💻 200 · 🔀 4.5K · 📦 490 · 📋 1.8K - 26% open · ⏱️ 07.03.2025):
git clone https://github.com/hpcaitech/colossalai
horovod (🥈32 · ⭐ 14K · 📉) - Distributed training framework for TensorFlow, Keras, PyTorch,.. Apache-2
BigDL (🥈31 · ⭐ 7.7K) - Accelerate local LLM inference and finetuning (LLaMA, Mistral,.. Apache-2
-
GitHub (👨💻 120 · 🔀 1.3K · 📥 6.1K · 📋 2.8K - 39% open · ⏱️ 03.04.2025):
git clone https://github.com/intel-analytics/BigDL
-
PyPi (📥 18K / month · 📦 2 · ⏱️ 24.03.2024):
pip install bigdl
-
Maven (📦 5 · ⏱️ 20.04.2021):
<dependency> <groupId>com.intel.analytics.bigdl</groupId> <artifactId>bigdl-SPARK_2.4</artifactId> <version>[VERSION]</version> </dependency>
Apache Singa (🥉25 · ⭐ 3.4K) - a distributed deep learning platform. Apache-2
-
GitHub (👨💻 97 · 🔀 1.3K · 📦 5 · 📋 140 - 35% open · ⏱️ 26.03.2025):
git clone https://github.com/apache/singa
-
Conda (📥 970 · ⏱️ 25.03.2025):
conda install -c nusdbsystem singa
-
Docker Hub (📥 9K · ⭐ 4 · ⏱️ 31.05.2022):
docker pull apache/singa
analytics-zoo (🥉21 · ⭐ 2.6K) - Distributed Tensorflow, Keras and PyTorch on Apache.. Apache-2

Show 18 hidden projects...
- DEAP (🥈34 · ⭐ 6K) - Distributed Evolutionary Algorithms in Python.
❗️LGPL-3.0
- ipyparallel (🥈29 · ⭐ 2.6K) - IPython Parallel: Interactive Parallel Computing in..
❗Unlicensed
- petastorm (🥉28 · ⭐ 1.8K · 💀) - Petastorm library enables single machine or distributed..
Apache-2
- TensorFlowOnSpark (🥉27 · ⭐ 3.9K · 💀) - TensorFlowOnSpark brings TensorFlow programs to..
Apache-2
- Elephas (🥉25 · ⭐ 1.6K · 💀) - Distributed Deep learning with Keras & Spark.
MIT
keras
- BytePS (🥉22 · ⭐ 3.7K · 💀) - A high performance and generic framework for distributed DNN..
Apache-2
- Mesh (🥉22 · ⭐ 1.6K · 💀) - Mesh TensorFlow: Model Parallelism Made Easier.
Apache-2
- somoclu (🥉21 · ⭐ 270 · 💀) - Massively parallel self-organizing maps: accelerate training on..
MIT
- sk-dist (🥉20 · ⭐ 280 · 💀) - Distributed scikit-learn meta-estimators in PySpark.
Apache-2
- launchpad (🥉19 · ⭐ 320 · 💀) - Launchpad is a library that simplifies writing..
Apache-2
- mesh-transformer-jax (🥉18 · ⭐ 6.3K · 💀) - Model parallel transformers in JAX and Haiku.
Apache-2
- bluefog (🥉18 · ⭐ 290 · 💀) - Distributed and decentralized training framework for PyTorch..
Apache-2
- Fiber (🥉17 · ⭐ 1K · 💀) - Distributed Computing for AI Made Simple.
Apache-2
- parallelformers (🥉17 · ⭐ 780 · 💀) - Parallelformers: An Efficient Model Parallelization..
Apache-2
- TensorFrames (🥉15 · ⭐ 720 · 💀) - Tensorflow wrapper for DataFrames on Apache Spark.
Apache-2
- LazyCluster (🥉15 · ⭐ 49 · 💀) - Distributed machine learning made simple.
Apache-2
- autodist (🥉12 · ⭐ 130 · 💀) - Simple Distributed Deep Learning on TensorFlow.
Apache-2
- moolib (🥉11 · ⭐ 370 · 💀) - A library for distributed ML training with PyTorch.
MIT
Libraries for hyperparameter optimization, automl and neural architecture search.
AutoGluon (🥇36 · ⭐ 8.6K) - Fast and Accurate ML in 3 Lines of Code. Apache-2


-
GitHub (👨💻 130 · 🔀 990 · 📦 1K · 📋 1.6K - 25% open · ⏱️ 03.04.2025):
git clone https://github.com/autogluon/autogluon
-
PyPi (📥 180K / month · 📦 31 · ⏱️ 03.04.2025):
pip install autogluon
-
Conda (📥 32K · ⏱️ 25.03.2025):
conda install -c conda-forge autogluon
-
Docker Hub (📥 15K · ⭐ 18 · ⏱️ 07.03.2024):
docker pull autogluon/autogluon
Bayesian Optimization (🥇33 · ⭐ 8.2K) - A Python implementation of global optimization with.. MIT
featuretools (🥈32 · ⭐ 7.4K) - An open source python library for automated feature engineering. BSD-3
Keras Tuner (🥈31 · ⭐ 2.9K · 💤) - A Hyperparameter Tuning Library for Keras. Apache-2

mljar-supervised (🥈29 · ⭐ 3.1K) - Python package for AutoML on Tabular Data with Feature.. MIT
lazypredict (🥈28 · ⭐ 3.1K) - Lazy Predict help build a lot of basic models without much code.. MIT

featurewiz (🥉23 · ⭐ 630) - Use advanced feature engineering strategies and select best.. Apache-2
Hyperactive (🥉23 · ⭐ 520) - An optimization and data collection toolbox for convenient and fast.. MIT
Auto ViML (🥉21 · ⭐ 540) - Automatically Build Multiple ML Models with a Single Line of Code... Apache-2
opytimizer (🥉18 · ⭐ 610 · 💤) - Opytimizer is a Python library consisting of meta-heuristic.. Apache-2
Show 33 hidden projects...
- TPOT (🥇33 · ⭐ 9.9K) - A Python Automated Machine Learning tool that optimizes machine..
❗️LGPL-3.0
- scikit-optimize (🥇33 · ⭐ 2.8K · 💀) - Sequential model-based optimization with a..
BSD-3
- NNI (🥈31 · ⭐ 14K · 💀) - An open source AutoML toolkit for automate machine learning lifecycle,..
MIT
- auto-sklearn (🥈31 · ⭐ 7.8K · 💀) - Automated Machine Learning with scikit-learn.
BSD-3
- SMAC3 (🥈28 · ⭐ 1.1K) - SMAC3: A Versatile Bayesian Optimization Package for..
❗️BSD-1-Clause
- Hyperas (🥈27 · ⭐ 2.2K · 💀) - Keras + Hyperopt: A very simple wrapper for convenient..
MIT
- GPyOpt (🥈25 · ⭐ 940 · 💀) - Gaussian Process Optimization using GPy.
BSD-3
- lightwood (🥈25 · ⭐ 460) - Lightwood is Legos for Machine Learning.
❗️GPL-3.0
- AdaNet (🥉24 · ⭐ 3.5K · 💀) - Fast and flexible AutoML with learning guarantees.
Apache-2
- auto_ml (🥉24 · ⭐ 1.6K · 💀) - [UNMAINTAINED] Automated machine learning for analytics & production.
MIT
- HpBandSter (🥉23 · ⭐ 620 · 💀) - a distributed Hyperband implementation on Steroids.
BSD-3
- Orion (🥉23 · ⭐ 290 · 💀) - Asynchronous Distributed Hyperparameter Optimization.
BSD-3
- igel (🥉21 · ⭐ 3.1K · 💀) - a delightful machine learning tool that allows you to train, test, and..
MIT
- MLBox (🥉21 · ⭐ 1.5K · 💀) - MLBox is a powerful Automated Machine Learning python library.
❗️BSD-1-Clause
- Test Tube (🥉21 · ⭐ 740 · 💀) - Python library to easily log experiments and parallelize..
MIT
- Neuraxle (🥉21 · ⭐ 610 · 💀) - The worlds cleanest AutoML library - Do hyperparameter tuning..
Apache-2
- sklearn-deap (🥉20 · ⭐ 770 · 💀) - Use evolutionary algorithms instead of gridsearch in..
MIT
- optunity (🥉20 · ⭐ 420 · 💀) - optimization routines for hyperparameter tuning.
BSD-3
- Dragonfly (🥉19 · ⭐ 880 · 💀) - An open source python library for scalable Bayesian optimisation.
MIT
- Auto Tune Models (🥉19 · ⭐ 530 · 💀) - Auto Tune Models - A multi-tenant, multi-data system for..
MIT
- Sherpa (🥉19 · ⭐ 340 · 💀) - Hyperparameter optimization that enables researchers to..
❗️GPL-3.0
- Advisor (🥉18 · ⭐ 1.6K · 💀) - Open-source implementation of Google Vizier for hyper parameters..
Apache-2
- Xcessiv (🥉18 · ⭐ 1.3K · 💀) - A web-based application for quick, scalable, and automated..
Apache-2
- shap-hypetune (🥉18 · ⭐ 580 · 💀) - A python package for simultaneous Hyperparameters Tuning and..
MIT
- HyperparameterHunter (🥉17 · ⭐ 700 · 💀) - Easy hyperparameter optimization and automatic result..
MIT
- automl-gs (🥉16 · ⭐ 1.9K · 💀) - Provide an input CSV and a target field to predict, generate a..
MIT
- Parfit (🥉15 · ⭐ 200 · 💀) - A package for parallelizing the fit and flexibly scoring of..
MIT
- ENAS (🥉13 · ⭐ 2.7K · 💀) - PyTorch implementation of Efficient Neural Architecture Search via..
Apache-2
- Auptimizer (🥉13 · ⭐ 200 · 💀) - An automatic ML model optimization tool.
❗️GPL-3.0
- Hypermax (🥉13 · ⭐ 110 · 💀) - Better, faster hyper-parameter optimization.
BSD-3
- model_search (🥉11 · ⭐ 3.3K · 💀) - AutoML algorithms for model architecture search at scale.
Apache-2
- Devol (🥉11 · ⭐ 950 · 💀) - Genetic neural architecture search with Keras.
MIT
- Hypertunity (🥉10 · ⭐ 140 · 💀) - A toolset for black-box hyperparameter optimisation.
Apache-2
Libraries for building and evaluating reinforcement learning & agent-based systems.
TensorForce (🥈26 · ⭐ 3.3K · 💤) - Tensorforce: a TensorFlow library for applied.. Apache-2

PARL (🥉24 · ⭐ 3.3K) - A high-performance distributed training framework for Reinforcement.. Apache-2

rliable (🥉14 · ⭐ 820 · 💤) - [NeurIPS21 Outstanding Paper] Library for reliable evaluation on.. Apache-2
Show 12 hidden projects...
- OpenAI Gym (🥇41 · ⭐ 36K · 💀) - A toolkit for developing and comparing reinforcement learning..
MIT
- baselines (🥈28 · ⭐ 16K · 💀) - OpenAI Baselines: high-quality implementations of reinforcement..
MIT
- TensorLayer (🥈27 · ⭐ 7.3K · 💀) - Deep Learning and Reinforcement Learning Library for..
Apache-2
- keras-rl (🥈27 · ⭐ 5.5K · 💀) - Deep Reinforcement Learning for Keras.
MIT
- garage (🥉25 · ⭐ 2K · 💀) - A toolkit for reproducible reinforcement learning research.
MIT
- Stable Baselines (🥉24 · ⭐ 4.2K · 💀) - A fork of OpenAI Baselines, implementations of..
MIT
- ChainerRL (🥉24 · ⭐ 1.2K · 💀) - ChainerRL is a deep reinforcement learning library built on top of..
MIT
- TRFL (🥉22 · ⭐ 3.1K · 💀) - TensorFlow Reinforcement Learning.
Apache-2
- Coach (🥉21 · ⭐ 2.3K · 💀) - Reinforcement Learning Coach by Intel AI Lab enables easy..
Apache-2
- SerpentAI (🥉19 · ⭐ 6.9K · 💀) - Game Agent Framework. Helping you create AIs / Bots that learn to..
MIT
- DeepMind Lab (🥉17 · ⭐ 7.2K · 💀) - A customisable 3D platform for agent-based AI research.
❗Unlicensed
- Maze (🥉13 · ⭐ 280 · 💀) - Maze Applied Reinforcement Learning Framework.
❗️Custom
Libraries for building and evaluating recommendation systems.
Recommenders (🥇33 · ⭐ 20K) - Best Practices on Recommendation Systems. MIT
scikit-surprise (🥈28 · ⭐ 6.6K · 💤) - A Python scikit for building and analyzing recommender.. BSD-3
TF Recommenders (🥉23 · ⭐ 1.9K) - TensorFlow Recommenders is a library for building.. Apache-2

Show 11 hidden projects...
- implicit (🥈30 · ⭐ 3.6K · 💀) - Fast Python Collaborative Filtering for Implicit Feedback Datasets.
MIT
- lightfm (🥈29 · ⭐ 4.9K · 💀) - A Python implementation of LightFM, a hybrid recommendation..
Apache-2
- lkpy (🥈28 · ⭐ 290) - Python recommendation toolkit.
MIT
- TF Ranking (🥉26 · ⭐ 2.8K · 💀) - Learning to Rank in TensorFlow.
Apache-2
- tensorrec (🥉21 · ⭐ 1.3K · 💀) - A TensorFlow recommendation algorithm and framework in..
Apache-2
- fastFM (🥉21 · ⭐ 1.1K · 💀) - fastFM: A Library for Factorization Machines.
BSD-3
- Spotlight (🥉19 · ⭐ 3K · 💀) - Deep recommender models using PyTorch.
MIT
- recmetrics (🥉19 · ⭐ 580 · 💀) - A library of metrics for evaluating recommender systems.
MIT
- Case Recommender (🥉18 · ⭐ 500 · 💀) - Case Recommender: A Flexible and Extensible Python..
MIT
- OpenRec (🥉17 · ⭐ 420 · 💀) - OpenRec is an open-source and modular library for neural network-..
Apache-2
- Collie (🥉17 · ⭐ 110 · 💀) - A library for preparing, training, and evaluating scalable deep..
BSD-3
Libraries for encrypted and privacy-preserving machine learning using methods like federated learning & differential privacy.
TensorFlow Privacy (🥈25 · ⭐ 2K) - Library for training machine learning models with.. Apache-2

TFEncrypted (🥉24 · ⭐ 1.2K · 💤) - A Framework for Encrypted Machine Learning in.. Apache-2

Show 1 hidden projects...
- PipelineDP (🥉19 · ⭐ 280) - PipelineDP is a Python framework for applying differentially..
Apache-2
Libraries to organize, track, and visualize machine learning experiments.
Tensorboard (🥇43 · ⭐ 6.8K) - TensorFlows Visualization Toolkit. Apache-2

wandb client (🥇42 · ⭐ 9.7K) - The AI developer platform. Use Weights & Biases to train and fine-.. MIT
SageMaker SDK (🥈40 · ⭐ 2.1K) - A library for training and deploying machine learning.. Apache-2


tensorboardX (🥈33 · ⭐ 7.9K) - tensorboard for pytorch (and chainer, mxnet, numpy, ...). MIT
ClearML (🥈33 · ⭐ 5.9K) - ClearML - Auto-Magical CI/CD to streamline your AI workload... Apache-2
-
GitHub (👨💻 100 · 🔀 670 · 📥 3.2K · 📦 1.7K · 📋 1.1K - 43% open · ⏱️ 02.04.2025):
git clone https://github.com/allegroai/clearml
-
PyPi (📥 380K / month · 📦 53 · ⏱️ 09.03.2025):
pip install clearml
-
Docker Hub (📥 30K · ⏱️ 05.10.2020):
docker pull allegroai/trains
AzureML SDK (🥈30 · ⭐ 4.2K) - Python notebooks with ML and deep learning examples with Azure.. MIT
Neptune.ai (🥈28 · ⭐ 610) - The experiment tracker for foundation model training. Apache-2
livelossplot (🥉27 · ⭐ 1.3K) - Live training loss plot in Jupyter Notebook for Keras,.. MIT

Labml (🥉26 · ⭐ 2.1K) - Monitor deep learning model training and hardware usage from your mobile.. MIT
ml-metadata (🥉26 · ⭐ 640) - For recording and retrieving metadata associated with ML.. Apache-2
gokart (🥉25 · ⭐ 320) - Gokart solves reproducibility, task dependencies, constraints of good code,.. MIT
Show 16 hidden projects...
- Catalyst (🥈28 · ⭐ 3.3K · 💀) - Accelerated deep learning R&D.
Apache-2
- knockknock (🥉26 · ⭐ 2.8K · 💀) - Knock Knock: Get notified when your training ends with only two..
MIT
- SKLL (🥉24 · ⭐ 550) - SciKit-Learn Laboratory (SKLL) makes it easy to run machine..
❗Unlicensed
- Guild AI (🥉23 · ⭐ 880 · 💀) - Experiment tracking, ML developer tools.
Apache-2
- hiddenlayer (🥉22 · ⭐ 1.8K · 💀) - Neural network graphs and training metrics for..
MIT
- Studio.ml (🥉22 · ⭐ 380 · 💀) - Studio: Simplify and expedite model building process.
Apache-2
- TensorWatch (🥉21 · ⭐ 3.4K · 💀) - Debugging, monitoring and visualization for Python Machine..
MIT
- lore (🥉21 · ⭐ 1.6K · 💀) - Lore makes machine learning approachable for Software Engineers and..
MIT
- TensorBoard Logger (🥉21 · ⭐ 630 · 💀) - Log TensorBoard events without touching TensorFlow.
MIT
- MXBoard (🥉20 · ⭐ 320 · 💀) - Logging MXNet data for visualization in TensorBoard.
Apache-2
- datmo (🥉18 · ⭐ 340 · 💀) - Open source production model management tool for data scientists.
MIT
- chitra (🥉17 · ⭐ 230 · 💤) - A multi-functional library for full-stack Deep Learning...
Apache-2
- steppy (🥉17 · ⭐ 130 · 💀) - Lightweight, Python library for fast and reproducible experimentation.
MIT
- caliban (🥉16 · ⭐ 500 · 💀) - Research workflows made easy, locally and in the Cloud.
Apache-2
- ModelChimp (🥉13 · ⭐ 130 · 💀) - Experiment tracking for machine and deep learning projects.
BSD-2
- traintool (🥉9 · ⭐ 12 · 💀) - Train off-the-shelf machine learning models in one..
Apache-2
Libraries to serialize models to files, convert between a variety of model formats, and optimize models for deployment.
huggingface_hub (🥈38 · ⭐ 2.5K) - The official Python client for the Huggingface Hub. Apache-2
BentoML (🥈35 · ⭐ 7.6K) - The easiest way to serve AI apps and models - Build Model Inference.. Apache-2
Core ML Tools (🥈35 · ⭐ 4.6K) - Core ML tools contain supporting tools for Core ML model.. BSD-3
TorchServe (🥈33 · ⭐ 4.3K) - Serve, optimize and scale PyTorch models in production. Apache-2

-
GitHub (👨💻 220 · 🔀 880 · 📥 7.7K · 📦 850 · 📋 1.7K - 25% open · ⏱️ 17.03.2025):
git clone https://github.com/pytorch/serve
-
PyPi (📥 93K / month · 📦 24 · ⏱️ 30.09.2024):
pip install torchserve
-
Conda (📥 490K · ⏱️ 25.03.2025):
conda install -c pytorch torchserve
-
Docker Hub (📥 1.4M · ⭐ 32 · ⏱️ 30.09.2024):
docker pull pytorch/torchserve
Hummingbird (🥉24 · ⭐ 3.4K) - Hummingbird compiles trained ML models into tensor computation for.. MIT
Show 10 hidden projects...
- mmdnn (🥈26 · ⭐ 5.8K · 💀) - MMdnn is a set of tools to help users inter-operate among different deep..
MIT
- m2cgen (🥉25 · ⭐ 2.9K · 💀) - Transform ML models into a native code (Java, C, Python, Go,..
MIT
- sklearn-porter (🥉24 · ⭐ 1.3K · 💀) - Transpile trained scikit-learn estimators to C, Java,..
BSD-3
- cortex (🥉22 · ⭐ 8K · 💀) - Production infrastructure for machine learning at scale.
Apache-2
- OMLT (🥉20 · ⭐ 300) - Represent trained machine learning models as Pyomo optimization..
❗Unlicensed
- Larq Compute Engine (🥉20 · ⭐ 250) - Highly optimized inference engine for Binarized..
Apache-2
- pytorch2keras (🥉19 · ⭐ 860 · 💀) - PyTorch to Keras model convertor.
MIT
- modelkit (🥉19 · ⭐ 160 · 💤) - Toolkit for developing and maintaining ML models.
MIT
- backprop (🥉15 · ⭐ 240 · 💀) - Backprop makes it simple to use, finetune, and deploy state-of-..
Apache-2
- ml-ane-transformers (🥉13 · ⭐ 2.6K · 💀) - Reference implementation of the Transformer..
❗Unlicensed
Libraries to visualize, explain, debug, evaluate, and interpret machine learning models.
shap (🥇42 · ⭐ 24K) - A game theoretic approach to explain the output of any machine learning model. MIT
InterpretML (🥇34 · ⭐ 6.4K) - Fit interpretable models. Explain blackbox machine learning. MIT

explainerdashboard (🥈31 · ⭐ 2.4K) - Quickly build Explainable AI dashboards that show the inner.. MIT
evaluate (🥈30 · ⭐ 2.2K) - Evaluate: A library for easily evaluating machine learning models.. Apache-2
LIT (🥈28 · ⭐ 3.5K) - The Learning Interpretability Tool: Interactively analyze ML models to.. Apache-2
dtreeviz (🥈28 · ⭐ 3K) - A python library for decision tree visualization and model interpretation. MIT
Model Analysis (🥈28 · ⭐ 1.3K) - Model analysis tools for TensorFlow. Apache-2


Fairness 360 (🥈27 · ⭐ 2.6K) - A comprehensive set of fairness metrics for datasets and.. Apache-2
Explainability 360 (🥉25 · ⭐ 1.7K) - Interpretability and explainability of data and.. Apache-2
responsible-ai-widgets (🥉25 · ⭐ 1.5K) - Responsible AI Toolbox is a suite of tools providing.. MIT



random-forest-importances (🥉22 · ⭐ 610) - Code to compute permutation and drop-column.. MIT

fairness-indicators (🥉19 · ⭐ 350) - Tensorflows Fairness Evaluation and Visualization.. Apache-2


ExplainX.ai (🥉16 · ⭐ 430 · 💤) - Explainable AI framework for data scientists. Explain & debug any.. MIT
Show 29 hidden projects...
- Lime (🥇33 · ⭐ 12K · 💀) - Lime: Explaining the predictions of any machine learning classifier.
BSD-2
- yellowbrick (🥈28 · ⭐ 4.3K · 💀) - Visual analysis and diagnostic tools to facilitate..
Apache-2
- Deep Checks (🥈28 · ⭐ 3.8K) - Deepchecks: Tests for Continuous Validation of ML Models &..
❗️AGPL-3.0
- eli5 (🥈28 · ⭐ 2.8K · 💀) - A library for debugging/inspecting machine learning classifiers and..
MIT
- scikit-plot (🥈28 · ⭐ 2.4K · 💀) - An intuitive library to add plotting functionality to..
MIT
- DALEX (🥈27 · ⭐ 1.4K) - moDel Agnostic Language for Exploration and eXplanation.
❗️GPL-3.0
- Alibi (🥈26 · ⭐ 2.5K) - Algorithms for explaining machine learning models.
❗️Intel
- iNNvestigate (🥈26 · ⭐ 1.3K · 💀) - A toolbox to iNNvestigate neural networks predictions!.
BSD-2
- Lucid (🥉25 · ⭐ 4.7K · 💀) - A collection of infrastructure and tools for research in..
Apache-2
- keras-vis (🥉25 · ⭐ 3K · 💀) - Neural network visualization toolkit for keras.
MIT
- checklist (🥉25 · ⭐ 2K · 💀) - Beyond Accuracy: Behavioral Testing of NLP models with CheckList.
MIT
- CausalNex (🥉24 · ⭐ 2.3K · 💀) - A Python library that helps data scientists to infer..
Apache-2
- What-If Tool (🥉23 · ⭐ 940 · 💀) - Source code/webpage/demos for the What-If Tool.
Apache-2
- TreeInterpreter (🥉23 · ⭐ 760 · 💀) - Package for interpreting scikit-learns decision tree..
BSD-3
- tf-explain (🥉22 · ⭐ 1K · 💀) - Interpretability Methods for tf.keras models with Tensorflow..
MIT
- Quantus (🥉22 · ⭐ 600) - Quantus is an eXplainable AI toolkit for responsible evaluation of..
❗️GPL-3.0
- deeplift (🥉21 · ⭐ 850 · 💀) - Public facing deeplift repo.
MIT
- tcav (🥉20 · ⭐ 640 · 💀) - Code for the TCAV ML interpretability project.
Apache-2
- XAI (🥉19 · ⭐ 1.2K · 💀) - XAI - An eXplainability toolbox for machine learning.
MIT
- model-card-toolkit (🥉18 · ⭐ 430 · 💀) - A toolkit that streamlines and automates the..
Apache-2
- sklearn-evaluation (🥉17 · ⭐ 460 · 💀) - Machine learning model evaluation made easy: plots,..
MIT
- effector (🥉17 · ⭐ 82) - Effector - a Python package for global and regional effect methods.
MIT
- Anchor (🥉16 · ⭐ 800 · 💀) - Code for High-Precision Model-Agnostic Explanations paper.
BSD-2
- FlashTorch (🥉16 · ⭐ 740 · 💀) - Visualization toolkit for neural networks in PyTorch! Demo --.
MIT
- Skater (🥉15 · ⭐ 1.1K) - Python Library for Model Interpretation/Explanations.
❗️UPL-1.0
- interpret-text (🥉15 · ⭐ 420 · 💀) - A library that incorporates state-of-the-art explainers..
MIT
- Attribution Priors (🥉13 · ⭐ 120 · 💀) - Tools for training explainable models using..
MIT
- contextual-ai (🥉13 · ⭐ 87 · 💀) - Contextual AI adds explainability to different stages of..
Apache-2
- bias-detector (🥉13 · ⭐ 43 · 💀) - Bias Detector is a python package for detecting bias in machine..
MIT
Libraries for Approximate Nearest Neighbor Search and Vector Indexing/Similarity Search.
🔗 ANN Benchmarks ( ⭐ 5.2K) - Benchmarks of approximate nearest neighbor libraries in Python.
Milvus (🥇42 · ⭐ 34K) - Milvus is a high-performance, cloud-native vector database built for.. Apache-2
-
GitHub (👨💻 310 · 🔀 3.1K · 📥 330K · 📋 13K - 6% open · ⏱️ 03.04.2025):
git clone https://github.com/milvus-io/milvus
-
PyPi (📥 1.8M / month · 📦 240 · ⏱️ 21.03.2025):
pip install pymilvus
-
Docker Hub (📥 68M · ⭐ 78 · ⏱️ 03.04.2025):
docker pull milvusdb/milvus
Faiss (🥇41 · ⭐ 34K) - A library for efficient similarity search and clustering of dense vectors. MIT
Annoy (🥈35 · ⭐ 14K · 💤) - Approximate Nearest Neighbors in C++/Python optimized for memory.. Apache-2
NMSLIB (🥈31 · ⭐ 3.5K · 💤) - Non-Metric Space Library (NMSLIB): An efficient similarity search.. Apache-2
USearch (🥈31 · ⭐ 2.6K) - Fast Open-Source Search & Clustering engine for Vectors & Strings in.. Apache-2
-
GitHub (👨💻 63 · 🔀 160 · 📥 60K · 📦 170 · 📋 200 - 40% open · ⏱️ 07.02.2025):
git clone https://github.com/unum-cloud/usearch
-
PyPi (📥 500K / month · 📦 27 · ⏱️ 29.12.2024):
pip install usearch
-
npm (📥 13K / month · 📦 15 · ⏱️ 23.01.2025):
npm install usearch
-
Docker Hub (📥 190 · ⭐ 1 · ⏱️ 21.11.2024):
docker pull unum/usearch
PyNNDescent (🥉28 · ⭐ 920) - A Python nearest neighbor descent for approximate nearest neighbors. BSD-2
Show 4 hidden projects...
- NearPy (🥉21 · ⭐ 770 · 💀) - Python framework for fast (approximated) nearest neighbour search in..
MIT
- Magnitude (🥉20 · ⭐ 1.6K · 💀) - A fast, efficient universal vector embedding utility package.
MIT
- N2 (🥉20 · ⭐ 580 · 💀) - TOROS N2 - lightweight approximate Nearest Neighbor library which runs..
Apache-2
- PySparNN (🥉11 · ⭐ 920 · 💀) - Approximate Nearest Neighbor Search for Sparse Data in Python!.
BSD-3
Libraries providing capabilities for probabilistic programming/reasoning, bayesian inference, gaussian processes, or statistics.
tensorflow-probability (🥇37 · ⭐ 4.3K) - Probabilistic reasoning and statistical analysis in.. Apache-2

SALib (🥉28 · ⭐ 920) - Sensitivity Analysis Library in Python. Contains Sobol, Morris, FAST, and.. MIT
pomegranate (🥉27 · ⭐ 3.4K) - Fast, flexible and easy to use probabilistic modelling in Python. MIT
scikit-posthocs (🥉27 · ⭐ 360 · 📈) - Multiple Pairwise Comparisons (Post Hoc) Tests in.. MIT

TorchUncertainty (🥉23 · ⭐ 370) - Open-source framework for uncertainty and deep.. Apache-2

Baal (🥉22 · ⭐ 890 · 💤) - Bayesian active learning library for research and industrial usecases. Apache-2
Show 5 hidden projects...
- filterpy (🥈32 · ⭐ 3.5K · 💀) - Python Kalman filtering and optimal estimation library. Implements..
MIT
- pingouin (🥈30 · ⭐ 1.7K) - Statistical package in Python based on Pandas.
❗️GPL-3.0
- Edward (🥉27 · ⭐ 4.8K · 💀) - A probabilistic programming language in TensorFlow. Deep..
Apache-2
- Funsor (🥉20 · ⭐ 240 · 💀) - Functional tensors for probabilistic programming.
Apache-2
- ZhuSuan (🥉15 · ⭐ 2.2K · 💀) - A probabilistic programming library for Bayesian deep learning,..
MIT
Libraries for testing the robustness of machine learning models against attacks with adversarial/malicious examples.
ART (🥇34 · ⭐ 5.2K) - Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning.. MIT
-
GitHub (👨💻 140 · 🔀 1.2K · 📦 690 · 📋 900 - 2% open · ⏱️ 28.02.2025):
git clone https://github.com/Trusted-AI/adversarial-robustness-toolbox
-
PyPi (📥 26K / month · 📦 20 · ⏱️ 22.01.2025):
pip install adversarial-robustness-toolbox
-
Conda (📥 68K · ⏱️ 25.03.2025):
conda install -c conda-forge adversarial-robustness-toolbox
TextAttack (🥈28 · ⭐ 3.1K · 💤) - TextAttack is a Python framework for adversarial attacks, data.. MIT
Show 7 hidden projects...
- CleverHans (🥈30 · ⭐ 6.3K · 💀) - An adversarial example library for constructing attacks,..
MIT
- Foolbox (🥈28 · ⭐ 2.8K · 💀) - A Python toolbox to create adversarial examples that fool neural..
MIT
- advertorch (🥉23 · ⭐ 1.3K · 💀) - A Toolbox for Adversarial Robustness Research.
❗️GPL-3.0
- robustness (🥉21 · ⭐ 930 · 💀) - A library for experimenting with, training and evaluating neural..
MIT
- AdvBox (🥉19 · ⭐ 1.4K · 💀) - Advbox is a toolbox to generate adversarial examples that fool..
Apache-2
- textflint (🥉17 · ⭐ 640 · 💀) - Unified Multilingual Robustness Evaluation Toolkit for..
❗️GPL-3.0
- Adversary (🥉16 · ⭐ 400 · 💀) - Tool to generate adversarial text examples and test machine..
MIT
Libraries that require and make use of CUDA/GPU or other accelerator hardware capabilities to optimize machine learning tasks.
optimum (🥇35 · ⭐ 2.8K) - Accelerate inference and training of Transformers, Diffusers, TIMM.. Apache-2
CuPy (🥉27 · ⭐ 10K) - NumPy & SciPy for GPU. MIT
-
GitHub (👨💻 340 · 🔀 890):
git clone https://github.com/cupy/cupy
-
PyPi (📥 40K / month · 📦 340 · ⏱️ 21.03.2025):
pip install cupy
-
Conda (📥 6.1M · ⏱️ 25.03.2025):
conda install -c conda-forge cupy
-
Docker Hub (📥 79K · ⭐ 13 · ⏱️ 21.03.2025):
docker pull cupy/cupy
DALI (🥉25 · ⭐ 5.3K) - A GPU-accelerated library containing highly optimized building blocks.. Apache-2
-
GitHub (👨💻 96 · 🔀 630 · 📋 1.7K - 14% open · ⏱️ 02.04.2025):
git clone https://github.com/NVIDIA/DALI
Vulkan Kompute (🥉23 · ⭐ 2.2K) - General purpose GPU compute framework built on Vulkan to.. Apache-2
Merlin (🥉20 · ⭐ 810 · 💤) - NVIDIA Merlin is an open source library providing end-to-end GPU-.. Apache-2
Show 9 hidden projects...
- gpustat (🥈30 · ⭐ 4.2K · 💀) - A simple command-line utility for querying and monitoring GPU status.
MIT
- GPUtil (🥉25 · ⭐ 1.2K · 💀) - A Python module for getting the GPU status from NVIDA GPUs using..
MIT
- scikit-cuda (🥉24 · ⭐ 990 · 💀) - Python interface to GPU-powered libraries.
BSD-3
- py3nvml (🥉23 · ⭐ 240 · 💀) - Python 3 Bindings for NVML library. Get NVIDIA GPU status inside..
BSD-3
- BlazingSQL (🥉21 · ⭐ 2K · 💀) - BlazingSQL is a lightweight, GPU accelerated, SQL engine for..
Apache-2
- nvidia-ml-py3 (🥉19 · ⭐ 140 · 💤) - Python 3 Bindings for the NVIDIA Management Library.
BSD-3
- cuSignal (🥉16 · ⭐ 720 · 💀) - GPU accelerated signal processing.
Apache-2
- SpeedTorch (🥉16 · ⭐ 680 · 💀) - Library for faster pinned CPU - GPU transfer in Pytorch.
MIT
- ipyexperiments (🥉16 · ⭐ 220 · 💀) - Automatic GPU+CPU memory profiling, re-use and memory..
Apache-2
Libraries that extend TensorFlow with additional capabilities.
TensorFlow Datasets (🥇39 · ⭐ 4.4K) - TFDS is a collection of datasets ready to use with.. Apache-2

tensorflow-hub (🥈32 · ⭐ 3.5K) - A library for transfer learning by reusing parts of.. Apache-2

TensorFlow I/O (🥈29 · ⭐ 720 · 💤) - Dataset, streaming, and file system extensions.. Apache-2

TF Model Optimization (🥉28 · ⭐ 1.5K) - A toolkit to optimize ML models for deployment for.. Apache-2

TensorFlow Transform (🥉26 · ⭐ 990) - Input pipeline framework. Apache-2

Neural Structured Learning (🥉23 · ⭐ 990) - Training neural models with structured signals. Apache-2

TensorFlow Cloud (🥉20 · ⭐ 380) - The TensorFlow Cloud repository provides APIs that.. Apache-2

TF Compression (🥉19 · ⭐ 880) - Data compression in TensorFlow. Apache-2

Show 6 hidden projects...
- tensor2tensor (🥇33 · ⭐ 16K · 💀) - Library of deep learning models and datasets designed..
Apache-2
- Keras-Preprocessing (🥈29 · ⭐ 1K · 💀) - Utilities for working with image data, text data, and..
MIT
- efficientnet (🥉27 · ⭐ 2.1K · 💀) - Implementation of EfficientNet model. Keras and..
Apache-2
- Saliency (🥉22 · ⭐ 970 · 💀) - Framework-agnostic implementation for state-of-the-art..
Apache-2
- TensorNets (🥉21 · ⭐ 1K · 💀) - High level network definitions with pre-trained weights in..
MIT
- tffm (🥉18 · ⭐ 780 · 💀) - TensorFlow implementation of an arbitrary order Factorization Machine.
MIT
Libraries that extend Jax with additional capabilities.
Show 1 hidden projects...
- jaxdf (🥉11 · ⭐ 120 · 💤) - A JAX-based research framework for writing differentiable..
❗️LGPL-3.0
Libraries that extend scikit-learn with additional capabilities.
scikit-learn-intelex (🥇35 · ⭐ 1.3K) - Extension for Scikit-learn is a seamless way to speed.. Apache-2

imbalanced-learn (🥈33 · ⭐ 7K) - A Python Package to Tackle the Curse of Imbalanced.. MIT

category_encoders (🥈33 · ⭐ 2.4K) - A library of sklearn compatible categorical variable.. BSD-3

scikit-lego (🥉26 · ⭐ 1.3K) - Extra blocks for scikit-learn pipelines. MIT

scikit-opt (🥉25 · ⭐ 5.5K · 💤) - Genetic Algorithm, Particle Swarm Optimization, Simulated.. MIT

iterative-stratification (🥉21 · ⭐ 860) - scikit-learn cross validators for iterative.. BSD-3

scikit-tda (🥉15 · ⭐ 540 · 💤) - Topological Data Analysis for Python. MIT

Show 9 hidden projects...
- scikit-survival (🥈32 · ⭐ 1.2K) - Survival analysis built on top of scikit-learn.
❗️GPL-3.0
- fancyimpute (🥈27 · ⭐ 1.3K · 💀) - Multivariate imputation and matrix completion..
Apache-2
- scikit-multilearn (🥈27 · ⭐ 930 · 💀) - A scikit-learn based module for multi-label et. al...
BSD-2
- sklearn-crfsuite (🥈27 · ⭐ 430 · 💀) - scikit-learn inspired API for CRFsuite.
MIT
- sklearn-contrib-lightning (🥉24 · ⭐ 1.7K · 💀) - Large-scale linear classification, regression and..
BSD-3
- skope-rules (🥉22 · ⭐ 630 · 💀) - machine learning with logical rules in Python.
❗️BSD-1-Clause
- combo (🥉21 · ⭐ 650 · 💀) - (AAAI 20) A Python Toolbox for Machine Learning Model..
BSD-2
xgboost
- celer (🥉20 · ⭐ 220 · 💤) - Fast solver for L1-type problems: Lasso, sparse Logisitic..
BSD-3
- skggm (🥉18 · ⭐ 250 · 💀) - Scikit-learn compatible estimation of general graphical models.
MIT
Libraries that extend Pytorch with additional capabilities.
accelerate (🥇42 · ⭐ 8.6K) - A simple way to launch, train, and use PyTorch models on.. Apache-2

tinygrad (🥇34 · ⭐ 29K) - You like pytorch? You like micrograd? You love tinygrad!. MIT

-
GitHub (👨💻 380 · 🔀 3.2K · 📦 190 · 📋 890 - 14% open · ⏱️ 03.04.2025):
git clone https://github.com/geohot/tinygrad
PML (🥇33 · ⭐ 6.1K) - The easiest way to use deep metric learning in your application. Modular,.. MIT

-
GitHub (👨💻 46 · 🔀 660 · 📦 2.5K · 📋 530 - 14% open · ⏱️ 11.12.2024):
git clone https://github.com/KevinMusgrave/pytorch-metric-learning
-
PyPi (📥 760K / month · 📦 55 · ⏱️ 11.12.2024):
pip install pytorch-metric-learning
-
Conda (📥 13K · ⏱️ 25.03.2025):
conda install -c metric-learning pytorch-metric-learning
torchdiffeq (🥇32 · ⭐ 5.9K) - Differentiable ODE solvers with full GPU support and.. MIT

torch-scatter (🥈26 · ⭐ 1.6K) - PyTorch Extension Library of Optimized Scatter Operations. MIT

Pytorch Toolbelt (🥈26 · ⭐ 1.5K) - PyTorch extensions for fast R&D prototyping and Kaggle.. MIT

EfficientNets (🥉25 · ⭐ 1.6K · 💤) - Pretrained EfficientNet, EfficientNet-Lite, MixNet,.. Apache-2

PyTorch Sparse (🥉24 · ⭐ 1.1K) - PyTorch Extension Library of Optimized Autograd Sparse.. MIT

pytorchviz (🥉19 · ⭐ 3.3K) - A small package to create visualizations of PyTorch execution graphs. MIT
-
GitHub (👨💻 6 · 🔀 280 · 📦 2.7K · 📋 72 - 47% open · ⏱️ 30.12.2024):
git clone https://github.com/szagoruyko/pytorchviz
Show 21 hidden projects...
- pretrainedmodels (🥈30 · ⭐ 9.1K · 💀) - Pretrained ConvNets for pytorch: NASNet, ResNeXt,..
BSD-3
- pytorch-summary (🥈28 · ⭐ 4K · 💀) - Model summary in PyTorch similar to
model.summary()
in..MIT
- lightning-flash (🥈28 · ⭐ 1.7K · 💀) - Your PyTorch AI Factory - Flash enables you to easily..
Apache-2
- EfficientNet-PyTorch (🥈27 · ⭐ 8.1K · 💀) - A PyTorch implementation of EfficientNet.
Apache-2
- pytorch-optimizer (🥈27 · ⭐ 3.1K · 💀) - torch-optimizer -- collection of optimizers for..
Apache-2
- TabNet (🥈26 · ⭐ 2.7K · 💀) - PyTorch implementation of TabNet paper :..
MIT
- Torchmeta (🥉25 · ⭐ 2K · 💀) - A collection of extensions and data-loaders for few-shot..
MIT
- Higher (🥉24 · ⭐ 1.6K · 💀) - higher is a pytorch library allowing users to obtain higher..
Apache-2
- micrograd (🥉22 · ⭐ 12K · 💀) - A tiny scalar-valued autograd engine and a neural net library..
MIT
- SRU (🥉22 · ⭐ 2.1K · 💀) - Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755).
MIT
- AdaBound (🥉21 · ⭐ 2.9K · 💀) - An optimizer that trains as fast as Adam and as good as SGD.
Apache-2
- reformer-pytorch (🥉21 · ⭐ 2.2K · 💀) - Reformer, the efficient Transformer, in Pytorch.
MIT
- Antialiased CNNs (🥉21 · ⭐ 1.7K · 💀) - pip install antialiased-cnns to improve stability and..
❗️CC BY-NC-SA 4.0
- Performer Pytorch (🥉21 · ⭐ 1.1K · 💀) - An implementation of Performer, a linear attention-..
MIT
- Poutyne (🥉21 · ⭐ 570) - A simplified framework and utilities for PyTorch.
❗️LGPL-3.0
- Lambda Networks (🥉19 · ⭐ 1.5K · 💀) - Implementation of LambdaNetworks, a new approach to..
MIT
- Torch-Struct (🥉19 · ⭐ 1.1K · 💀) - Fast, general, and tested differentiable structured..
MIT
- Tensor Sensor (🥉18 · ⭐ 800 · 💀) - The goal of this library is to generate more helpful..
MIT
- Tez (🥉17 · ⭐ 1.2K · 💀) - Tez is a super-simple and lightweight Trainer for PyTorch. It..
Apache-2
- Pywick (🥉17 · ⭐ 400 · 💀) - High-level batteries-included neural network training library for..
MIT
- TorchDrift (🥉15 · ⭐ 320 · 💀) - Drift Detection for your PyTorch Models.
Apache-2
Libraries for connecting to, operating, and querying databases.
🔗 best-of-python - DB Clients ( ⭐ 3.9K) - Collection of database clients for python.
scipy (🥇50 · ⭐ 14K) - Ecosystem of open-source software for mathematics, science, and engineering. BSD-3
PyOD (🥈35 · ⭐ 9K) - A Python Library for Outlier and Anomaly Detection, Integrating Classical and.. BSD-2
anomalib (🥈30 · ⭐ 4.2K) - An anomaly detection library comprising state-of-the-art algorithms.. Apache-2
adapter-transformers (🥉28 · ⭐ 2.7K) - A Unified Library for Parameter-Efficient and Modular.. Apache-2
huggingface
TabPy (🥉28 · ⭐ 1.6K) - Execute Python code on the fly and display results in Tableau visualizations:. MIT
metric-learn (🥉26 · ⭐ 1.4K · 💤) - Metric learning algorithms in Python. MIT

Feature Engine (🥉25 · ⭐ 2K · 💤) - Feature engineering package with sklearn like functionality. BSD-3
MONAILabel (🥉22 · ⭐ 680) - MONAI Label is an intelligent open source image labeling and.. Apache-2
SUOD (🥉22 · ⭐ 380) - (MLSys 21) An Acceleration System for Large-scare Unsupervised Heterogeneous.. BSD-2
benchmark_VAE (🥉21 · ⭐ 1.9K · 💤) - Unifying Variational Autoencoder (VAE).. Apache-2

pymdp (🥉20 · ⭐ 510) - A Python implementation of active inference for Markov Decision Processes. MIT
NeuralCompression (🥉15 · ⭐ 540 · 💤) - A collection of tools for neural compression enthusiasts. MIT
Show 28 hidden projects...
- Cython BLIS (🥈32 · ⭐ 220) - Fast matrix-multiplication as a self-contained Python library no..
BSD-3
- cleanlab (🥈31 · ⭐ 10K) - The standard data-centric AI package for data quality and machine..
❗️AGPL-3.0
- pysc2 (🥈29 · ⭐ 8.1K · 💀) - StarCraft II Learning Environment.
Apache-2
- kmodes (🥈29 · ⭐ 1.3K · 💀) - Python implementations of the k-modes and k-prototypes clustering..
MIT
- pyclustering (🥈29 · ⭐ 1.2K · 💀) - pyclustering is a Python, C++ data mining library.
BSD-3
- alibi-detect (🥉28 · ⭐ 2.3K) - Algorithms for outlier, adversarial and drift detection.
❗️Intel
- modAL (🥉28 · ⭐ 2.3K · 💀) - A modular active learning framework for Python.
MIT
- gplearn (🥉27 · ⭐ 1.7K · 💀) - Genetic Programming in Python, with a scikit-learn inspired API.
BSD-3
- minisom (🥉27 · ⭐ 1.5K) - MiniSom is a minimalistic implementation of the Self Organizing..
❗️CC-BY-3.0
- PySwarms (🥉27 · ⭐ 1.3K · 💀) - A research toolkit for particle swarm optimization in Python.
MIT
- metricflow (🥉26 · ⭐ 1.2K) - MetricFlow allows you to define, build, and maintain metrics..
❗Unlicensed
- findspark (🥉25 · ⭐ 520 · 💀) - Find pyspark to make it importable.
BSD-3
- pandas-ai (🥉24 · ⭐ 19K) - Chat with your database or your datalake (SQL, CSV, parquet)...
❗Unlicensed
- Mars (🥉24 · ⭐ 2.7K · 💀) - Mars is a tensor-based unified framework for large-scale data..
Apache-2
- AstroML (🥉23 · ⭐ 1.1K · 💀) - Machine learning, statistics, and data mining for astronomy..
BSD-2
- vecstack (🥉23 · ⭐ 690 · 💀) - Python package for stacking (machine learning technique).
MIT
- opyrator (🥉22 · ⭐ 3.1K · 💀) - Turns your machine learning code into microservices with web API,..
MIT
- mlens (🥉22 · ⭐ 850 · 💀) - ML-Ensemble high performance ensemble learning.
MIT
- apricot (🥉22 · ⭐ 500 · 💀) - apricot implements submodular optimization for the purpose of..
MIT
- impyute (🥉21 · ⭐ 360 · 💀) - Data imputations library to preprocess datasets with missing data.
MIT
- StreamAlert (🥉20 · ⭐ 2.9K · 💀) - StreamAlert is a serverless, realtime data analysis..
Apache-2
- rrcf (🥉20 · ⭐ 510 · 💀) - Implementation of the Robust Random Cut Forest algorithm for anomaly..
MIT
- scikit-rebate (🥉20 · ⭐ 420 · 💀) - A scikit-learn-compatible Python implementation of..
MIT
- baikal (🥉19 · ⭐ 590 · 💀) - A graph-based functional API for building complex scikit-learn..
BSD-3
- KD-Lib (🥉16 · ⭐ 620 · 💀) - A Pytorch Knowledge Distillation library for benchmarking and..
MIT
- pandas-ml (🥉16 · ⭐ 320 · 💀) - pandas, scikit-learn, xgboost and seaborn integration.
BSD-3
- nylon (🥉14 · ⭐ 83 · 💀) - An intelligent, flexible grammar of machine learning.
MIT
- traingenerator (🥉13 · ⭐ 1.4K · 💀) - A web app to generate template code for machine learning.
MIT
- Papers With Code: Discover ML papers, code, and evaluation tables.
- Sotabench: Discover & compare open-source ML models.
- Google Dataset Search: Dataset search engine by Google.
- Dataset List: List of the biggest ML datasets from across the web.
- Awesome Public Datasets: A topic-centric list of open datasets.
- Best-of lists: Discover other best-of lists with awesome open-source projects on all kinds of topics.
- best-of-python-dev: A ranked list of awesome python developer tools and libraries.
- best-of-web-python: A ranked list of awesome python libraries for web development.
Contributions are encouraged and always welcome! If you like to add or update projects, choose one of the following ways:
- Open an issue by selecting one of the provided categories from the issue page and fill in the requested information.
- Modify the projects.yaml with your additions or changes, and submit a pull request. This can also be done directly via the Github UI.
If you like to contribute to or share suggestions regarding the project metadata collection or markdown generation, please refer to the best-of-generator repository. If you like to create your own best-of list, we recommend to follow this guide.
For more information on how to add or update projects, please read the contribution guidelines. By participating in this project, you agree to abide by its Code of Conduct.