Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python data science
data-science
x
python
x
3,569 search results found
Keras
⭐
58,548
Deep Learning for humans
Scikit Learn
⭐
54,507
scikit-learn: machine learning in Python
Superset
⭐
52,360
Apache Superset is a Data Visualization and Data Exploration Platform
Ml For Beginners
⭐
48,894
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Pandas
⭐
38,610
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Made With Ml
⭐
33,193
Learn how to responsibly develop, deploy and maintain production machine learning applications.
Spacy
⭐
26,305
💫 Industrial-strength Natural Language Processing (NLP) in Python
Ray
⭐
25,939
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
Streamlit
⭐
25,161
Streamlit — A faster way to build and share data apps.
Data Science Ipython Notebooks
⭐
25,025
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Lightning
⭐
23,613
Deep learning framework to train, deploy, and ship AI products Lightning fast.
Data Science For Beginners
⭐
19,218
10 Weeks, 20 Lessons, Data Science for All!
Dash
⭐
18,809
Data Apps & Dashboards for Python. No JavaScript Required.
Fastbook
⭐
18,555
The fastai book, published as Jupyter Notebooks
Gradio
⭐
18,502
Create UIs for your machine learning model in Python in 3 minutes
D2l En
⭐
18,049
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 400 universities from 60 countries including Stanford, MIT, Harvard, and Cambridge.
Matplotlib
⭐
17,518
matplotlib: plotting with Python
Ipython
⭐
15,830
Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
Recommenders
⭐
15,820
Best Practices on Recommendation Systems
Gensim
⭐
14,374
Topic Modelling for Humans
Awesome Pytorch List
⭐
14,103
A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
Best Of Ml Python
⭐
13,778
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Virgilio
⭐
13,415
Your new Mentor for Data Science E-Learning.
Nni
⭐
12,968
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
500 Ai Machine Learning Deep Learning Computer Vision Nlp Projects With Code
⭐
12,490
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Prefect
⭐
12,102
The easiest way to orchestrate and observe your data pipelines
Python Machine Learning Book
⭐
11,645
The "Python Machine Learning (1st edition)" book code repository and info resource
Dvc
⭐
11,620
🦉 Data Version Control | Git for Data & Models | ML Experiments Management
Ds Cheatsheets
⭐
11,535
List of Data Science Cheatsheets to rule the world
Allennlp
⭐
11,300
An open-source NLP research library, built on PyTorch.
Seaborn
⭐
10,751
Statistical data visualization in Python
Ydata Profiling
⭐
10,699
Create HTML profiling reports from pandas DataFrame objects
Tflearn
⭐
9,512
Deep learning library featuring a higher-level API for TensorFlow.
Numerical Linear Algebra
⭐
9,325
Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
Tpot
⭐
9,098
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
Ludwig
⭐
8,959
Data-centric declarative deep learning framework
Computervision Recipes
⭐
8,950
Best Practices, code samples, and documentation for Computer Vision.
Mlcourse.ai
⭐
8,803
Open Machine Learning Course
Modin
⭐
8,692
Modin: Scale your Pandas workflows by changing a single line of code
Statsmodels
⭐
8,531
Statsmodels: statistical modeling and econometrics in Python
Great_expectations
⭐
8,435
Always know what to expect from your data.
Vaex
⭐
7,880
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
Dagster
⭐
7,600
An orchestration platform for the development, production, and observation of data assets.
Data Science From Scratch
⭐
7,555
code for Data Science From Scratch book
Pycaret
⭐
7,372
An open-source, low-code machine learning library in Python
Machine_learning_examples
⭐
7,348
A collection of machine learning examples and tutorials.
Catboost
⭐
7,177
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
Python Small Examples
⭐
7,135
告别枯燥,致力于打造 Python 实用小例子,更多Python良心教程见 Python中文网 http://www.zglg.work
Pyod
⭐
7,013
A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)
Pandas Ai
⭐
6,989
Pandas AI is a Python library that integrates generative artificial intelligence capabilities into Pandas, making dataframes conversational
Cookiecutter Data Science
⭐
6,701
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Metaflow
⭐
6,698
🚀 Build and manage real-life data science projects with ease!
Featuretools
⭐
6,659
An open source python library for automated feature engineering
Akshare
⭐
6,624
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Sktime
⭐
6,536
A unified framework for machine learning with time series
Ai Learn
⭐
6,327
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Py tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Imbalanced Learn
⭐
6,325
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
H2o 3
⭐
6,303
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Folium
⭐
6,261
Python Data. Leaflet.js Maps.
Wandb
⭐
6,168
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
Boltons
⭐
6,144
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.
Deeplake
⭐
6,095
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
Industry Machine Learning
⭐
6,077
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
Cleanlab
⭐
6,050
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Data Science Blogs
⭐
6,048
A curated list of data science blogs
Darts
⭐
5,961
A python library for user-friendly forecasting and anomaly detection on time series.
Python Machine Learning Book 2nd Edition
⭐
5,944
The "Python Machine Learning (2nd edition)" book code repository and info resource
Dowhy
⭐
5,923
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
Autogluon
⭐
5,815
AutoGluon: AutoML for Image, Text, Time Series, and Tabular Data
Data Scientist Roadmap
⭐
5,698
Toturials coming with the "data science roadmap" picture.
Cudf
⭐
5,554
cuDF - GPU DataFrame Library
Snorkel
⭐
5,490
A system for quickly generating training data with weak supervision
Knowledge Repo
⭐
5,314
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
Data Analysis And Machine Learning Projects
⭐
4,836
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
Mage Ai
⭐
4,808
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
Datasciencepython
⭐
4,776
common data analysis and machine learning tasks using python
Mlxtend
⭐
4,404
A library of extension and helper modules for Python's data analysis and machine learning libraries.
Serenata De Amor
⭐
4,378
🕵 Artificial Intelligence for social control of public administration
Feast
⭐
4,367
Feature Store for Machine Learning
Lux
⭐
4,303
Automatically visualize your pandas dataframe via a single print! 📊 💡
River
⭐
4,261
🌊 Online machine learning in Python
Orange3
⭐
4,132
🍊 📊 💡 Orange: Interactive data analysis
Dtale
⭐
4,068
Visualizer for pandas data structures
Machine_learning_complete
⭐
3,985
A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.
Mimesis
⭐
3,972
Mimesis is a robust data generator for Python, capable of rapidly producing large volumes of synthetic data for various use cases.
Orchest
⭐
3,876
Build data pipelines, the easy way 🛠️
Aim
⭐
3,783
Aim 💫 — An easy-to-use & supercharged open-source AI metadata tracker (experiment tracking, AI agents tracing)
Datascience
⭐
3,751
Curated list of Python resources for data science.
Data Science
⭐
3,718
Collection of useful data science topics along with articles, videos, and code
Datascienceresources
⭐
3,710
Open Source Data Science Resources.
Gluonts
⭐
3,598
Probabilistic time series modeling in Python
Evidently
⭐
3,502
Evaluate and monitor ML models from validation to production. Join our Discord: https://discord.com/invite/xZjKRaNp8b
Aws Sdk Pandas
⭐
3,477
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Flyte
⭐
3,442
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
Fastpages
⭐
3,435
An easy to use blogging platform, with enhanced support for Jupyter Notebooks.
Awesome Jupyter
⭐
3,375
A curated list of awesome Jupyter projects, libraries and resources
Chartify
⭐
3,333
Python library that makes it easy for data scientists to create charts.
Course Nlp
⭐
3,271
A Code-First Introduction to NLP course
Data Science At The Command Line
⭐
3,231
Data Science at the Command Line
Koalas
⭐
3,228
Koalas: pandas API on Apache Spark
Related Searches
Python Machine Learning (20,195)
Python Jupyter Notebook (18,595)
Python Flask (16,475)
Python Pytorch (15,135)
Python Dataset (14,792)
Python Docker (14,113)
Python Tensorflow (13,736)
Python Deep Learning (13,092)
Javascript Python (9,798)
Python Algorithms (9,749)
1-100 of 3,569 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2023 Awesome Open Source. All rights reserved.