Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python data science
data-science
x
python
x
2,550 search results found
Ml For Beginners
⭐
63,698
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Keras
⭐
60,854
Deep Learning for humans
Superset
⭐
58,051
Apache Superset is a Data Visualization and Data Exploration Platform
Scikit Learn
⭐
57,160
scikit-learn: machine learning in Python
Pandas
⭐
41,701
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Made With Ml
⭐
35,496
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Airflow
⭐
34,299
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Streamlit
⭐
29,794
Streamlit — A faster way to build and share data apps.
Ray
⭐
29,596
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Spacy
⭐
28,628
💫 Industrial-strength Natural Language Processing (NLP) in Python
Pytorch Lightning
⭐
26,592
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
Gradio
⭐
25,823
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Data Science Ipython Notebooks
⭐
25,668
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Data Science For Beginners
⭐
25,362
10 Weeks, 20 Lessons, Data Science for All!
D2l En
⭐
20,613
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Dash
⭐
19,976
Data Apps & Dashboards for Python. No JavaScript Required.
Fastbook
⭐
19,737
The fastai book, published as Jupyter Notebooks
Matplotlib
⭐
18,777
matplotlib: plotting with Python
Recommenders
⭐
17,739
Best Practices on Recommendation Systems
Ipython
⭐
16,063
Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
Gensim
⭐
15,180
Topic Modelling for Humans
Best Of Ml Python
⭐
14,990
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Awesome Pytorch List
⭐
14,715
A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
Prefect
⭐
14,339
Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
500 Ai Machine Learning Deep Learning Computer Vision Nlp Projects With Code
⭐
14,248
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Nni
⭐
13,725
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Virgilio
⭐
13,515
Your new Mentor for Data Science E-Learning.
Dvc
⭐
12,813
🦉 ML Experiments Management with Git
Ydata Profiling
⭐
11,983
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Python Machine Learning Book
⭐
11,645
The "Python Machine Learning (1st edition)" book code repository and info resource
Seaborn
⭐
11,624
Statistical data visualization in Python
Ds Cheatsheets
⭐
11,535
List of Data Science Cheatsheets to rule the world
Allennlp
⭐
11,300
An open-source NLP research library, built on PyTorch.
Numerical Linear Algebra
⭐
9,850
Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
Tflearn
⭐
9,602
Deep learning library featuring a higher-level API for TensorFlow.
Pandas Ai
⭐
9,533
Chat with your data (SQL, CSV, pandas, polars, noSQL, etc). PandasAI makes data analysis conversational
Dagster
⭐
9,467
An orchestration platform for the development, production, and observation of data assets.
Tpot
⭐
9,463
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
Mlcourse.ai
⭐
9,376
Open Machine Learning Course
Modin
⭐
9,275
Modin: Scale your Pandas workflows by changing a single line of code
Statsmodels
⭐
9,242
Statsmodels: statistical modeling and econometrics in Python
Computervision Recipes
⭐
9,225
Best Practices, code samples, and documentation for Computer Vision.
Great_expectations
⭐
9,179
Always know what to expect from your data.
Ai Learn
⭐
8,256
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Py tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Vaex
⭐
8,161
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
Pycaret
⭐
8,130
An open-source, low-code machine learning library in Python
Data Science From Scratch
⭐
7,967
code for Data Science From Scratch book
Machine_learning_examples
⭐
7,861
A collection of machine learning examples and tutorials.
Pyod
⭐
7,751
A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)
Deeplake
⭐
7,663
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
Python Small Examples
⭐
7,607
告别枯燥,致力于打造 Python 实用小例子,更多Python良心教程见 Python中文网 http://www.zglg.work
Catboost
⭐
7,564
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
Metaflow
⭐
7,524
🚀 Build and manage real-life ML, AI, and data science projects with ease!
Cookiecutter Data Science
⭐
7,351
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Autogluon
⭐
7,027
AutoGluon: AutoML for Image, Text, Time Series, and Tabular Data
Featuretools
⭐
7,009
An open source python library for automated feature engineering
Cudf
⭐
6,936
cuDF - GPU DataFrame Library
Darts
⭐
6,903
A python library for user-friendly forecasting and anomaly detection on time series.
Imbalanced Learn
⭐
6,680
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Dowhy
⭐
6,656
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
Folium
⭐
6,649
Python Data. Leaflet.js Maps.
H2o 3
⭐
6,618
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Boltons
⭐
6,330
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.
Fiftyone
⭐
6,327
The open-source tool for building high-quality datasets and computer vision models
Mage Ai
⭐
6,324
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
Industry Machine Learning
⭐
6,077
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
Data Science Blogs
⭐
6,048
A curated list of data science blogs
Python Machine Learning Book 2nd Edition
⭐
5,944
The "Python Machine Learning (2nd edition)" book code repository and info resource
Data Scientist Roadmap
⭐
5,698
Toturials coming with the "data science roadmap" picture.
Snorkel
⭐
5,692
A system for quickly generating training data with weak supervision
Data Analysis And Machine Learning Projects
⭐
5,596
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
Knowledge Repo
⭐
5,344
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
Feast
⭐
5,053
Feature Store for Machine Learning
Skypilot
⭐
4,975
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
Datasciencepython
⭐
4,776
common data analysis and machine learning tasks using python
River
⭐
4,748
🌊 Online machine learning in Python
Mlxtend
⭐
4,669
A library of extension and helper modules for Python's data analysis and machine learning libraries.
Lux
⭐
4,642
Automatically visualize your pandas dataframe via a single print! 📊 💡
Aim
⭐
4,497
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
Orange3
⭐
4,469
🍊 📊 💡 Orange: Interactive data analysis
Dtale
⭐
4,407
Visualizer for pandas data structures
Flyte
⭐
4,380
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
Taipy
⭐
4,311
Turns Data and AI algorithms into production-ready web applications in no time.
Mimesis
⭐
4,298
Mimesis is a powerful Python library that empowers developers to generate massive amounts of synthetic data efficiently.
Machine_learning_complete
⭐
4,296
A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.
Datascience
⭐
3,955
Curated list of Python resources for data science.
Pymupdf
⭐
3,908
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Data Science
⭐
3,898
Collection of useful data science topics along with articles, videos, and code
Orchest
⭐
3,876
Build data pipelines, the easy way 🛠️
Datascienceresources
⭐
3,826
Open Source Data Science Resources.
Aws Sdk Pandas
⭐
3,779
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Panel
⭐
3,728
Panel: The powerful data exploration & web app framework for Python
Awesome Jupyter
⭐
3,709
A curated list of awesome Jupyter projects, libraries and resources
Mercury
⭐
3,660
Convert Jupyter Notebooks to Web Apps
Data Science At The Command Line
⭐
3,518
Data Science at the Command Line
Flaml
⭐
3,500
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
Pytorch Forecasting
⭐
3,439
Time series forecasting with PyTorch
Chartify
⭐
3,436
Python library that makes it easy for data scientists to create charts.
Fastpages
⭐
3,435
An easy to use blogging platform, with enhanced support for Jupyter Notebooks.
God Level Data Science Ml Full Stack
⭐
3,384
A collection of scientific methods, processes, algorithms, and systems to build stories & models. Whether you are a fresher in the field or an experienced professional who wants to transition into Data Science & AI
Related Searches
Python Machine Learning (20,195)
Python Flask (17,643)
Python Dataset (14,792)
Python Docker (14,113)
Python Tensorflow (13,736)
Python Deep Learning (13,092)
Python Jupyter Notebook (12,976)
Python Html (10,924)
Python Algorithms (10,033)
Python Natural Language Processing (9,064)
1-100 of 2,550 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.