Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for data science
data-science
x
5,348 search results found
Ml For Beginners
⭐
63,698
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Keras
⭐
60,854
Deep Learning for humans
Superset
⭐
58,778
Apache Superset is a Data Visualization and Data Exploration Platform
Scikit Learn
⭐
57,160
scikit-learn: machine learning in Python
Pandas
⭐
41,935
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Made With Ml
⭐
35,496
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Airflow
⭐
34,468
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Streamlit
⭐
29,794
Streamlit — A faster way to build and share data apps.
Ray
⭐
29,596
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Spacy
⭐
28,628
💫 Industrial-strength Natural Language Processing (NLP) in Python
Ai Expert Roadmap
⭐
27,583
Roadmap to becoming an Artificial Intelligence Expert in 2022
Pytorch Lightning
⭐
26,894
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
Probabilistic Programming And Bayesian Methods For Hackers
⭐
26,097
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
Gradio
⭐
25,823
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Data Science Ipython Notebooks
⭐
25,668
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Data Science For Beginners
⭐
25,362
10 Weeks, 20 Lessons, Data Science for All!
Applied Ml
⭐
24,828
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Go
⭐
23,899
The Open Source Data Science Masters
Ml From Scratch
⭐
23,095
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
Awesome Datascience
⭐
23,007
📝 An awesome Data Science repository to learn and apply for real world problems.
D2l En
⭐
20,613
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Dash
⭐
19,976
Data Apps & Dashboards for Python. No JavaScript Required.
Fastbook
⭐
19,737
The fastai book, published as Jupyter Notebooks
Matplotlib
⭐
18,777
matplotlib: plotting with Python
Recommenders
⭐
17,972
Best Practices on Recommendation Systems
Excelize
⭐
17,188
Go language library for reading and writing Microsoft Excel™ (XLAM / XLSM / XLSX / XLTM / XLTX) spreadsheets
Ipython
⭐
16,063
Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
Gensim
⭐
15,180
Topic Modelling for Humans
Best Of Ml Python
⭐
14,990
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Awesome Pytorch List
⭐
14,715
A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
Prefect
⭐
14,603
Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
500 Ai Machine Learning Deep Learning Computer Vision Nlp Projects With Code
⭐
14,248
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Nni
⭐
13,725
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Virgilio
⭐
13,515
Your new Mentor for Data Science E-Learning.
Dvc
⭐
12,813
🦉 ML Experiments Management with Git
Awesome Bigdata
⭐
12,798
A curated list of awesome big data frameworks, ressources and other awesomeness.
Ml Youtube Courses
⭐
11,992
📺 Discover the latest machine learning / AI courses on YouTube.
Ydata Profiling
⭐
11,983
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Awesome Mlops
⭐
11,723
A curated list of references for MLOps
Python Machine Learning Book
⭐
11,645
The "Python Machine Learning (1st edition)" book code repository and info resource
Seaborn
⭐
11,624
Statistical data visualization in Python
Ds Cheatsheets
⭐
11,535
List of Data Science Cheatsheets to rule the world
Allennlp
⭐
11,300
An open-source NLP research library, built on PyTorch.
Ludwig
⭐
10,740
Low-code framework for building custom LLMs, neural networks, and other AI models
Stanford Cs 229 Machine Learning
⭐
10,399
VIP cheatsheets for Stanford's CS 229 Machine Learning
Openrefine
⭐
10,106
OpenRefine is a free, open source power tool for working with messy data and improving it
Mit Deep Learning
⭐
9,897
Tutorials, assignments, and competitions for MIT Deep Learning related courses.
Numerical Linear Algebra
⭐
9,850
Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
Tflearn
⭐
9,602
Deep learning library featuring a higher-level API for TensorFlow.
Pandas Ai
⭐
9,533
Chat with your data (SQL, CSV, pandas, polars, noSQL, etc). PandasAI makes data analysis conversational
Machine Learning For Trading
⭐
9,469
Code for Machine Learning for Algorithmic Trading, 2nd edition.
Dagster
⭐
9,467
An orchestration platform for the development, production, and observation of data assets.
Tpot
⭐
9,463
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
Mlcourse.ai
⭐
9,376
Open Machine Learning Course
Modin
⭐
9,275
Modin: Scale your Pandas workflows by changing a single line of code
Statsmodels
⭐
9,242
Statsmodels: statistical modeling and econometrics in Python
Computervision Recipes
⭐
9,225
Best Practices, code samples, and documentation for Computer Vision.
Amazon Sagemaker Examples
⭐
9,221
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
Great_expectations
⭐
9,179
Always know what to expect from your data.
Trino
⭐
9,118
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Gridstudio
⭐
8,848
Grid studio is a web-based application for data science with full integration of open source data science frameworks and languages.
Gop
⭐
8,768
The Go+ programming language is designed for engineering, STEM education, and data science
Akshare
⭐
8,269
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Ai Learn
⭐
8,256
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Py tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Wandb
⭐
8,204
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
Cleanlab
⭐
8,182
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Vaex
⭐
8,161
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
Pycaret
⭐
8,130
An open-source, low-code machine learning library in Python
Xonsh
⭐
7,986
🐚 Python-powered, cross-platform, Unix-gazing shell.
Data Science From Scratch
⭐
7,967
code for Data Science From Scratch book
Data Science Interviews
⭐
7,936
Data science interview questions and answers
Machine_learning_examples
⭐
7,861
A collection of machine learning examples and tutorials.
Tsfresh
⭐
7,824
Automatic extraction of relevant features from time series:
Machine Learning Systems Design
⭐
7,818
A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"
Hugo Blox Builder
⭐
7,787
😍 EASILY BUILD THE WEBSITE YOU WANT - NO CODE, JUST MARKDOWN BLOCKS! 使用块轻松创建任何类型的网站 - 无需代码。 一个应用程序,没有依赖项,没有 JS
Pyod
⭐
7,751
A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)
Deeplake
⭐
7,689
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
Python Small Examples
⭐
7,607
告别枯燥,致力于打造 Python 实用小例子,更多Python良心教程见 Python中文网 http://www.zglg.work
Catboost
⭐
7,564
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
Metaflow
⭐
7,524
🚀 Build and manage real-life ML, AI, and data science projects with ease!
Sktime
⭐
7,405
A unified framework for machine learning with time series
Cookiecutter Data Science
⭐
7,351
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Autogluon
⭐
7,109
Fast and Accurate ML in 3 Lines of Code
Featuretools
⭐
7,009
An open source python library for automated feature engineering
Cudf
⭐
6,936
cuDF - GPU DataFrame Library
Darts
⭐
6,903
A python library for user-friendly forecasting and anomaly detection on time series.
Dowhy
⭐
6,730
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
Imbalanced Learn
⭐
6,680
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Folium
⭐
6,649
Python Data. Leaflet.js Maps.
H2o 3
⭐
6,618
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Roughviz
⭐
6,548
Reusable JavaScript library for creating sketchy/hand-drawn styled charts in the browser.
Ml Papers Of The Week
⭐
6,370
🔥Highlighting the top ML papers every week.
Boltons
⭐
6,330
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.
Fiftyone
⭐
6,327
The open-source tool for building high-quality datasets and computer vision models
Mage Ai
⭐
6,324
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
Nteract
⭐
6,111
📘 The interactive computing suite for you! ✨
Industry Machine Learning
⭐
6,077
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
Data Science Blogs
⭐
6,048
A curated list of data science blogs
Machine Learning Roadmap
⭐
6,040
A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.
Pachyderm
⭐
6,035
Data-Centric Pipelines and Data Versioning
Related Searches
Python Data Science (6,905)
Machine Learning Data Science (5,390)
Jupyter Notebook Data Science (3,734)
R Data Science (1,164)
Deep Learning Data Science (1,039)
Html Data Science (872)
1-100 of 5,348 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.