Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python data science
data-science
x
python
x
2,551 search results found
God Level Data Science Ml Full Stack
⭐
3,384
A collection of scientific methods, processes, algorithms, and systems to build stories & models. Whether you are a fresher in the field or an experienced professional who wants to transition into Data Science & AI
Pipelines
⭐
3,368
Machine Learning Pipelines for Kubeflow
Python Training
⭐
3,365
Python training for business analysts and traders
Statsforecast
⭐
3,339
Lightning ⚡️ fast forecasting with statistical and econometric models.
Tensorwatch
⭐
3,333
Debugging, monitoring and visualization for Python Machine Learning and Data Science
Ploomber
⭐
3,318
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
Koalas
⭐
3,291
Koalas: pandas API on Apache Spark
Course Nlp
⭐
3,271
A Code-First Introduction to NLP course
Awesome Machine Learning Interpretability
⭐
3,241
A curated list of awesome responsible machine learning resources.
Awesome Mlops
⭐
3,233
😎 A curated list of awesome MLOps tools
Deepchecks
⭐
3,206
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
Ml Workspace
⭐
3,197
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Vectorbt
⭐
3,189
Find your trading edge, using the fastest engine for backtesting, algorithmic trading, and research.
Scikit Learn Videos
⭐
3,180
Jupyter notebooks from the scikit-learn video series
Geemap
⭐
3,159
A Python package for interactive geospatial analysis and visualization with Google Earth Engine.
Interviews.ai
⭐
3,146
It is my belief that you, the postgraduate students and job-seekers for whom the book is primarily meant will benefit from reading it; however, it is my hope that even the most experienced researchers will find it fascinating as well.
Awesome Conformal Prediction
⭐
3,097
A professionally curated list of awesome Conformal Prediction videos, tutorials, books, papers, PhD and MSc theses, articles and open-source libraries.
Stable Baselines
⭐
3,064
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Marimo
⭐
3,037
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Igel
⭐
3,037
a delightful machine learning tool that allows you to train, test, and use models without writing code
Lance
⭐
3,003
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..
Stumpy
⭐
2,901
STUMPY is a powerful and scalable Python library for modern time series analysis
Leafmap
⭐
2,899
A Python package for interactive mapping and geospatial analysis with minimal coding in a Jupyter environment
Mljar Supervised
⭐
2,867
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
Stellargraph
⭐
2,768
StellarGraph - Machine Learning on Graphs
Dtreeviz
⭐
2,720
A python library for decision tree visualization and model interpretation.
Data Science Best Resources
⭐
2,718
Carefully curated resource links for data science in one place
Ml Glossary
⭐
2,710
Machine learning glossary
Data Diff
⭐
2,707
Compare tables within or across databases
Eli5
⭐
2,695
A library for debugging/inspecting machine learning classifiers and explaining their predictions
Ffcv
⭐
2,694
FFCV: Fast Forward Computer Vision (and other ML workloads!)
Sweetviz
⭐
2,687
Visualize and compare datasets, target values and associations, with one line of code.
Deep Learning Book
⭐
2,650
Repository for "Introduction to Artificial Neural Networks and Deep Learning: A Practical Guide with Applications in Python"
Machine Learning
⭐
2,607
🌎 machine learning tutorials (mainly in Python3)
Whylogs
⭐
2,533
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
Quadratic
⭐
2,485
Quadratic | Data Science Spreadsheet with Python & SQL
Python Is Cool
⭐
2,468
Cool Python features for machine learning that I used to be too afraid to use. Will be updated as I have more time / learn more.
Gopup
⭐
2,451
数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字
Data Science Roadmap
⭐
2,445
Data Science Roadmap from A to Z
Mlops Course
⭐
2,427
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Scikit Plot
⭐
2,277
An intuitive library to add plotting functionality to scikit-learn objects.
Python Causality Handbook
⭐
2,265
Causal Inference for the Brave and True. A light-hearted yet rigorous approach to learning about impact estimation and causality.
Awesome Community Detection
⭐
2,232
A curated list of community detection research papers with implementations.
Pyfunctional
⭐
2,232
Python library for creating data pipelines with chain functional programming
Lifelines
⭐
2,227
Survival analysis in Python
Ml Foundations
⭐
2,224
Machine Learning Foundations: Linear Algebra, Calculus, Statistics & Computer Science
Mito
⭐
2,201
The mitosheet package, trymito.io, and other public Mito code.
Linear Algebra With Python
⭐
2,160
Lecture Notes for Linear Algebra Featuring Python. This series of lecture notes will walk you through all the must-know concepts that set the foundation of data science or advanced quantitative skillsets. Suitable for statistician/econometrician, quantitative analysts, data scientists and etc. to quickly refresh the linear algebra with the assistance of Python computation and visualization.
Awesome Python Data Science
⭐
2,126
Probably the best curated list of data science software in Python.
Sketch
⭐
2,106
AI code-writing assistant that understands data content
Data Science Interview Questions Answers
⭐
2,102
Curated list of data science interview questions and answers
Wooey
⭐
2,061
A Django app that creates automatic web UIs for Python scripts.
Codesearchnet
⭐
2,054
Datasets, tools, and benchmarks for representation learning of code.
Fast F1
⭐
1,960
FastF1 is a python package for accessing and analyzing Formula 1 results, schedules, timing data and telemetry
Torchmetrics
⭐
1,913
Torchmetrics - Machine learning metrics for distributed, scalable PyTorch applications.
Blazingsql
⭐
1,900
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
Handout
⭐
1,880
Turn Python scripts into handouts with Markdown and figures
Lazynlp
⭐
1,867
Library to scrape and clean web pages to create massive datasets.
Benchm Ml
⭐
1,839
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
Datasciencecoursera
⭐
1,838
Data Science Repo and blog for John Hopkins Coursera Courses. Please let me know if you have any questions.
Awesome Streamlit
⭐
1,832
The purpose of this project is to share knowledge on how awesome Streamlit is and can be
Pandas Videos
⭐
1,808
Jupyter notebook and datasets from the pandas Q&A video series
Dataprep
⭐
1,807
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
Diffgram
⭐
1,772
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
Scanpy
⭐
1,753
Single-cell analysis in Python. Scales to >1M cells.
Python Cheat Sheet
⭐
1,714
Python Cheat Sheet NumPy, Matplotlib
Arcgis Python Api
⭐
1,710
Documentation and samples for ArcGIS API for Python
Nannyml
⭐
1,695
nannyml: post-deployment data science in python
Chdb
⭐
1,686
chDB is an embedded OLAP SQL Engine 🚀 powered by ClickHouse
Featureform
⭐
1,670
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
Feature_engine
⭐
1,651
Feature engineering package with sklearn like functionality
Autolabel
⭐
1,645
Label, clean and enrich text datasets with LLMs.
Covid19 Dashboard
⭐
1,645
A site that displays up to date COVID-19 stats, powered by fastpages.
Doit
⭐
1,590
task management & automation tool
Pysr
⭐
1,580
High-Performance Symbolic Regression in Python and Julia
Machine_learning_refined
⭐
1,574
Notes, examples, and Python demos for the 2nd edition of the textbook "Machine Learning Refined" (published by Cambridge University Press).
Dat8
⭐
1,549
General Assembly's 2015 Data Science course in Washington, DC
Spark Py Notebooks
⭐
1,515
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
An Introduction To Statistical Learning
⭐
1,481
This repository contains the exercises and its solution contained in the book "An Introduction to Statistical Learning" in python.
Fklearn
⭐
1,465
fklearn: Functional Machine Learning
Pybroker
⭐
1,455
Algorithmic Trading in Python with Machine Learning
Auto_ml
⭐
1,442
[UNMAINTAINED] Automated machine learning for analytics & production
Pycm
⭐
1,413
Multi-class confusion matrix library in Python
H2o Tutorials
⭐
1,403
Tutorials and training material for the H2O Machine Learning Platform
Mlbox
⭐
1,403
MLBox is a powerful Automated Machine Learning python library.
Scikit Learn Tips
⭐
1,393
🤖⚡ 50 scikit-learn tips
Hyperlearn
⭐
1,387
2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.
Andrew Ng Notes
⭐
1,367
This is Andrew NG Coursera Handwritten Notes.
Awesome Fraud Detection Papers
⭐
1,364
A curated list of data mining papers about fraud detection.
Hackermath
⭐
1,339
Introduction to Statistics and Basics of Mathematics for Data Science - The Hacker's Way
Lifetimes
⭐
1,339
Lifetime value in Python
Efficient_python_tricks_and_tools_for_data_scientists
⭐
1,332
Efficient Python Tricks and Tools for Data Scientists
Uncertainty Baselines
⭐
1,324
High-quality implementations of standard and SOTA methods on a variety of tasks.
Finance
⭐
1,317
150+ quantitative finance Python programs to help you gather, manipulate, and analyze stock market data
Dataprofiler
⭐
1,310
What's in your data? Extract schema, statistics and entities from datasets
Dltk
⭐
1,293
Deep Learning Toolkit for Medical Image Analysis
Cracking The Data Science Interview
⭐
1,291
A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep
Budgetml
⭐
1,290
Deploy a ML inference service on a budget in less than 10 lines of code.
Hamilton
⭐
1,272
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.
Refinery
⭐
1,257
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
Related Searches
Python Machine Learning (20,195)
Python Flask (17,643)
Python Dataset (14,792)
Python Docker (14,113)
Python Tensorflow (13,736)
Python Deep Learning (13,092)
Python Jupyter Notebook (12,976)
Python Html (10,924)
Python Algorithms (10,033)
Python Natural Language Processing (9,064)
101-200 of 2,551 search results
< Previous
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.