Awesome Open Source
Awesome Open Source
Combined Topics
exploratory-data-analysis
x
Advertising
📦 10
All Projects
Application Programming Interfaces
📦 124
Applications
📦 192
Artificial Intelligence
📦 78
Blockchain
📦 73
Build Tools
📦 113
Cloud Computing
📦 80
Code Quality
📦 28
Collaboration
📦 32
Command Line Interface
📦 49
Community
📦 83
Companies
📦 60
Compilers
📦 63
Computer Science
📦 80
Configuration Management
📦 42
Content Management
📦 175
Control Flow
📦 213
Data Formats
📦 78
Data Processing
📦 276
Data Storage
📦 135
Economics
📦 64
Frameworks
📦 215
Games
📦 129
Graphics
📦 110
Hardware
📦 152
Integrated Development Environments
📦 49
Learning Resources
📦 166
Legal
📦 29
Libraries
📦 129
Lists Of Projects
📦 22
Machine Learning
📦 347
Mapping
📦 64
Marketing
📦 15
Mathematics
📦 55
Media
📦 239
Messaging
📦 98
Networking
📦 315
Operating Systems
📦 89
Operations
📦 121
Package Managers
📦 55
Programming Languages
📦 245
Runtime Environments
📦 100
Science
📦 42
Security
📦 396
Social Media
📦 27
Software Architecture
📦 72
Software Development
📦 72
Software Performance
📦 58
Software Quality
📦 133
Text Editors
📦 49
Text Processing
📦 136
User Interface
📦 330
User Interface Components
📦 514
Version Control
📦 30
Virtualization
📦 71
Web Browsers
📦 42
Web Servers
📦 26
Web User Interface
📦 210
The Top 26 Exploratory Data Analysis Open Source Projects
Categories
>
Data Processing
>
Exploratory Data Analysis
Pandas Profiling
⭐
6,578
Create HTML profiling reports from pandas DataFrame objects
Great_expectations
⭐
3,425
Always know what to expect from your data.
Scattertext
⭐
1,483
Beautiful visualizations of how language differs among document types.
Sweetviz
⭐
1,194
Visualize and compare datasets, target values and associations, with one line of code.
Lux
⭐
655
Python API for Intelligent Visual Data Discovery
Dataprep
⭐
578
DataPrep — The easiest way to prepare data in Python
Data Science Your Way
⭐
525
Ways of doing Data Science Engineering and Machine Learning in R and Python
Musicmood
⭐
385
A machine learning approach to classify songs by mood.
Visdat
⭐
356
Preliminary Exploratory Visualisation of Data
Kdepy
⭐
234
Kernel Density Estimation in Python
Datavisualization
⭐
232
Tutorials on visualizing data using python packages like bokeh, plotly, seaborn and igraph
Autoeda Resources
⭐
228
A list of software and papers related to automatic and fast Exploratory Data Analysis
Lotteryprediction
⭐
199
🌝 Lottery prediction besides of following "law of proability","Probability: Independent Events", there are still "Saying "a Tail is due", or "just one more go, my luck is due to change" is called The Gambler's Fallacy" existed.
Mirador
⭐
186
Tool for visual exploration of complex data.
Inspectdf
⭐
186
🛠️ 📊 Tools for Exploring and Comparing Data Frames
100 Days Of Ml Code
⭐
169
A day to day plan for this challenge. Covers both theoritical and practical aspects
Ditching Excel For Python
⭐
163
Functionalities in Excel translated to Python
Handyspark
⭐
152
HandySpark - bringing pandas-like capabilities to Spark dataframes
Data Describe
⭐
151
data⎰describe: Pythonic EDA Accelerator for Data Science
Complete Life Cycle Of A Data Science Project
⭐
123
Complete-Life-Cycle-of-a-Data-Science-Project
Xda
⭐
111
R package for exploratory data analysis
Spark R Notebooks
⭐
109
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Impy
⭐
109
Impy is a Python3 library with features that help you in your computer vision tasks.
Hn_so_analysis
⭐
94
Is there a relationship between popularity of a given technology on Stack Overflow (SO) and Hacker News (HN)? And a few words about causality
Kaggle Competitions
⭐
86
There are plenty of courses and tutorials that can help you learn machine learning from scratch but here in GitHub, I want to solve some Kaggle competitions as a comprehensive workflow with python packages. After reading, you can use this workflow to solve other real problems and use it as a template.
Edarf
⭐
62
exploratory data analysis using random forests
1-26 of 26 projects
Advertising
📦 10
All Projects
Application Programming Interfaces
📦 124
Applications
📦 192
Artificial Intelligence
📦 78
Blockchain
📦 73
Build Tools
📦 113
Cloud Computing
📦 80
Code Quality
📦 28
Collaboration
📦 32
Command Line Interface
📦 49
Community
📦 83
Companies
📦 60
Compilers
📦 63
Computer Science
📦 80
Configuration Management
📦 42
Content Management
📦 175
Control Flow
📦 213
Data Formats
📦 78
Data Processing
📦 276
Data Storage
📦 135
Economics
📦 64
Frameworks
📦 215
Games
📦 129
Graphics
📦 110
Hardware
📦 152
Integrated Development Environments
📦 49
Learning Resources
📦 166
Legal
📦 29
Libraries
📦 129
Lists Of Projects
📦 22
Machine Learning
📦 347
Mapping
📦 64
Marketing
📦 15
Mathematics
📦 55
Media
📦 239
Messaging
📦 98
Networking
📦 315
Operating Systems
📦 89
Operations
📦 121
Package Managers
📦 55
Programming Languages
📦 245
Runtime Environments
📦 100
Science
📦 42
Security
📦 396
Social Media
📦 27
Software Architecture
📦 72
Software Development
📦 72
Software Performance
📦 58
Software Quality
📦 133
Text Editors
📦 49
Text Processing
📦 136
User Interface
📦 330
User Interface Components
📦 514
Version Control
📦 30
Virtualization
📦 71
Web Browsers
📦 42
Web Servers
📦 26
Web User Interface
📦 210