Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for data science eda
data-science
x
eda
x
70 search results found
Ydata Profiling
⭐
11,983
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Great_expectations
⭐
9,179
Always know what to expect from your data.
Sweetviz
⭐
2,687
Visualize and compare datasets, target values and associations, with one line of code.
Dataprep
⭐
1,807
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
Complete Life Cycle Of A Data Science Project
⭐
499
Complete-Life-Cycle-of-a-Data-Science-Project
Dataexplorer
⭐
486
Automate Data Exploration and Treatment
Piperider
⭐
443
Code review for data in dbt
Skimpy
⭐
332
skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.
Erlemar.github.io
⭐
321
Data science portfolio
Data Describe
⭐
292
data⎰describe: Pythonic EDA Accelerator for Data Science
Arkouda
⭐
215
Arkouda (αρκούδα): Interactive Data Analytics at Supercomputing Scale 🐻
100 Days Of Ml Code
⭐
201
A day to day plan for this challenge. Covers both theoritical and practical aspects
Rust Data Analysis
⭐
193
Rust for data analysis encyclopedia (WIP).
Data Science Toolkit
⭐
185
Collection of stats, modeling, and data science tools in Python and R.
Covid 19 Casestudy And Predictions
⭐
90
This repository is a case study, analysis and visualization of COVID-19 Pandemic spread along with prediction models.
Data Analysis Using Python
⭐
58
Exploratory data analysis 📊using python 🐍of used car 🚘 database taken from ⓚ𝖆𝖌𝖌𝖑𝖊
Leila
⭐
56
Librería para la evaluación de calidad de datos, e interacción con el portal de datos.gov.co
Datascience365
⭐
47
DataScience365
Sliceguard
⭐
43
A library for detecting problematic data segments in structured and unstructured data with few lines of code.
Olliepy
⭐
41
OlliePy is a python package which can help data scientists in exploring their data and evaluating and analysing their machine learning experiments by utilising the power and structure of modern web applications. The data scientist only needs to provide the data and any required information and OlliePy will generate the rest.
Cracking Kaggle Competitions
⭐
40
Different approaches for different Classical Machine Learning, and NLP competitions from Kaggle.
Awesome Kaggle Kernels
⭐
38
Compilation of good Kaggle Kernels.
Dsf
⭐
36
Edvart
⭐
29
An open-source Python library for Data Scientists & Data Analysts designed to simplify the exploratory data analysis process. Using Edvart, you can explore data sets and generate reports with minimal coding.
Breadroll
⭐
27
⚠️ breadroll is a simple lightweight application toolkit for data processing operations written in Typescript and powered by Bun.
Data Purifier
⭐
26
A Python library for Automated Exploratory Data Analysis, Automated Data Cleaning, and Automated Data Preprocessing For Machine Learning and Natural Language Processing Applications in Python.
Springboard Data Science Immersive
⭐
23
Data Analysis
⭐
22
Different types of data analytics projects : EDA, PDA, DDA, TSA and much more.....
Mlmachine
⭐
22
mlmachine accelerates machine learning experimentation
Experiments
⭐
21
Code and notebooks containing my experiments in data science, EDA, visualization, and machine learning
Data Viz Utils
⭐
18
Functions for easily making publication-quality figures with matplotlib.
Viz It
⭐
18
Data Visualizer Web-Application
Machine Learning
⭐
17
A set of jupyter notebooks
Welcome To The Tidyverse
⭐
16
A gentle introduction to R and its Tidyverse that focuses on Exploratory Data Analysis (EDA).
Edapy
⭐
16
Exploratory Data Analysis with Python
Data Science With Julia
⭐
15
Exploring data science through Julia programming language
Python_chilla
⭐
13
This repository contains practice materials on Python, used to deliver online training course. The course was sponsered by codenics and Scholership Network. Pakistan
Online Retail Transactions Of Uk
⭐
13
Analyzing the Online Transactions in UK and the countries who are purchase stuff from them and analyzing the reviews from them using NLP and Machine Learning
Octopus Ml
⭐
12
A collection of handy ML and data visualization and validation tools. Go ahead and train, evaluate and validate your ML models and data with minimal effort.
Trainity_data_analytics_trainee
⭐
12
This repository has all the Pdfs along with Data Analysis Portfolio and Data Analytics Certificates from Trainity. Click on the below link for enrolling yourself into Data Analytics internship from Trainity.
Kakao Valhalla
⭐
11
A2V first Project_kakao-arena
Exploratory Data Analysis App
⭐
11
Analyze the descriptive statistics and the distribution of your data. Preview and save your graphics.
Churn Modelling Dataset
⭐
10
Predicting which set of the customers are gong to churn out from the organization by looking into some of the important attributes and applying Machine Learning and Deep Learning on it.
Ads Optimization
⭐
10
Optimizing the best Ads using Reinforcement learning Algorithms such as Thompson Sampling and Upper Confidence Bound.
Pga Tour Data Science Project
⭐
10
Kydlib
⭐
10
Routines for exploratory data analysis.
Credit Card Fraud Detection
⭐
10
The notebook contains Python code for various machine learning tasks and models. Here is an overview of its content:
Dexter
⭐
10
Data Exploration Terser
Covid 19 Complete Eda Analysis
⭐
9
Performed Exploratory Data Analysis(EDA) on the global COVID-19 dataset. Used Geopython to get a worldwide view of COVID-19 cases.
Burro
⭐
9
Exploring data together using shiny (burro(w) into the data)
Data_science_tools1
⭐
8
course website for data science tools 1
Springboard
⭐
8
Notes, Ideas, and Projects related to my Springboard data science career track
Hackathon_motorica_2022
⭐
8
3 этапа хакатона, совместно проведенного Motorica и Skillfactory (numpy, tensorflow)
Lambda Blog Contents
⭐
8
Including Data Competition notes, top solution analysis etc.
Machinelearning
⭐
8
My machine learning portfolio
Car_evaluation
⭐
7
Evaluating a Car based on some popular attributes which could be beneficial in decision making while purchasing a Car, Who do not have enough knowledge about Cars.
Avito Demand Prediction Challenge
⭐
7
It is a Competition for Regression Challenge held by Kaggle, It is based on a Avito Dataset whose size is 123GB which can be accessed from Kaggle, I have done Data Pre-processing, feature engineering, feature extraction, data visualization, machine learning, stacking and boosting
Python_notebooks
⭐
7
Jupyter notebooks for LeetCode problems, Python library notes (NumPy, Pandas, Matplotlib) and DSA concepts.
Big Mart Sales Prediction
⭐
7
Using Machine Learning Algorithms for Regression Analysis to predict the sales pattern and Using Data Analysis and Data Visualizations to Support it.
Employee Reviews
⭐
7
This is Project which contains Data Visualization, EDA, Machine Learning Modelling for Checking the Sentiments.
Capital One Data Challenge
⭐
7
NYC Taxi Data Challenge - Data Scientist
Covid Dataset Analysis
⭐
6
Analysis and review of the Covid dataset for EDA practice in pandas, numpy, and matplotlib and practice working with data.
Datasethub
⭐
6
Complete Notebooks on how AnimeWorld dataset was created, EDA, and Recommendation Engine.
Amazon Alexa Reviews
⭐
6
Using Natural Language Processing, Data Visualizations and Classification Algorithms of Machine Learning
Model Eda Analysis Cart
⭐
5
REPO : OPEN SOURCE CONTRI
Kaggle_kernels
⭐
5
It's contain a Data scince - Machine learning ,Data visualizations codes & Datasets
Zeppelin Datascience
⭐
5
Edasql
⭐
5
edaSQL is a python library to bridge the SQL with Exploratory Data Analysis where you can connect to the Database and insert the queries. The query results can be passed to the EDA tool which can give greater insights to the user.
Mlt
⭐
5
Machine learning on ATP and WTA Tennis Matches with Betting odds Data
Aeda
⭐
5
AEDA - Automated Data Exploratory Analysis in R
Automat
⭐
5
Do EDA from the command line. A small tool that helps wrangling tabular data and plays nicely with other command line tools
Predictive Modeling In R
⭐
5
Workshop (2-6 hours): cleaning, missing value imputation, EDA, ensemble learning, calibration, variable importance ranking, accumulated local effect plots. WIP.
Related Searches
Python Data Science (6,905)
Machine Learning Data Science (5,390)
Jupyter Notebook Data Science (3,734)
Deep Learning Data Science (1,108)
R Data Science (939)
Html Data Science (872)
Data Science Pandas (794)
Jupyter Notebook Eda (770)
Statistics Data Science (557)
1-70 of 70 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.