Awesome Open Source
Awesome Open Source
Combined Topics
data-wrangling
x
Advertising
📦 10
All Projects
Application Programming Interfaces
📦 124
Applications
📦 192
Artificial Intelligence
📦 78
Blockchain
📦 73
Build Tools
📦 113
Cloud Computing
📦 80
Code Quality
📦 28
Collaboration
📦 32
Command Line Interface
📦 49
Community
📦 83
Companies
📦 60
Compilers
📦 63
Computer Science
📦 80
Configuration Management
📦 42
Content Management
📦 175
Control Flow
📦 213
Data Formats
📦 78
Data Processing
📦 276
Data Storage
📦 135
Economics
📦 64
Frameworks
📦 215
Games
📦 129
Graphics
📦 110
Hardware
📦 152
Integrated Development Environments
📦 49
Learning Resources
📦 166
Legal
📦 29
Libraries
📦 129
Lists Of Projects
📦 22
Machine Learning
📦 347
Mapping
📦 64
Marketing
📦 15
Mathematics
📦 55
Media
📦 239
Messaging
📦 98
Networking
📦 315
Operating Systems
📦 89
Operations
📦 121
Package Managers
📦 55
Programming Languages
📦 245
Runtime Environments
📦 100
Science
📦 42
Security
📦 396
Social Media
📦 27
Software Architecture
📦 72
Software Development
📦 72
Software Performance
📦 58
Software Quality
📦 133
Text Editors
📦 49
Text Processing
📦 136
User Interface
📦 330
User Interface Components
📦 514
Version Control
📦 30
Virtualization
📦 71
Web Browsers
📦 42
Web Servers
📦 26
Web User Interface
📦 210
The Top 22 Data Wrangling Open Source Projects
Categories
>
Data Processing
>
Data Wrangling
Openrefine
⭐
8,022
OpenRefine is a free, open source power tool for working with messy data and improving it
Hypertools
⭐
1,622
A Python toolbox for gaining geometric insights into high-dimensional data
Data Science Best Resources
⭐
1,141
Carefully curated resource links for data science in one place
Optimus
⭐
997
🚚 Agile Data Preparation Workflows made easy with pandas, dask, cudf, dask_cudf and pyspark
Data Forge Ts
⭐
977
The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Cracking The Data Science Interview
⭐
715
A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep
Moderndive_book
⭐
532
Statistical Inference via Data Science: A ModernDive into R and the Tidyverse
Prose
⭐
474
Microsoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Example SDK.
Sqawk
⭐
264
Like Awk but with SQL and table joins
Data Cleaning 101
⭐
245
Data Cleaning Libraries with Python
Datatest
⭐
237
Tools for test driven data-wrangling and data validation.
R Ecology Lesson
⭐
221
Data Analysis and Visualization in R for Ecologists
Qsacnpj
⭐
194
Pacote que trata e organiza os dados do Cadastro Nacional da Pessoa Jurídica (CNPJ)
Web Database Analytics
⭐
175
Web scrapping and related analytics using Python tools
Sjmisc
⭐
143
Data transformation and utility functions for R
Data Forge Js
⭐
139
JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
R Novice Gapminder
⭐
128
R for Reproducible Scientific Analysis
Python Ecology Lesson
⭐
116
Data Analysis and Visualization in Python for Ecologists
Python Novice Gapminder
⭐
111
Plotting and Programming in Python
Data Analysis Using Python
⭐
91
Exploratory data analysis 📊using python 🐍of used car 🚘 database taken from ⓚ𝖆𝖌𝖌𝖑𝖊
R Raster Vector Geospatial
⭐
76
Introduction to Geospatial Raster and Vector Data with R
Uc R.github.io
⭐
76
Main repository for R programming courses @ University of Cincinnati, courses and tutorials that focus on data wrangling, exploration, visualization, and analysis with R.
1-22 of 22 projects
Advertising
📦 10
All Projects
Application Programming Interfaces
📦 124
Applications
📦 192
Artificial Intelligence
📦 78
Blockchain
📦 73
Build Tools
📦 113
Cloud Computing
📦 80
Code Quality
📦 28
Collaboration
📦 32
Command Line Interface
📦 49
Community
📦 83
Companies
📦 60
Compilers
📦 63
Computer Science
📦 80
Configuration Management
📦 42
Content Management
📦 175
Control Flow
📦 213
Data Formats
📦 78
Data Processing
📦 276
Data Storage
📦 135
Economics
📦 64
Frameworks
📦 215
Games
📦 129
Graphics
📦 110
Hardware
📦 152
Integrated Development Environments
📦 49
Learning Resources
📦 166
Legal
📦 29
Libraries
📦 129
Lists Of Projects
📦 22
Machine Learning
📦 347
Mapping
📦 64
Marketing
📦 15
Mathematics
📦 55
Media
📦 239
Messaging
📦 98
Networking
📦 315
Operating Systems
📦 89
Operations
📦 121
Package Managers
📦 55
Programming Languages
📦 245
Runtime Environments
📦 100
Science
📦 42
Security
📦 396
Social Media
📦 27
Software Architecture
📦 72
Software Development
📦 72
Software Performance
📦 58
Software Quality
📦 133
Text Editors
📦 49
Text Processing
📦 136
User Interface
📦 330
User Interface Components
📦 514
Version Control
📦 30
Virtualization
📦 71
Web Browsers
📦 42
Web Servers
📦 26
Web User Interface
📦 210