Data Wrangling With Python Alternatives

Name: TrainingByPackt/Data-Wrangling-with-Python
Brand: TrainingByPackt/Data-Wrangling-with-Python
SKU: project/TrainingByPackt/Data-Wrangling-with-Python
Rating: 4.44 (66 reviews)

Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices

Categories > Data Processing > Database

Suggest Alternative

Stars

Alternatives

License

mit

Open Issues

Most Recent Commit

over 4 years ago

Programming Language

Jupyter Notebook

Dependent Repos

Dependent Packages

Total Releases

Categories

Programming Languages > Python

Data Processing > Jupyter Notebook

Data Storage > Database

Data Processing > Data Science

Text Processing > Regular Expression

Mathematics > Numpy

Data Processing > Pandas

Data Processing > Web Crawler

Data Processing > Etl

Text Processing > Beautifulsoup

Data Processing > Data Analytics

Repo

Alternatives To TrainingByPackt/Data-Wrangling-with-Python

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
apache/airflow	33,219	0	320	over 2 years ago	169	November 27, 2023	890	apache-2.0	Python
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
dagster-io/dagster	9,467	2	133	over 2 years ago	585	December 07, 2023	2,343	apache-2.0	Python
An orchestration platform for the development, production, and observation of data assets.
mage-ai/mage-ai	6,324	0	0	over 2 years ago	314	December 06, 2023	189	apache-2.0	Python
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
orchest/orchest	3,876	0	0	about 3 years ago	19	December 13, 2022	125	apache-2.0	TypeScript
Build data pipelines, the easy way 🛠️
aws/aws-sdk-pandas	3,723	0	65	over 2 years ago	143	November 13, 2023	34	apache-2.0	Python
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
quadratichq/quadratic	2,485	0	0	over 2 years ago	0		124	mit	Rust
Quadratic \| Data Science Spreadsheet with Python & SQL
thenaturalist/awesome-business-intelligence	1,862	0	0	over 2 years ago	0		11	mit
Actively curated list of awesome BI tools. PRs welcome!
DAGWorks-Inc/hamilton	1,139	0	2	over 2 years ago	116	December 05, 2023	136	bsd-3-clause-clear	Jupyter Notebook
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.
AlexIoannides/pyspark-example-project	1,034	0	0	over 3 years ago	0		11		Python
Example project implementing best practices for PySpark ETL jobs and applications.
stitchfix/hamilton	877	0	0	about 3 years ago	10	October 23, 2022	12	bsd-3-clause-clear	Python
A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton

Alternatives To TrainingByPackt/Data-Wrangling-with-Python

Select To Compare

apache/airflow ⭐ 33,219

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

dependent packages 320 total releases 169 most recent commit over 2 years ago downloads badge

dagster-io/dagster ⭐ 9,467

An orchestration platform for the development, production, and observation of data assets.

dependent packages 133 total releases 585 most recent commit over 2 years ago downloads badge

mage-ai/mage-ai ⭐ 6,324

🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.

dependent packages 0 total releases 314 most recent commit over 2 years ago downloads badge

orchest/orchest ⭐ 3,876

Build data pipelines, the easy way 🛠️

dependent packages 0 total releases 19 most recent commit about 3 years ago downloads badge

aws/aws-sdk-pandas ⭐ 3,723

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

dependent packages 65 total releases 143 most recent commit over 2 years ago downloads badge

quadratichq/quadratic ⭐ 2,485

Quadratic | Data Science Spreadsheet with Python & SQL

dependent packages 0 total releases 0 most recent commit over 2 years ago

thenaturalist/awesome-business-intelligence ⭐ 1,862

Actively curated list of awesome BI tools. PRs welcome!

dependent packages 0 total releases 0 most recent commit over 2 years ago

DAGWorks-Inc/hamilton ⭐ 1,139

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.

dependent packages 2 total releases 116 most recent commit over 2 years ago downloads badge

AlexIoannides/pyspark-example-project ⭐ 1,034

Example project implementing best practices for PySpark ETL jobs and applications.

dependent packages 0 total releases 0 most recent commit over 3 years ago

stitchfix/hamilton ⭐ 877

A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton

dependent packages 0 total releases 10 most recent commit about 3 years ago

Suggest An Alternative To Data-Wrangling-with-Python

Alternative Project Comparisons

TrainingByPackt/Data-Wrangling-with-Python vs Airflow

TrainingByPackt/Data-Wrangling-with-Python vs Dagster

TrainingByPackt/Data-Wrangling-with-Python vs Mage Ai

TrainingByPackt/Data-Wrangling-with-Python vs Orchest

TrainingByPackt/Data-Wrangling-with-Python vs Aws Sdk Pandas

TrainingByPackt/Data-Wrangling-with-Python vs Quadratic

TrainingByPackt/Data-Wrangling-with-Python vs Awesome Business Intelligence

TrainingByPackt/Data-Wrangling-with-Python vs Hamilton

TrainingByPackt/Data-Wrangling-with-Python vs Pyspark Example Project

TrainingByPackt/Data-Wrangling-with-Python vs Hamilton

Popular Etl Projects

pingcap/tidb⭐ 35,604

TiDB is an open-source, cloud-native, distributed, MySQL-Compatible database for elastic scale and real-time analytics. Try AI-powered Chat2Query free at : https://tidbcloud.com/free-trial

airbytehq/airbyte⭐ 12,918

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

apache/doris⭐ 10,666

Apache Doris is an easy-to-use, high performance and unified analytics database.

pentaho/pentaho-kettle⭐ 7,194

Pentaho Data Integration ( ETL ) a.k.a Kettle

benthosdev/benthos⭐ 7,051

Fancy stream processing made operationally mundane

Popular Data Science Projects

microsoft/ML-For-Beginners⭐ 63,698

12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

keras-team/keras⭐ 60,198

Deep Learning for humans

scikit-learn/scikit-learn⭐ 57,160

scikit-learn: machine learning in Python

apache/superset⭐ 56,358

Apache Superset is a Data Visualization and Data Exploration Platform

pandas-dev/pandas⭐ 41,008

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Popular Data Processing Categories

Jupyter Notebook

Dataset

Sql

Validation

Pipeline

Translation

Data Science

Classification

Transaction

Scraper