Big Data Rosetta Code

Code snippets for solving common big data problems on various platforms. Inspired by Rosetta Code.
Alternatives To Big Data Rosetta Code
Beam
Stars: 7,355; most recent commit: 3 months ago; total releases: 568; latest release: November 13, 2023; open issues: 4,327; license: apache-2.0; language: Java
Apache Beam is a unified programming model for Batch and Streaming data processing.

Pachyderm
Stars: 6,035; most recent commit: 3 months ago; total releases: 613; latest release: December 04, 2023; open issues: 897; license: apache-2.0; language: Go
Data-Centric Pipelines and Data Versioning

Dataflowjavasdk
Stars: 853; most recent commit: 3 years ago; total releases: 38; latest release: June 26, 2018; open issues: 54
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.

Tez
Stars: 446; most recent commit: 3 months ago; open issues: 67; license: apache-2.0; language: Java
Apache Tez

Smooks
Stars: 377; most recent commit: 3 months ago; total releases: 5; latest release: June 19, 2023; open issues: 19; license: other; language: Java
Extensible data integration Java framework for building XML and non-XML fragment-based applications

Dataengineering Roadmap
Stars: 297; most recent commit: 4 months ago; license: mit
Yet another repository with basic concepts, technical challenges, and resources on data engineering in Spanish 🧙✨

Big Data Rosetta Code
Stars: 283; most recent commit: 5 months ago; open issues: 5; license: apache-2.0; language: Scala
Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code

Shifu
Stars: 235; most recent commit: a year ago; total releases: 9; latest release: April 03, 2019; open issues: 237; license: apache-2.0; language: Java
An end-to-end machine learning and data mining framework on Hadoop

Mobydq
Stars: 175; most recent commit: 2 years ago; open issues: 5; license: apache-2.0; language: Vue
🐳 Tool to automate data quality checks on data pipelines

Setl
Stars: 173; most recent commit: 5 months ago; total releases: 4; latest release: August 21, 2020; open issues: 5; license: apache-2.0; language: Scala
A simple Spark-powered ETL framework that just works 🍺
Categories: Scala, Pipeline, Spark, Spotify, Big Data