Spark2 Etl Examples

A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0
Alternatives To Spark2 Etl Examples
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Spark Jobserver2,837
4 months ago110otherScala
REST job server for Apache Spark
Pkpmspark697
4 years agoScala
awesome 三维数据挖掘 数据分析 & 推荐
Metorikku536
a year ago126February 27, 202365mitScala
A simplified, lightweight ETL Framework based on Apache Spark
Spark Solr440212a year ago102June 29, 202357apache-2.0Scala
Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.
Spark Fast Tests38510a year ago8April 27, 202228mitScala
Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)
Connectors383
9 months ago5December 06, 2022apache-2.0Java
This library allows Scala and Java-based projects (including Apache Flink, Apache Hive, Apache Beam, and PrestoDB) to read from and write to Delta Lake.
Spark Jobserver348
7 years ago50otherScala
REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver/spark-jobserver. This fork now serves as a semi-private repo for Ooyala.
Sparklint293
6 years ago9February 21, 201816apache-2.0Scala
A tool for monitoring and tuning Spark jobs for efficiency.
Transport288173 months ago27October 17, 202334bsd-2-clauseJava
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Apache Hive, and Presto.
Sagemaker Spark285
27 months ago36August 26, 202234apache-2.0Scala
A Spark library for Amazon SageMaker.
Alternatives To Spark2 Etl Examples
Select To Compare


Alternative Project Comparisons
Popular Spark Projects
Popular Jar Projects
Popular Data Processing Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Scala
Spark
Jar
Hive
Hdfs