Spark Data Sources

Developing Spark External Data Sources using the V2 API
Alternatives To Spark Data Sources
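Iceberg
  Stars: 5,179 | Most Recent Commit: 3 months ago | Total Releases: 3 | Latest Release: October 29, 2022 | Open Issues: 1,485 | License: apache-2.0 | Language: Java
  Apache Iceberg

Nessie
  Stars: 762 | Downloads, Repos Using This, Packages Using This, Most Recent Commit: 323 months ago | Total Releases: 40 | Latest Release: November 21, 2023 | Open Issues: 110 | License: apache-2.0 | Language: Java
  Nessie: Transactional Catalog for Data Lakes with Git-like semantics

Iceberg
  Stars: 409 | Most Recent Commit: 3 years ago | Open Issues: 27 | License: apache-2.0 | Language: Java
  Iceberg is a table format for large, slow-moving tabular data

Connectors
  Stars: 383 | Most Recent Commit: 9 months ago | Total Releases: 5 | Latest Release: December 06, 2022 | License: apache-2.0 | Language: Java
  This library allows Scala and Java-based projects (including Apache Flink, Apache Hive, Apache Beam, and PrestoDB) to read from and write to Delta Lake.

Parquet Index
  Stars: 113 | Most Recent Commit: 3 years ago | Open Issues: 16 | License: apache-2.0 | Language: Scala
  Spark SQL index for Parquet tables

Tpch Spark
  Stars: 91 | Most Recent Commit: 3 months ago | Open Issues: 1 | License: mit | Language: C
  TPC-H queries in Apache Spark SQL using native DataFrames API

Spark Dynamodb
  Stars: 90 | Most Recent Commit: 3 years ago | Total Releases: 12 | Latest Release: March 21, 2018 | Open Issues: 17 | License: apache-2.0 | Language: Scala
  DynamoDB data source for Apache Spark

Flowman
  Stars: 85 | Downloads, Repos Using This, Packages Using This, Most Recent Commit: 244 months ago | Total Releases: 65 | Latest Release: October 16, 2023 | Open Issues: 55 | License: apache-2.0 | Language: Scala
  Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines.

Spark Llap
  Stars: 82 | Most Recent Commit: 4 years ago | Open Issues: 31 | License: apache-2.0 | Language: Java

Spark Acid
  Stars: 79 | Most Recent Commit: 3 years ago | Open Issues: 19 | License: apache-2.0 | Language: Scala
  ACID Data Source for Apache Spark based on Hive ACID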