Project Name | Description | Stars | Downloads / Repos Using This / Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|
Iceberg | Apache Iceberg | 4,339 | | 17 hours ago | 4 | May 23, 2022 | 1,356 | apache-2.0 | Java |
Airpal | Web UI for PrestoDB. | 2,758 | | 2 years ago | | | 87 | apache-2.0 | Java |
Nessie | Nessie: Transactional Catalog for Data Lakes with Git-like semantics | 638 | 3 | 17 hours ago | 27 | May 13, 2022 | 131 | apache-2.0 | Java |
Sql Metadata | Uses tokenized query returned by python-sqlparse and generates query metadata | 552 | 21 | 12 days ago | 28 | June 17, 2022 | 54 | mit | Python |
Connectors | This library allows Scala and Java-based projects (including Apache Flink, Apache Hive, Apache Beam, and PrestoDB) to read from and write to Delta Lake. | 377 | | 4 days ago | 4 | July 28, 2022 | 79 | apache-2.0 | Java |
Sql Scripts | 100+ SQL Scripts - PostgreSQL, MySQL, Google BigQuery, MariaDB, AWS Athena. DevOps / DBA / Analytics / performance engineering. Google BigQuery ML machine learning classification. | 268 | | 4 days ago | | | | mit | Shell |
Reair | ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses. | 254 | | 4 years ago | | | 10 | apache-2.0 | Java |
Tibigdata | TiDB connectors for Flink/Hive/Presto | 201 | | 6 days ago | 2 | September 12, 2021 | 24 | apache-2.0 | Java |
Amundsendatabuilder | Data ingestion library for Amundsen to build graph and search index | 196 | | a year ago | | | 1 | apache-2.0 | Python |
Sqlsubmit | A Flink-based sqlSubmit program | 133 | | a month ago | | | 7 | gpl-3.0 | Java |

We are building connectors to bring Delta Lake to popular big-data engines outside Apache Spark (e.g., Apache Hive, Presto, Apache Flink) and also to common reporting tools like Microsoft Power BI.
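This is the repository for Delta Lake Connectors. It includes Delta Standalone, the Hive Connector, the Flink/Delta Connector, sql-delta-import, and the connector for Microsoft Power BI.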
Please refer to the main Delta Lake repository if you want to learn more about the Delta Lake project.
Delta Standalone, formerly known as the Delta Standalone Reader (DSR), is a JVM library to read and write Delta tables. Unlike delta-io/delta, this project doesn't use Spark to read or write tables and it has only a few transitive dependencies. It can be used by any application that cannot use a Spark cluster.
To compile:
build/sbt standalone/compile
To run the tests:
build/sbt standalone/test
To publish the JAR to the local Maven repository:
build/sbt standaloneCosmetic/publishM2
See Delta Standalone for detailed documentation.
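One reason a library can read Delta tables without Spark is that a Delta table is a directory of data files plus a transaction log of commit files under `_delta_log`, each named with a zero-padded 20-digit version. The sketch below uses only the Java standard library to list those commit versions. It is an illustration of the on-disk layout, not how Delta Standalone is implemented and not a substitute for its API; the class and method names (`DeltaLogVersions`, `parseVersion`, `listVersions`) are hypothetical.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.OptionalLong;
import java.util.regex.Matcher;
import java.util.regex.Pattern;
import java.util.stream.Collectors;
import java.util.stream.Stream;

/** Hypothetical helper for illustration only; not part of Delta Standalone. */
public class DeltaLogVersions {

    // Commit files are named <version>.json, with the version zero-padded to 20 digits.
    private static final Pattern COMMIT_FILE = Pattern.compile("(\\d{20})\\.json");

    /** Returns the version encoded in a log file name, or empty if it is not a commit file. */
    static OptionalLong parseVersion(String fileName) {
        Matcher m = COMMIT_FILE.matcher(fileName);
        return m.matches()
                ? OptionalLong.of(Long.parseLong(m.group(1)))
                : OptionalLong.empty();
    }

    /** Lists the commit versions found under <tableRoot>/_delta_log in ascending order. */
    static List<Long> listVersions(Path tableRoot) throws IOException {
        try (Stream<Path> files = Files.list(tableRoot.resolve("_delta_log"))) {
            return files
                    .map(p -> parseVersion(p.getFileName().toString()))
                    .filter(OptionalLong::isPresent)
                    .map(OptionalLong::getAsLong)
                    .sorted()
                    .collect(Collectors.toList());
        }
    }

    public static void main(String[] args) throws IOException {
        // Build a throwaway directory that mimics a table with two commits.
        Path table = Files.createTempDirectory("delta-table");
        Path log = Files.createDirectory(table.resolve("_delta_log"));
        Files.createFile(log.resolve("00000000000000000000.json"));
        Files.createFile(log.resolve("00000000000000000001.json"));
        Files.createFile(log.resolve("_last_checkpoint")); // skipped: not a commit file
        System.out.println(listVersions(table)); // prints [0, 1]
    }
}
```

For real workloads, use the Delta Standalone API rather than parsing the log by hand: it also handles checkpoints, concurrent writers, and the rest of the table protocol.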
Read Delta tables directly from Apache Hive using the Hive Connector. See the dedicated README.md for more details.
Use the Flink/Delta Connector to read and write Delta tables from Apache Flink applications. The connector includes a sink for writing to Delta tables from Apache Flink and a source for reading Delta tables using Apache Flink (the source is still in progress). See the dedicated README.md for more details.
sql-delta-import allows for importing data from a JDBC source into a Delta table.
The connector for Microsoft Power BI is a custom Power Query function that lets you read a Delta table from any file-based data source supported by Microsoft Power BI. See the dedicated README.md for details.
We use GitHub Issues to track community-reported issues. You can also contact the community to get answers.
We welcome contributions to the Delta Lake Connectors repository. We use GitHub Pull Requests for accepting changes.
The Delta Lake community uses two communication channels:
Public Slack Channel
Public Mailing list
Before running or debugging standalone tests in IntelliJ, run all standalone tests using SBT. This helps IntelliJ recognize the golden tables as class resources.