Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Bigdata Notes | 14,872 | | | | 4 months ago | | | 39 | | Java |
A beginner's guide to big data :star: | ||||||||||
Flink Learning | 13,801 | | | | 7 months ago | | | 8 | apache-2.0 | Java |
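flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, internals, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and more, plus case studies of large-scale Flink production projects (PV/UV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). You are welcome to support my column "Big Data Real-Time Computing Engine Flink: Practice and Performance Optimization". | ||||||||||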
God Of Bigdata | 8,483 | | | | 9 months ago | | | 3 | | |
Focused on big data learning and interview preparation; start your path to big data mastery. Flink/Spark/Hadoop/Hbase/Hive... | ||||||||||
Zeppelin | 6,259 | 32 | 31 | 15 days ago | 2 | June 21, 2017 | 160 | apache-2.0 | Java | |
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more. | ||||||||||
Risingwave | 5,799 | | | | 3 months ago | 14 | December 07, 2023 | 1,010 | apache-2.0 | Rust |
The distributed streaming database. Engineered to offer the simplest and most cost-efficient way for stream processing and management. | ||||||||||
Iceberg | 5,179 | | | | 3 months ago | 3 | October 29, 2022 | 1,485 | apache-2.0 | Java |
Apache Iceberg | ||||||||||
Dataspherestudio | 2,860 | 39 | 3 months ago | 7 | August 07, 2023 | 360 | apache-2.0 | Java | ||
DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling. | ||||||||||
Analytics Zoo | 2,592 | 3 | 4 months ago | 1 | July 29, 2022 | 533 | apache-2.0 | Jupyter Notebook | ||
Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray | ||||||||||
Bigdataguide | 2,355 | | | | 4 months ago | | | | | Java |
Big data learning: learn big data from scratch, with study videos for each stage of learning and interview materials | ||||||||||
Lakesoul | 2,248 | 1 | 3 months ago | 7 | October 12, 2023 | 15 | apache-2.0 | Java | ||
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications. |