Project Name	Stars	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
Data Engineering Zoomcamp	19,461	5 months ago			27		Jupyter Notebook
Free Data Engineering course!
Bigdata Notes	14,872	6 months ago			39		Java
A beginner's guide to big data ⭐
Flink Learning	13,801	9 months ago			8	apache-2.0	Java
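Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, internals, hands-on practice, performance tuning, and source-code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and more, plus case studies of large production Flink projects (PV/UV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). Readers are welcome to support my column "Big Data Real-Time Computing Engine: Flink in Action and Performance Optimization" (《大数据实时计算引擎 Flink 实战与性能优化》).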
Technology Talk	13,579	8 months ago			6
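[Big-tech interview column] A technical guide for Java programmers, covering interview questions, system architecture, career tips, mainstream middleware, and more, to help you become a stronger engineer.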
Cookbook	12,557	6 months ago			111	apache-2.0
The Data Engineering Cookbook
God Of Bigdata	8,483	a year ago			3
Focused on big data learning and interviews; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...
Risingwave	5,799	5 months ago	14	December 07, 2023	1,010	apache-2.0	Rust
The distributed streaming database. Engineered to offer the simplest and most cost-efficient way for stream processing and management.
Pipeline	4,158	2 years ago	85	July 18, 2017	1	apache-2.0	Jsonnet
PipelineAI
Bigdataguide	2,355	6 months ago					Java
Learn big data from scratch, with learning videos and interview materials for each stage of study.
Szt Bigdata	2,055	6 months ago			13	other	Scala
Shenzhen Metro big data passenger flow analysis system 🚇🚄🌟

Alternatives To Spark Kafka

Alternative Project Comparisons

Spark Kafka vs Data Engineering Zoomcamp

Spark Kafka vs Bigdata Notes

Spark Kafka vs Flink Learning

Spark Kafka vs Technology Talk

Spark Kafka vs Cookbook

Spark Kafka vs God Of Bigdata

Spark Kafka vs Risingwave

Spark Kafka vs Pipeline

Spark Kafka vs Bigdataguide

Spark Kafka vs Szt Bigdata

Popular Kafka Projects

Javafamily ⭐ 34,620

[Java interviews + Java study guide] Covers the core knowledge that most Java programmers need to master.

most recent commit 8 months ago

Canal ⭐ 27,268

Alibaba's MySQL binlog incremental subscription and consumption component

dependent packages 22 | total releases 33 | latest release October 09, 2023 | most recent commit 5 months ago

Kafka ⭐ 26,687

Mirror of Apache Kafka

most recent commit 5 months ago

Springboot Labs ⭐ 18,521

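A repository covering six columns: Spring Boot 2.X, Spring Cloud, Spring Cloud Alibaba, Dubbo, distributed message queues, and distributed transactions. If you find it useful, please click Star in the top-right corner. Thanks, 1024.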

most recent commit 2 months ago

Thingsboard ⭐ 15,050

Open-source IoT Platform - Device management, data collection, processing and visualization.

most recent commit 5 months ago

Popular Spark Projects

Spark ⭐ 37,661

Apache Spark - A unified analytics engine for large-scale data processing

dependent packages 939 | total releases 46 | latest release May 09, 2021 | most recent commit 5 months ago

Data Science Ipython Notebooks ⭐ 25,668

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

most recent commit 9 months ago

Redash ⭐ 24,479

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

dependent packages 3 | total releases 2 | latest release May 05, 2020 | most recent commit 5 months ago

Docker_practice ⭐ 23,279

Learn and understand Docker&Container technologies, with real DevOps practice!

total releases 9 | latest release December 01, 2021 | most recent commit 6 months ago

Chuanhuchatgpt ⭐ 14,595

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

most recent commit 3 months ago

Popular Data Processing Categories

Scala

Streaming

Time

Spark

Kafka

Spark Streaming

Downloads, Dependent Repos, Dependent Packages, Total Releases, Latest Releases data powered by Libraries.io.