Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Smart_open | 3,065 | a month ago | 94 | mit | Python | |||||
Utils for streaming large files (S3, HDFS, gzip, bz2...) | ||||||||||
Sparta | 526 | 4 years ago | 9 | apache-2.0 | Scala | |||||
Real Time Analytics and Data Pipelines based on Spark Streaming | ||||||||||
Kafka Connect Hdfs | 473 | 4 months ago | 153 | other | Java | |||||
Kafka Connect HDFS connector | ||||||||||
Storagetapper | 269 | 2 years ago | 4 | November 19, 2021 | 21 | mit | Go | |||
StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service | ||||||||||
Megfile | 99 | 2 | 3 months ago | 68 | November 27, 2023 | 5 | apache-2.0 | Python | ||
Megvii FILE Library - Working with Files in Python same as the standard library | ||||||||||
Spdt | 46 | 7 years ago | 1 | mit | Scala | |||||
Streaming Parallel Decision Tree | ||||||||||
Stream To Hdfs | 27 | 14 years ago | 1 | |||||||
A simple utility for streaming stdin to a file in HDFS | ||||||||||
Wasp | 25 | 15 | 7 months ago | 25 | September 14, 2023 | 4 | other | Scala | ||
WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you. | ||||||||||
Streamingstopgraceful | 23 | 6 years ago | 1 | Scala | ||||||
Example to show how to stop the Spark Streaming Application Gracefully | ||||||||||
Spark_log_data | 21 | 8 years ago | mit | Scala | ||||||
Flume-to-Spark-Streaming Log Parser |