Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Streamx | 95 | | | | 5 years ago | | | 26 | apache-2.0 | Java |
kafka-connect-s3: Ingest data from Kafka into object stores (S3) | ||||||||||
Spark | 55 | | | | 2 years ago | | | 7 | | Scala |
Apache Spark is a fast, in-memory data processing engine with elegant and expressive development APIs that let data workers efficiently execute streaming, machine learning, or SQL workloads requiring fast iterative access to datasets. This project contains sample Spark programs written in Scala. | ||||||||||
Avro Json | 29 | | | | 4 years ago | | | | | Java |
Utilities for converting between JSON and Avro records via Hadoop streaming or Hive. | ||||||||||
Iow Hadoop Streaming | 26 | | | | 7 years ago | | | | apache-2.0 | Java |
A set of Hadoop input/output formats for use with Hadoop streaming | ||||||||||
Wasp | 25 | 15 | 7 months ago | 25 | September 14, 2023 | 4 | other | Scala | ||
WASP is a framework for building complex real-time big data applications. It relies on a kind of Kappa/Lambda architecture, mainly leveraging Kafka and Spark. If you need to ingest huge amounts of heterogeneous data and analyze them through complex pipelines, this is the framework for you. | ||||||||||
Spark_log_data | 21 | | | | 8 years ago | | | | mit | Scala |
Flume-to-Spark-Streaming Log Parser | ||||||||||
Streaming Data Platform | 19 | | | | 2 years ago | | | 1 | apache-2.0 | Java |
Spark Streaming Twitter | 7 | | | | 4 years ago | | | | | Jupyter Notebook |
Building a pipeline to process real-time data using Spark and MongoDB. |