Awesome Open Source
The Top 32 Streaming Data Open Source Projects
Awesome Bigdata
⭐
9,614
A curated list of awesome big data frameworks, resources and other awesomeness.
Benthos
⭐
2,790
Declarative streaming ETL for mundane tasks, written in Go
Miller
⭐
2,626
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Smart_open
⭐
1,891
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Pravega
⭐
1,359
Pravega: streaming as a new software-defined storage primitive
River
⭐
1,280
🌊 Online machine learning in Python
Trill
⭐
1,105
Trill is a single-node query processor for temporal or streaming data.
Streamz
⭐
895
Real-time stream processing for Python
Go Streams
⭐
579
A lightweight stream processing library for Go
Sparta
⭐
512
Real Time Analytics and Data Pipelines based on Spark Streaming
Onlinestats.jl
⭐
474
Single-pass algorithms for statistics
Scikit Multiflow
⭐
470
A machine learning package for streaming data in Python. The other ancestor of River.
Awesome Kafka
⭐
382
A list about Apache Kafka
Swim
⭐
355
Distributed software platform for building stateful, massively real-time streaming applications.
Rrcf
⭐
276
🌲 Implementation of the Robust Random Cut Forest algorithm for anomaly detection on streams
Cloudflow
⭐
266
Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
Data Accelerator
⭐
247
Data Accelerator for Apache Spark simplifies onboarding to streaming of big data. It offers a rich, easy-to-use experience for creating, editing, and managing Spark jobs on Azure HDInsight or Databricks while enabling the full power of the Spark engine.
Shioaji
⭐
180
Shioaji: an all-new cross-platform API for securities trading
Kafka Streams In Action
⭐
158
Source code for the Kafka Streams in Action Book
Rangeless
⭐
146
C++ LINQ-like library of higher-order functions for data manipulation
Real Time Sentiment Tracking On Twitter For Brand Improvement And Trend Recognition
⭐
117
A real-time interactive web app based on data pipelines using streaming Twitter data, automated sentiment analysis, and MySQL and PostgreSQL databases (deployed on Heroku)
Toolbox
⭐
104
A Java Toolbox for Scalable Probabilistic Machine Learning
Gsf
⭐
103
Grid Solutions Framework
Axway Amplify Streams Js
⭐
79
AMPLIFY Streams JavaScript package containing SDK, documentation and sample applications
Pysad
⭐
74
Streaming Anomaly Detection Framework in Python (Outlier Detection for Streaming Data)
Tractor
⭐
73
structured concurrent "actors"
Optbinning
⭐
61
Optimal binning: monotonic binning with constraints. Supports batch and stream optimal binning
Machine
⭐
61
Machine is a workflow/pipeline library for processing data
Nsdb
⭐
48
Natural Series Database
Flexible Clustering
⭐
36
Clustering for arbitrary data and dissimilarity function
Saber
⭐
34
Window-Based Hybrid CPU/GPU Stream Processing Engine
Go Mesh
⭐
20
Realtime data exchange platform for Smart Cities
1-32 of 32 projects