Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for spark cassandra
cassandra
x
spark
x
148 search results found
Pipeline
⭐
4,158
PipelineAI
Zio Quill
⭐
2,135
Compile-time Language Integrated Queries for Scala
Spark Cassandra Connector
⭐
1,929
DataStax Connector for Apache Spark to Apache Cassandra
Elassandra
⭐
1,633
Elassandra = Elasticsearch + Apache Cassandra
Killrweather
⭐
1,174
KillrWeather is a reference application (work in progress) showing how to easily integrate streaming and batch data processing with Apache Spark Streaming, Apache Cassandra, Apache Kafka and Akka for fast, streaming computations on time series data in asynchronous event-driven environments.
Dockerfiles
⭐
1,171
50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian, Fedora, Ubuntu, Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak
Reference Apps
⭐
615
Spark reference applications
Freestyle
⭐
615
A cohesive & pragmatic framework of FP centric Scala libraries
Cassandra Lucene Index
⭐
574
Lucene based secondary indexes for Cassandra
Data Engineering Interview Questions
⭐
554
More than 2000+ Data engineer interview questions.
Data Engineering Projects
⭐
322
Personal Data Engineering Projects
Decision
⭐
311
Powered by Spark Streaming & Siddhi
Koober
⭐
301
Akka Analytics
⭐
281
Large-scale event processing with Akka Persistence and Apache Spark
Gimel
⭐
230
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Zio Protoquill
⭐
192
Quill for Scala 3
Zipkin Dependencies
⭐
173
Spark job that aggregates zipkin spans for use in the UI
Dcos Commons
⭐
162
DC/OS SDK is a collection of tools, libraries, and documentation for easy integration of technologies such as Kafka, Cassandra, HDFS, Spark, and TensorFlow with DC/OS.
Spark Structured Streaming Examples
⭐
153
Spark Structured Streaming / Kafka / Cassandra / Elastic
Lambda Arch
⭐
151
A full big data pipeline (Lambda Architecture) with Spark, Kafka, HDFS and Cassandra.
Iot Traffic Monitor
⭐
123
Spark Dependencies
⭐
112
Spark job for dependency links
Logisland
⭐
106
Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Learning Spark
⭐
94
Practical examples of using Apache Spark in several different use cases
Kafka Sparkstreaming Cassandra
⭐
86
Docker container for Kafka - Spark Streaming - Cassandra
Pyspark Cassandra
⭐
81
PySpark Cassandra brings back the fun in working with Cassandra data in PySpark.
Data Engineering Nanodegree
⭐
76
Projects done in the Data Engineering Nanodegree by Udacity.com
Cassandra Count
⭐
73
Count rows in Cassandra Table
Google Finance Stock Data Analysis
⭐
69
Developed a high performance data processing platform using Apache Kafka, Apache Cassandra, and Apache Spark to analyze stock price and related stock tweets sentiment.
Pipeline
⭐
68
Complete Pipeline Training at Big Data Scala By the Bay
Pyspark Cassandra
⭐
67
pyspark-cassandra is a Python port of the awesome @datastax Spark Cassandra connector. Compatible w/ Spark 2.0, 2.1, 2.2, 2.3 and 2.4
Spark Structured Streaming
⭐
66
Spark structured streaming with Kafka data source and writing to Cassandra
User Guide Smack
⭐
66
[Cloudframeworks]SMACK Big Data Architecture - user guide / [云框架]SMACK大数据架构-用户指南
Spark Kafka Cassandra Applying Lambda Architecture
⭐
64
Lambda Arch Spark
⭐
63
Busfloatingdata
⭐
59
Showcase for IoT Platform Blog
Data Processing Pipeline
⭐
59
Real-Time Data Processing Pipeline & Visualization with Docker, Spark, Kafka and Cassandra
Datapipeline
⭐
57
Real time stock data pipeline --play with Kafka, Cassandra, Spark, Redis, Node.js, Zookeeper
Sparkbuildexamples
⭐
56
Example projects for using Spark and Cassandra With DSE Analytics
Bestconf
⭐
53
A tool automatically improving the performance of large-scale systems by finding better configuration settings
Calliope Release
⭐
51
The repository for the G.A. codebase for Calliope. For the E.A. codebase request an early access to the development repo, http://tuplejump.github.io/calliope/
Datastax Spark Streaming Demo
⭐
43
Counting Twitter hashtags using Spark Streaming and Cassandra
Actitracker Cassandra Spark
⭐
43
Activity recognition using Spark, Cassandra and MLlib
Trembita
⭐
43
Model complex data transformation pipelines easily
Udacity Data Engineering
⭐
42
Udacity Data Engineering Nano Degree (DEND)
Datasource Receiver
⭐
41
Spark Receiver for SQL or NoSQL Databases like Cassandra, MongoDB, Elasticsearch or JDBC
Devops
⭐
40
DevOps
Gemini
⭐
38
Advanced similarity and duplicate source code at scale.
Awesome Cassandra
⭐
37
awesome cassandra resources
Killranalytics
⭐
35
Open source analytics platform powered by Apache Cassandra, Spark, and Kafka
Styx
⭐
34
Streaming Analytics platform, built with Apache Flink and Kafka
Tweet Driven Comparable Companies
⭐
33
Analyzing Twitter real time feed with Spark Streaming
Scylla Migrator
⭐
32
Migrate data extract using Spark to Scylla, normally from Cassandra
Cassandra Spark Demo
⭐
31
Demo for the Spark Cassandra connector
Dockerfiles
⭐
31
Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )
Vagrant Cassandra Spark
⭐
28
Data Engineering Nanodegree
⭐
27
Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift, Data Lake with Spark and Data Pipeline with Airflow.
Data Engineer Nanodegree Projects Udacity
⭐
27
Projects done in the Data Engineer Nanodegree Program by Udacity.com
Spark Cassandra
⭐
26
An Akka Extension for easy integration of spark and cassandra in Akka micro services.
Mesos Workshop
⭐
25
Dockerized environment for Mesos, walkthrough for use cases and code examples of Mesos Framework
Spark Cassandra Example
⭐
25
Example usage of spark cassandra connector
Cassandra.realtime
⭐
25
Different ways to process data into Cassandra in realtime with technologies such as Kafka, Spark, Akka, Flink
Spark Training
⭐
24
Spark Training Exercises
Spark Cassandra Stress
⭐
23
A tool for testing the DataStax Spark Connector against Apache Cassandra or DSE
Spark Workshop
⭐
22
Code examples and docker environment for Spark
Fastdata Cluster
⭐
22
Fast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)
Bigdatariver
⭐
22
Simple demo implementation of Lambda and Kappa architectures using Python, Docker, Kafka, Spark and Cassandra
Spark Cassandra Collabfiltering
⭐
22
Collaborative filtering with MLLib on Spark based on data in Cassandra
Sample Kafkasparkcassandra
⭐
21
Introductory sample scala app using Apache Spark Streaming to accept data from Kafka and write a summary to Cassandra.
Cassandra Spark Analytics
⭐
21
Supercharge your analysis of Cassandra data with Apache Spark
Idocuments
⭐
20
收集与 Java 开发相关的文档,包括基础系统服务(大数据、流计算、NoSQL 等)、专业名词、jar 包、开发工具等文档,持续更新……
Spark Cassandra Csv
⭐
20
An example stand alone program to import CSV files into Apache Cassandra using Apache Spark
Spark Cassandra Bulkreader
⭐
19
Spark-Cassandra Bulk Reader CASSANDRA-16222
Sparkassandra Dockerized
⭐
18
How to setup a cluster with Spark 1.3 + Cassandra 2.1 using Docker ?
Openblockchain
⭐
17
{START HERE} docker engine to roll your own openblockchain
Zeppelin Spark Cassandra Demo
⭐
17
A demo explaining how to use Zeppelin notebook to access Apache Cassandra data via Apache Spark or CQL language
Cassowary
⭐
16
Hive storage handler for Cassandra and Shark that reads the SSTables directly
Rtfap2
⭐
16
Real-Time Fraud Analysis and Prevention Using Kafka, Spark and Cassandra with a nodejs ReST Server
Real Time Stock Analyzer
⭐
16
Bigdata Pipeline
Visualizingstreamingdata
⭐
15
Visuallizing a real time streaming data on web app using node.js socket.io and Apache Kafka
Graphsense Transformation
⭐
15
GraphSense Transformation Pipeline
Datastax Eventsourcing
⭐
14
Small code example of how to use Cassandra/DSE for event sourcing and replay
Project Fortis Pipeline
⭐
14
Project Fortis is a data ingestion, analysis and visualization pipeline.
Big Data Course
⭐
14
Practice course on Big Data
Logeventsprocessingspark
⭐
13
real time log event processing using spark, kafka & cassandra
Kafka Spark Streaming
⭐
12
An example project for Kafka and Spark Streaming integration
Big Data Types
⭐
12
A library to transform Scala product types and Schemas from different systems into other Schemas. Any implemented type gets automatically methods to convert it into the rest of the types and vice versa. For example, an Spark Schema can be transformed into a BigQuery table.
Twitter Realtime Sentiment
⭐
12
Spark/Cassandra/Akka combo to visualize a cloud of words using d3.js
Scarff
⭐
12
SCARFF (SCAlable Real-time Frauds Finder) is a framework which enables credit card fraud detection.
Spark Tests
⭐
12
source code for hashmade.fr/InfoQ articles
Spark Cassandra Integrations
⭐
11
This is a repository containing Apache Spark and Apache Cassandra integration code samples written in Scala.
Spark_streaming_aggregation
⭐
11
Event aggregation with spark streaming
Artmosphere
⭐
11
Data Engineering Project at Insight
Oracle_to_cassandra
⭐
11
A description of the processes and techniques required to migrate a relational schema to a Cassandra database using Spark and SparkSQL
Structured Streaming Application
⭐
10
Structured Streaming is a reference application showing how to easily integrate structured streaming Apache Spark Structured Streaming, Apache Cassandra and Apache Kafka for fast, structured streaming computations on data.
Big Data
⭐
10
Big data technologies
Spark Boilerplate
⭐
10
A boilerplate for spark projects with docker support for local development and scripts for emr support.
Spark Cass
⭐
10
Spark and Cassandra docker image and test files
Deployer
⭐
10
Deploy Apache Spark & Cassandra in a Docker Swarm cluster
Kanalony
⭐
9
Kaltura's next generation Analytics solution based on Spark, Cassandra and Kafka
Related Searches
Scala Spark (3,279)
Python Spark (2,053)
Java Spark (1,587)
Apache Spark (1,207)
Spark Hadoop (1,188)
Jupyter Notebook Spark (1,151)
Spark Kafka (985)
Java Cassandra (916)
Spark Streaming (817)
Spark Pyspark (812)
1-100 of 148 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.