Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for scala apache
apache
x
scala
x
310 search results found
Spark
⭐
37,661
Apache Spark - A unified analytics engine for large-scale data processing
Flink
⭐
22,747
Apache Flink
Bigdl
⭐
4,728
Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using bigdl-llm
Marathon
⭐
4,015
Deploy and manage containers (including Docker) on top of Apache Mesos at scale.
Tensorflowonspark
⭐
3,851
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
Spark Nlp
⭐
3,578
State of the Art Natural Language Processing
Scio
⭐
2,505
A Scala API for Apache Beam and Google Cloud Dataflow.
Awesome Streaming
⭐
2,447
a curated list of awesome streaming frameworks, applications, etc
Ballista
⭐
2,244
Distributed compute platform implemented in Rust, and powered by Apache Arrow.
Spark Cassandra Connector
⭐
1,929
DataStax Connector for Apache Spark to Apache Cassandra
Awesome Spark
⭐
1,461
A curated list of awesome Apache Spark packages and resources.
Carbondata
⭐
1,401
High performance data store solution
Griffin
⭐
1,063
Mirror of Apache griffin
Adam
⭐
966
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
Spark Scala Tutorial
⭐
922
A free tutorial for Apache Spark.
Tispark
⭐
872
TiSpark is built for running Apache Spark on top of TiDB/TiKV
Incubator Livy
⭐
840
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
Samza
⭐
792
Mirror of Apache Samza
Kafka Storm Starter
⭐
729
[PROJECT IS NO LONGER MAINTAINED] Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Incubator Toree
⭐
721
Mirror of Apache Toree (Incubating)
Spark Rapids
⭐
619
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
Reference Apps
⭐
615
Spark reference applications
Sparkmeasure
⭐
603
This is the development repository for sparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task and stage metrics data.
Sparklearning
⭐
573
Learning Apache spark,including code and data .Most part can run local.
Nussknacker
⭐
564
Low-code tool for automating actions on real time data | Stream processing for the users.
Spline
⭐
553
Data Lineage Tracking And Visualization Solution
Shc
⭐
484
The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.
Spark Training
⭐
365
Apache Spark training material
Graphx
⭐
353
Former GraphX development repository. GraphX has been merged into Apache Spark; please submit pull requests there.
Morpheus
⭐
335
Morpheus brings the leading graph query language, Cypher, onto the leading distributed processing platform, Spark.
Hyperspace
⭐
334
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Akka Persistence Cassandra
⭐
328
A replicated Akka Persistence journal backed by Apache Cassandra
Bahir
⭐
325
Mirror of Apache Bahir
Spark Standalone Cluster On Docker
⭐
311
Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. ⚡
Every Single Day I Tldr
⭐
311
A daily digest of the articles or videos I've found interesting, that I want to share with you.
Neo4j Spark Connector
⭐
300
Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
Sparkstreaming
⭐
253
Spark Streaming+Flume+Kafka+HBase+Hadoop+Zookeeper实现实时日志
S2graph
⭐
250
This code base is retained for historical interest only, please visit Apache Incubator Repo for latest one
Spark Indexedrdd
⭐
247
An efficient updatable key-value store for Apache Spark
Sql Spark Connector
⭐
242
Apache Spark Connector for SQL Server and Azure SQL
Succinct
⭐
239
Enabling queries on compressed data.
Azure Event Hubs Spark
⭐
225
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Flink Notes
⭐
223
flink学习笔记
Awesome Scala
⭐
197
A curated list of awesome Scala frameworks, libraries and software.
Spark Snowflake
⭐
196
Snowflake Data Source for Apache Spark.
Sparkrdma
⭐
191
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Flink Tensorflow
⭐
190
flink-tensorflow - TensorFlow support for Apache Flink
Spark Authorizer
⭐
158
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark
Docker Flink
⭐
157
Apache Flink docker image
Bigdata Playground
⭐
154
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Servicemix
⭐
153
Apache ServiceMix
Spark Metrics
⭐
150
Spark metrics related custom classes and sinks (e.g. Prometheus)
Dbscan On Spark
⭐
146
An implementation of DBSCAN runing on top of Apache Spark
Spookystuff
⭐
137
Scalable query engine for web scrapping/data mashup/acceptance QA, powered by Apache Spark
Spark Tsne
⭐
134
Distributed t-SNE via Apache Spark
Flink Web
⭐
133
Apache Flink Website
Flink Shaded
⭐
130
Apache Flink shaded artifacts repository
Example Spark Kafka
⭐
118
Apache Spark and Apache Kafka integration example
Drizzle Spark
⭐
113
Drizzle integration with Apache Spark
Spark Atlas Connector
⭐
112
A Spark Atlas connector to track data lineage in Apache Atlas
Crunch
⭐
100
Mirror of Apache Crunch (Incubating)
Learning Spark
⭐
94
Practical examples of using Apache Spark in several different use cases
Blog Spark Streaming Log Aggregation
⭐
91
Example of use of Spark Streaming with Kafka
Spark States
⭐
88
Custom state store providers for Apache Spark
Sparkcube
⭐
87
SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.
Predictionio Template Recommender
⭐
78
PredictionIO Recommendation Engine Template (Scala-based parallelized engine)
Docker Spark
⭐
77
🚢 Docker image for Apache Spark
Spark Examples
⭐
75
Apache Spark jobs such as Principal Coordinate Analysis.
Incubator Nlpcraft
⭐
75
Apache NLPCraft - API to convert natural language into actions.
Waimak
⭐
73
Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
Avocado
⭐
71
A Variant Caller, Distributed. Apache 2 licensed.
Scalaapacheaccesslogparser
⭐
71
An Apache access log parser written in Scala
Cleanframes
⭐
70
type-class based data cleansing library for Apache Spark SQL
Spark Lp
⭐
64
Distributed Linear Programming Solver on top of Apache Spark
Lambda Arch Spark
⭐
63
Spark Etl
⭐
62
Apache Spark based ETL Engine
Net.jgp.books.spark.ch01
⭐
61
Spark in Action, 2nd edition - chapter 1 - Introduction
Stellar Random Walk
⭐
61
Sparkinternals
⭐
61
Learning notes of Apache Spark source code
Spark
⭐
60
Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References
Emma
⭐
60
A quotation-based Scala DSL for scalable data analysis.
Ksql Jdbc Driver
⭐
59
JDBC driver for Apache Kafka
Gatling Kafka
⭐
59
A Gatling stress test plugin for Apache Kafka protocol
Pyspark Setup Guide
⭐
54
A guide for setting up Spark + PySpark under Ubuntu linux
Lighthouse
⭐
54
Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines and apply best practices.
Sparker
⭐
53
SparkER: an Entity Resolution framework for Apache Spark
Gatlingcql
⭐
52
Gatling support for Apache Cassandra CQL
Scalaz Camel
⭐
51
A Scala(z)-based DSL for Apache Camel
Carbondatalearning
⭐
50
Apache CarbonData Learning
Spark Json Schema
⭐
50
JSON schema parser for Apache Spark
Scalikesolr
⭐
50
Apache Solr Client for Scala/Java
Spark Tpcds Datagen
⭐
50
All the things about TPC-DS in Apache Spark
Spark Nkp
⭐
47
Natural Korean Processor for Apache Spark
Marvin Engine Executor
⭐
47
Marvin AI has been accepted into the Apache Foundation and is now available at https://github.com/apache/incubator-marvin
Spark Tda
⭐
46
SparkTDA is a package for Apache Spark providing Topological Data Analysis Functionalities.
Demo Spark Analytics
⭐
46
Demo about realtime analytics of user behavior using elk stack/apache spark streaming+mllib/redis/slamdata
Spark Vlbfgs
⭐
45
Vector-free L-BFGS implementation for Spark MLlib
Openwhisk Runtime Python
⭐
45
Apache OpenWhisk Runtime Python supports Apache OpenWhisk functions written in Python
Spark Es
⭐
44
ElasticSearch integration for Apache Spark
Vagrant Spark Zeppelin
⭐
43
Vagrant, Apache Spark and Apache Zeppelin VM for teaching
Related Searches
Java Apache (4,331)
Scala Sbt (4,178)
Scala Spark (3,279)
Php Apache (2,627)
Scala Akka (2,120)
Java Scala (1,794)
Javascript Apache (1,522)
Shell Apache (1,492)
Python Apache (1,438)
Scala Play Framework (1,309)
1-100 of 310 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.