Awesome Open Source
Search results for java apache spark
84 search results found
Spark
⭐
37,661
Apache Spark - A unified analytics engine for large-scale data processing
Hudi
⭐
4,901
Upserts, Deletes And Incremental Processing on Big Data.
Ballista
⭐
2,244
Distributed compute platform implemented in Rust, and powered by Apache Arrow.
Oryx
⭐
1,793
Oryx 2: Lambda architecture on Apache Spark and Apache Kafka for real-time, large-scale machine learning
Dr Elephant
⭐
1,301
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
Spark Doc Zh
⭐
1,186
Chinese edition of the official Apache Spark documentation
Livy
⭐
911
Livy is an open source REST interface for interacting with Apache Spark from anywhere
Openscoring
⭐
565
REST web service for the true real-time scoring (<1 ms) of Scikit-Learn, R and Apache Spark models
Incubator Hivemall
⭐
308
Mirror of Apache Hivemall (incubating)
Succinct
⭐
239
Enabling queries on compressed data.
Rumble
⭐
194
⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Sparkrdma
⭐
191
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Vn.vitk
⭐
189
A Vietnamese Text Processing Toolkit
Whylogs Java
⭐
179
Profile and monitor your ML data pipeline end-to-end
Hydrograph
⭐
138
A visual ETL development and debugging tool for big data
Envelope
⭐
133
Build configuration-driven ETL pipelines on Apache Spark
Bunsen
⭐
110
Explore, transform, and analyze FHIR data with Apache Spark
Mongo Spark
⭐
93
Example application showing how to use the mongo-hadoop connector with Spark
Splash
⭐
86
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Spork
⭐
84
Pig on Apache Spark
Docker Spark
⭐
77
🚢 Docker image for Apache Spark
Euphoria
⭐
74
Euphoria is an open source Java API for creating unified big-data processing flows. It provides an engine independent programming model which can express both batch and stream transformations.
Openstreetmap_h3
⭐
72
High-performance data loader for OSM planet dumps. Transforms an OpenStreetMap World/Region PBF dump into a PostGIS pgsnapshot (lossless) OSM schema representation partitioned by H3 regions, and/or into ArrowIPC/Parquet dumps
Lighter
⭐
72
REST API for Apache Spark on K8S or YARN
Net.jgp.labs.spark
⭐
63
Apache Spark examples exclusively in Java
Net.jgp.books.spark.ch01
⭐
61
Spark in Action, 2nd edition - chapter 1 - Introduction
Osm Parquetizer
⭐
58
A converter for the OSM PBFs to Parquet files
Spark Examples
⭐
40
Spark examples
Jpmml Sparkml Lightgbm
⭐
39
JPMML-SparkML plugin for converting LightGBM-Spark models to PMML
Spark Of Life
⭐
38
Example of running a Genetic Algorithm (Travelling Salesman) on Apache Spark
Spark Transformers
⭐
37
Spark-Transformers: Library for exporting Apache Spark MLlib models for use in any Java application with no other dependencies.
Learning Spark With Java
⭐
32
Self-contained examples using Apache Spark with the functional features of Java 8
Netapp Hadoop Nfs Connector
⭐
29
This project provides an NFSv3 connector for Hadoop. Using the connector, Apache Hadoop and Apache Spark can use an NFSv3 server as their storage backend.
Metrics Spark Reporter
⭐
28
Dropwizard Metrics reporter for Apache Spark
Sparkproject
⭐
26
Using Apache Spark in an ArcMap Toolbox
Spash
⭐
25
Spash
Cloud Based Sql Engine Using Spark
⭐
25
Cloud-based SQL engine using Spark, where data is accessible as a JDBC/ODBC data source via the Spark ThriftServer.
Gospark
⭐
23
Go bindings for Apache Spark
Pulsar Adapters
⭐
23
Apache Pulsar Adapters
Apache Spark 2x For Java Developers
⭐
22
Apache Spark 2x for Java Developers, published by Packt
Spark Pmml Exporter Validator
⭐
20
Using JPMML Evaluator to validate the PMML models exported from Spark
Mmtf Spark
⭐
19
Methods for the parallel and distributed analysis and mining of the Protein Data Bank using MMTF and Apache Spark.
Spark Flight Connector
⭐
19
A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL
Dspbench
⭐
17
A suite of benchmark applications for distributed data stream processing systems
Proxima Platform
⭐
17
The Proxima platform.
Net.jgp.books.spark.ch02
⭐
16
Spark in Action, 2nd edition - chapter 2
Explainer
⭐
16
A machine learning model explainer that works on top of Apache Spark
Bigdata Projects
⭐
14
Student projects in the Big Data field.
Spark Spring Boot Starter
⭐
14
Spring Boot Starter for Apache Spark
Spark Connector
⭐
14
A connector for Apache Spark to access Exasol
Apache Spark Examples
⭐
14
Apache Spark Examples
Metrics Spark Receiver
⭐
13
Apache Spark Streaming receiver for metrics-spark-reporter
Javarank
⭐
13
Recommendation engine in Java. Based on an ALS algorithm (Apache Spark). Train a new model after N seconds.
Streamsx.kafka
⭐
13
Repository for integration with Apache Kafka
Net.jgp.books.spark.ch07
⭐
12
Spark in Action, 2nd edition - chapter 7 - Ingestion from files
Packt_publishing_courses_by_tomasz_lelek
⭐
12
https://www.packtpub.com/books/info/authors/tomasz
Amazon Emr Optimize Data Processing
⭐
12
Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark
Biananes
⭐
11
Scalable fMRI Data Analysis
Sparkfhe Examples
⭐
11
SparkFHE project demo examples
Dead Salmon Brain
⭐
11
Apache Spark-based framework for analyzing A/B experiments
Net.jgp.books.spark.ch03
⭐
10
Spark in Action, 2nd edition - chapter 3
Datacooker Etl
⭐
10
Data transformation framework for ETL processing with SQL-like syntax and GIS extensions, based on Apache Spark
Net.jgp.labs.spark.datasources
⭐
10
Building custom data sources for Apache Spark, in Java.
Biojava Spark
⭐
9
💥 Algorithms that are built around BioJava and run on Apache Spark
Blaspark
⭐
8
Distributed linear algebra operations using Apache Spark
Bigdata
⭐
8
Coding practice and research on the component technologies of big data pipelines
Laurelin
⭐
8
Allows reading ROOT TTrees into Apache Spark as DataFrames
Net.jgp.books.spark.ch08
⭐
8
Spark in Action, 2nd edition - chapter 8
Docker
⭐
8
Dockerfiles
Firehose
⭐
8
Firehose - Spark streaming 2.2 + Kafka 0.8_2
Enmasse Iot Demo
⭐
7
EnMasse - IoT demo
Net.jgp.books.spark.ch17
⭐
7
Spark in Action, 2nd edition - chapter 16 - exporting data, using delta lake
Spark X Means Clustering
⭐
6
Java implementation of X-Means clustering algorithm on Apache Spark
Spark Most Frequent Word Counter
⭐
6
This Java program counts the most frequent word in a given file using Apache Spark
Jfall Sentiment
⭐
6
JFall Presentation: Sentiment Analysis of Social Media Posts with Apache Spark
Text Log Parser
⭐
5
A CLI log parser based on Apache Spark. After parsing, the CLI utility pushes the data to a remote database over a JDBC connection. By default, it tries to push data into a local MySQL instance, but you can change the configuration externally (see the sections below).
Spark Newsreel Recommender
⭐
5
Hyper Spark
⭐
5
This is Spark running at 10Gb/s
Spark Examples
⭐
5
Repository of Example Spark Flows
Net.jgp.books.spark.ch09
⭐
5
Spark in Action, 2e - chapter 9 - Advanced ingestion: finding data sources and building your own
Deep Learning With Apache Spark
⭐
5
Sparkjavaexamples
⭐
5
Apache Spark Basics - Java Examples
Couchbase Spark Vs Rxjava Example
⭐
5
Net.jgp.books.spark.ch11
⭐
5
Spark in Action, 2nd edition - chapter 11 - Working with SQL
Ctakesspark
⭐
5
Attempt to integrate Apache cTAKES with Apache Spark