Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for spark distributed computing
distributed-computing
x
spark
x
39 search results found
Fugue
⭐
1,821
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
Elephas
⭐
1,548
Distributed Deep learning with Keras & Spark
Data Algorithms Book
⭐
973
MapReduce, Spark, Java, and Scala for Data Algorithms Book
Metorikku
⭐
536
A simplified, lightweight ETL Framework based on Apache Spark
Sparktorch
⭐
297
Train and run Pytorch models on Apache Spark.
Geni
⭐
268
A Clojure dataframe library that runs on Spark
Js Spark
⭐
186
Realtime calculation distributed system. AKA distributed lodash
Sansa Stack
⭐
139
Big Data RDF Processing and Analytics Stack built on Apache Spark and Apache Jena http://sansa-stack.github.io/SANSA-Stack/
Dizk
⭐
117
Java library for distributed zero knowledge proof systems
Spark With Python
⭐
98
Fundamentals of Spark with Python (using PySpark), code examples
Mltoolkits
⭐
65
learningOrchestra is a distributed Machine Learning integration tool that facilitates and streamlines iterative processes in a Data Science project.
Spark Lp
⭐
64
Distributed Linear Programming Solver on top of Apache Spark
Frovedis
⭐
62
Framework of vectorized and distributed data analytics
Dcf
⭐
51
Yet another distributed compute framework
Lstm Tensorspark
⭐
35
Implementation of a LSTM with TensorFlow and distributed on Apache Spark
Dlsa
⭐
33
Distributed least squares approximation (dlsa) implemented with Apache Spark
Pyspark Algorithms
⭐
33
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Archived Sansa Query
⭐
31
SANSA Query Layer
Spark Ray Data Science
⭐
29
Supporting content (slides and exercises) for the Pearson video series covering best practices for developing scalable applications with Spark and Ray in the context of a data scientist's standard workflow.
Archived Sansa Inference
⭐
27
A general Inference API based on two of the most popular Big Data processing engines: Apache Spark and Apache Flink
Archived Sansa Owl
⭐
25
SANSA Stack OWL (Web Ontology Language) API
Open Stream Processing Benchmark
⭐
24
This repository contains the code base for the Open Stream Processing Benchmark.
Darima
⭐
23
Distributed ARIMA Models
Cs_interview
⭐
22
「Java学习+面试指南」思维导图,计算机自学指南,包括Java基础、JVM、数据库、mysql、r
Distributed Ml Pyspark
⭐
12
🔨使用Spark/Pytorch实现分布式算法,包括图/矩阵计算(graph/matrix computation)、随机算法、优化(optimization)和机器学习。参考刘铁岩《分布式机 323课程
Bigdata
⭐
12
Lecture: Big Data
Smartfd
⭐
10
SmartFD: Efficient and Scalable Functional Dependency Discovery on Distributed Data-Parallel Platforms
Dgst
⭐
9
DGST: Efficient and Scalable Generalized Suffix Tree Construction on Apache Spark
Blaspark
⭐
8
Distributed linear algebra operations using Apache Spark
Spark Xarray
⭐
8
This is an experimental project that seeks to integrate PySpark and xarray for Climate Data Analysis.
Sparklyr.flint
⭐
7
Sparklyr extension making Flint time series library functionalities (https://github.com/twosigma/flint) easily accessible through R
Bigdata Essentials
⭐
7
All big data related tools/frameworks in once central repo.
Pical
⭐
6
(Work In Process) pita is a general distributed computation system with Erlang language base on DAG model. This project is inspired by DouBan 's DPark and Apache Spark.
Zappy
⭐
6
Distributed processing with NumPy and Zarr
Databrickstraining
⭐
6
Repository for Microsoft Databricks Training Events - Hosted by BlueGranite
Spark Utils
⭐
6
Comfy Utilities for Spark Job Authoring
Sparknotes
⭐
5
Spark 2.0学习笔记
Selective Search
⭐
5
Selective search partitions large scale dataset into subsets(shards) such that only few shards needs to be searched for a query, thus improving search efficiency and effectiveness
Spark Data Repair Plugin
⭐
5
Provide functionality to build statistical models to repair dirty tabular data in Spark
Related Searches
Scala Spark (3,279)
Python Spark (2,053)
Java Spark (1,587)
Apache Spark (1,207)
Spark Hadoop (1,188)
Jupyter Notebook Spark (1,151)
Spark Kafka (985)
Spark Streaming (817)
Spark Pyspark (812)
1-39 of 39 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.