Awesome Open Source
Search results for scala pyspark
59 search results found
Synapseml
⭐
4,960
Simple and Distributed Machine Learning
Spark Nlp
⭐
3,578
State of the Art Natural Language Processing
Mleap
⭐
1,479
MLeap: Deploy ML Pipelines to Production
Awesome Spark
⭐
1,461
A curated list of awesome Apache Spark packages and resources.
Sparkling Water
⭐
957
Sparkling Water provides H2O functionality inside a Spark cluster
Scriptis
⭐
767
Scriptis is for interactive data analysis with script development (SQL, PySpark, HiveQL), task submission (Spark, Hive), UDF, function, and resource management, and intelligent diagnosis.
Eat_pyspark_in_10_days
⭐
534
pyspark🍒🥭 is delicious, just eat it!😋😋
Spark Standalone Cluster On Docker
⭐
311
Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. ⚡
Sagemaker Spark
⭐
285
A Spark library for Amazon SageMaker.
Hnswlib
⭐
233
Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Gimel
⭐
230
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Azure Cosmosdb Spark
⭐
194
Apache Spark Connector for Azure Cosmos DB
Drunken Data Quality
⭐
167
Spark package for checking data quality
Spark Extension
⭐
152
A library that provides useful extensions to Apache Spark and PySpark.
Spark Iforest
⭐
147
Isolation Forest on Spark
Aut
⭐
128
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Aliyun Emapreduce Demo
⭐
123
Spark_python_ml_examples
⭐
81
Spark 2.0 Python Machine Learning examples
Azure Databricks Nyc Taxi Workshop
⭐
80
An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset
Learn By Examples
⭐
72
Real-world Spark pipeline examples
Jgit Spark Connector
⭐
67
jgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source code analysis.
Pypmml
⭐
64
Python PMML scoring library
Spark
⭐
60
Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References
Pyspark Setup Guide
⭐
54
A guide for setting up Spark + PySpark under Ubuntu Linux
Spark Select
⭐
53
A library for Spark DataFrame using MinIO Select API
Mlflow Spark Summit 2019
⭐
52
MLflow Spark Summit 2019 Presentation
Spark Training
⭐
52
Repository used for Spark Trainings
Spark Hive Udf
⭐
47
Example project showing how to use Hive UDFs in Apache Spark
Sparkudfexamples
⭐
46
Spark SQL UDF examples
Spark Dgraph Connector
⭐
41
A connector for Apache Spark and PySpark to Dgraph databases.
Azure Databricks
⭐
37
Azure Databricks - Advent of 2020 Blogposts
Aliyun Cupid Sdk
⭐
30
SDK for open source frameworks to interact with MaxCompute
Isarn Sketches Spark
⭐
27
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Hands On Big Data Analytics With Pyspark
⭐
27
Hands-On Big Data Analytics with PySpark, Published by Packt
Courses
⭐
25
Just the stuff from the faculty (homework, projects, lectures)
Spark3d
⭐
22
Spark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteorology, …
Spark
⭐
20
Example code for the book 『빅데이터 분석을 위한 스파크 2 프로그래밍』 (Spark 2 Programming for Big Data Analysis)
Spark Sframe
⭐
19
This project contains the code to translate between Apache Spark and SFrame.
Oshinko S2i
⭐
19
This is a place to put s2i images and utilities for Spark application builders for OpenShift
Pmml4s Spark
⭐
19
PMML scoring library for Spark as SparkML Transformer
Spark Sparql Connector
⭐
17
spark-sparql-connector
Bigdata Workshop Es
⭐
17
Big Data workshop in Spanish
Spark Fits
⭐
15
FITS data source for Spark SQL and DataFrames
Listenbrainz Labs
⭐
15
A collection of tools/scripts to explore the ListenBrainz data using Apache Spark.
Pyspark
⭐
15
spark (scala and python)
Pypmml Spark
⭐
13
Python PMML scoring library for PySpark as SparkML Transformer
Dsci_553
⭐
12
USC ✌️ 2020 Spring DSCI 553 (Foundations and Applications of Data Mining) Score: 9️⃣4️⃣
Sentry Spark
⭐
12
Apache Spark Sentry Integration
Nyc_taxi_pipeline
⭐
12
Design/Implement stream/batch architecture on NYC taxi data | #DE
Spark_bazel
⭐
11
Spark Application with Bazel
Sarplus
⭐
10
pronounced sUrplus as it's simply better if not best!
Vim Sparkshell
⭐
9
control spark-shell from vim
Divolte Spark
⭐
7
Utilities for using data created by Divolte collector in Spark, Spark Streaming and PySpark
Spark Tutorials
⭐
6
PySpark notebooks to learn Apache Spark (WIP)
Spark Slowly Changing Dimension
⭐
6
Spark implementation of Slowly Changing Dimension type 2
Zeppelin Clojure Interpreter
⭐
6
Clojure plugin for Zeppelin
Spark Datadog Relay
⭐
6
Implements SparkFirehoseListener for forwarding Spark events to statsd
Spark Traffic
⭐
5
Batch processing of offline traffic big data with Spark
Docker Spark Anaconda
⭐
5
Spark and Anaconda in Docker
Related Searches
Scala Sbt (4,178)
Scala Spark (3,279)
Scala Akka (2,120)
Java Scala (1,794)
Scala Play Framework (1,309)
Plugin Scala (1,079)
Scala Kafka (969)
Scala Functional Programming (942)
Scala Scalajs (887)
Spark Pyspark (810)
Copyright 2018-2024 Awesome Open Source. All rights reserved.