Awesome Open Source
Search results for dataframe apache spark
apache-spark
dataframe
24 search results found
Koalas
⭐
3,291
Koalas: pandas API on Apache Spark
Ballista
⭐
2,244
Distributed compute platform implemented in Rust, and powered by Apache Arrow.
Mobius
⭐
937
C# and F# language binding and extensions to Apache Spark
Sparkflow
⭐
301
Easy-to-use library for running TensorFlow on Apache Spark
Rumble
⭐
194
⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Pyspark Cheatsheet
⭐
140
PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
Aut
⭐
128
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Pulsar Spark
⭐
103
Spark Connector to read and write with Pulsar
Spark With Python
⭐
98
Fundamentals of Spark with Python (using PySpark), code examples
Net.jgp.labs.spark
⭐
63
Apache Spark examples exclusively in Java
Spark Json Schema
⭐
50
JSON schema parser for Apache Spark
Spark Nkp
⭐
47
Natural Korean Processor for Apache Spark
Spark Hive Udf
⭐
47
Example project showing how to use Hive UDFs in Apache Spark
Spark Dataframe Introduction
⭐
42
An introduction to Apache Spark DataFrames.
Spark Flow
⭐
32
Library for organizing batch processing pipelines in Apache Spark
Isarn Sketches Spark
⭐
27
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Chronicler Spark
⭐
25
InfluxDB connector to Apache Spark on top of Chronicler
Vectordisassembler
⭐
12
Greenplum Spark Connector
⭐
11
Example of using greenplum-spark connector
Sparkql
⭐
10
sparkql: Apache Spark SQL DataFrame schema management for sensible humans
Net.jgp.books.spark.ch03
⭐
10
Spark in Action, 2nd edition - chapter 3
Laurelin
⭐
8
Allows reading ROOT TTrees into Apache Spark as DataFrames
Spark Ecs Connector
⭐
7
[Archived] ArchiveECS connector for Apache Spark
Spark Privacy Preserver
⭐
7
Anonymizing Library for Apache Spark
Multipletest4spark
⭐
5
MT4S - Multiple Tests 4 Spark - a simple JUnit/ScalaTest testing framework for Apache Spark
Spark Kafka Example
⭐
5
Example of reading data from and writing data to a Kafka topic using the Apache Spark DataFrame and Dataset APIs
Apachespark Pyspark 2023
⭐
5
PySpark is a distributed data processing library for Python that processes large volumes of data on clusters using the Apache Spark framework, offering high performance and an integrated set of tools for analyzing and handling data at scale.