Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for scala dataframe
dataframe
x
scala
x
91 search results found
Smile
⭐
5,833
Statistical Machine Intelligence & Learning Engine
Ballista
⭐
2,244
Distributed compute platform implemented in Rust, and powered by Apache Arrow.
Graphframes
⭐
944
Spark Redis
⭐
926
A connector for Spark that allows reading and writing to/from Redis cluster
Metorikku
⭐
536
A simplified, lightweight ETL Framework based on Apache Spark
Spark Avro
⭐
535
Avro Data Source for Apache Spark
Shc
⭐
484
The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.
Spark Scala Examples
⭐
443
This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language
Spark Solr
⭐
440
Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.
Spark Excel
⭐
421
A Spark plugin for reading and writing Excel files
Learningspark
⭐
406
Scala examples for learning to use Spark
Spark Fast Tests
⭐
385
Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)
Neo4j Spark Connector
⭐
300
Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
Spark Hbase Connector
⭐
287
Connect Spark to HBase for reading and writing data with ease
Geni
⭐
268
A Clojure dataframe library that runs on Spark
Sql Spark Connector
⭐
242
Apache Spark Connector for SQL Server and Azure SQL
Rasterframes
⭐
226
Geospatial Raster support for Spark DataFrames
Abris
⭐
215
Avro SerDe for Apache Spark structured APIs.
Isolation Forest
⭐
211
A Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm.
Allaboutscala
⭐
168
Source code for www.allaboutscala.com tutorials
Spark Binlog
⭐
153
A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).
Aut
⭐
128
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Pulsar Spark
⭐
103
Spark Connector to read and write with Pulsar
Yurita
⭐
99
Anomaly detection framework @ PayPal
Spark Highcharts
⭐
80
Support Highcharts in Apache Zeppelin
Mleap
⭐
76
MLeap allows for easily putting Spark ML pipelines into production
Doric
⭐
73
Type safety for spark columns
Sparksql Protobuf
⭐
73
Read SparkSQL parquet file as RDD[Protobuf]
Spark Sftp
⭐
69
Spark connector for SFTP
Spark Bigquery
⭐
69
Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
Dataframe Rules Engine
⭐
62
Extensible Rules Engine for custom Dataframe / Dataset validation
Vectorpipe
⭐
60
Convert Vector data to VectorTiles with GeoTrellis.
Learn Spark
⭐
60
Examples To Help You Learn Apache Spark
Delta Plus
⭐
56
A library based on delta for Spark and MLSQL
Spark Tutorial
⭐
55
This tutorial provides a quick introduction to using Spark
Spark Salesforce
⭐
54
Spark data source for Salesforce
Spark Stringmetric
⭐
50
Spark functions to run popular phonetic and string matching algorithms
Spark Json Schema
⭐
50
JSON schema parser for Apache Spark
Facets Overview Spark
⭐
48
Spark Implementation of Google Facets Overview https://github.com/PAIR-code/facets
Spark Nkp
⭐
47
Natural Korean Processor for Apache Spark
Spark Hive Udf
⭐
47
Example project showing how to use Hive UDFs in Apache Spark
Spark Google Spreadsheets
⭐
46
Google Spreadsheets datasource for SparkSQL and DataFrames
Megasparkdiff
⭐
46
A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations of possible data sources. Multiple execution modes in multiple environments enable the user to generate a diff report as a Java/Scala-friendly DataFrame or as a file for future use. Comes with out of the box SparkFactory and SparkCompare tools.
Struct Type Encoder
⭐
44
Deriving Spark DataFrame schemas from case classes
Flowml
⭐
41
流程化 机器学习框架 基于 scala java语言 ,一站式自动机器学习平台 ,主要包括数据分析 特征工程 ,机器模型,自动部署,超参数优化,模型自动优化,自动扩容分配创建功能,类似第四范式、阿里PAI平台、 autoMl、亚马逊SageMaker
Sparkoptics
⭐
40
Optics for Spark DataFrames
Spark Hadoopoffice Ds
⭐
37
A Spark datasource for the HadoopOffice library
Crossbow
⭐
35
Single node, in-memory DataFrame analytics library.
Spark Tools
⭐
35
Spark In Practice Scala
⭐
33
Getting started with Spark, Spark streaming, Spark SQL and DataFrame.
Spark Flow
⭐
32
Library for organizing batch processing pipelines in Apache Spark
Hivemall Spark
⭐
31
A Hivemall wrapper for Spark
Spark Hats
⭐
29
Nested array transformation helper extensions for Apache Spark
Spark Cloudant
⭐
27
Cloudant integration with Spark as Spark SQL external datasource
Isarn Sketches Spark
⭐
27
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Spark Hive Streaming Sink
⭐
26
A sink to save Spark Structured Streaming DataFrame into Hive table
Chronicler Spark
⭐
25
InfluxDB connector to Apache Spark on top of Chronicler
Movies Analytics In Spark And Scala
⭐
24
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
Scala Polars
⭐
21
Polars for Scala & Java projects!
Saddle
⭐
18
SADDLE: Scala Data Library
Spark Hive Streaming Sink
⭐
15
A sink to save Spark Structured Streaming DataFrame into Hive table
Spark Vcf
⭐
15
Spark VCF data source implementation for Dataframes
Spark Meta
⭐
14
Spark data profiling utilities
Spark To Tableau
⭐
14
Spark to Tableau Extractor library
Mison
⭐
14
Implementing MISON by Microsoft in C++ as a test
Sparkphoenix
⭐
14
Spark Example using Phoenix to interact with HBase
Vectordisassembler
⭐
12
Spark Example
⭐
11
Spark1.6和spark2.2的示例,包含kafka,flume,structuredstrea
Spark Constraints
⭐
10
SQL constraints in Spark!
Anatomy_of_spark_dataframe_api
⭐
10
Net.jgp.books.spark.ch03
⭐
10
Spark in Action, 2nd edition - chapter 3
Twittertrends
⭐
10
Get Twitter trends with twitter4j, stream it to a Kafka topic, save it to MongoDB and visualize in Google Maps
Word2phrase
⭐
10
Words -> Phrases; NLP
Scabillmatch
⭐
9
Policy diffusion in the US legislature
Hermes
⭐
8
A E2E test tool for Enceladus. Also general dataframe comparison tool
Geode Spark Connector
⭐
8
Spark Geode Connector supports Spark 2.1.0
Spark2cassandrabulkload
⭐
8
Spark Library for Bulk Loading into Cassandra
Parcsv
⭐
7
📂📰 Scala library for manipulating CSV dataframes.
Spark Adhoc Kafka
⭐
7
This is a datasource implementation for quick query in Kafka with Spark
Dataset Transform
⭐
7
Strongly typed Scala operations for working with Spark Datasets
Spark Ecs Connector
⭐
7
[Archived] ArchiveECS connector for Apache Spark
Spark Workday
⭐
6
Spark data source for Workday
Spark Learning
⭐
6
Example code which can help in getting started with spark
Spark Dataframe Demo
⭐
6
Spark DataFrame Demo For Meetup
Spark Netsuite
⭐
6
Spark NetSuite Connector
Scala Dataframe Libraries
⭐
6
Comparison of scala dataframe libraries for BMEG.
Spark Custom Api
⭐
6
Examples for adding custom API to RDD and DataFrame by using implicit.
Zeppelin Clojure Interpreter
⭐
6
Clojure Plugin for zeppelin
Spark User Feedback
⭐
6
Seq Datasource V2
⭐
6
Sequence Data Source for Apache Spark
Dcontext
⭐
5
Spark Smile
⭐
5
Integrating SMILE and Spark
Jsontodf
⭐
5
json to dataframe convertor
Multipletest4spark
⭐
5
MT4S - Multiple Tests 4 Spark - a simple Junit/Scalatest testing framework for Apache Spark
Spark Osm Datasource
⭐
5
Native Spark OSM PBF data source
Spark Kafka Example
⭐
5
Example for Data Reading from and Writing to from Kafka Topic using Apache Spark DataFrame and DataSet
Related Searches
Scala Sbt (4,178)
Scala Spark (3,279)
Scala Akka (2,120)
Java Scala (1,794)
Scala Play Framework (1,309)
Python Dataframe (1,170)
Plugin Scala (1,079)
Scala Kafka (969)
Scala Functional Programming (942)
Scala Scalajs (887)
1-91 of 91 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.