Tpcds

TPC-DS benchmarks, including data generation and query execution with Spark
Alternatives To Tpcds
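Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language

Benchm Ml
Stars: 1,839 | Most recent commit: 2 years ago | Open issues: 11 | License: mit | Language: R
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).

Spark Rapids
Stars: 619 | Most recent commit: 13 months ago | Total releases: 19 | Latest release: October 24, 2023 | Open issues: 1,271 | License: apache-2.0 | Language: Scala
Spark RAPIDS plugin - accelerate Apache Spark with GPUs

Streaming Benchmarks
Stars: 560 | Most recent commit: 2 years ago | Open issues: 26 | License: apache-2.0 | Language: Jupyter Notebook
Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...

Spark Sql Perf
Stars: 452 | Most recent commit: 2 years ago | Total releases: 2 | Latest release: January 25, 2016 | Open issues: 52 | License: apache-2.0 | Language: Scala

Spark Knn
Stars: 205 | Most recent commit: 3 years ago | Open issues: 16 | License: apache-2.0 | Language: Scala
k-Nearest Neighbors algorithm on Spark

Spark Terasort
Stars: 116 | Most recent commit: a year ago | Open issues: 2 | License: apache-2.0 | Language: Java
Spark Terasort

Spark Tpc Ds Performance Test
Stars: 104 | Most recent commit: 4 years ago | Open issues: 6 | License: apache-2.0 | Language: TSQL
Use the TPC-DS benchmark to test Spark SQL performance

Tpch Spark
Stars: 91 | Most recent commit: 3 months ago | Open issues: 1 | License: mit | Language: C
TPC-H queries in Apache Spark SQL using native DataFrames API

Benchm Databases
Stars: 90 | Most recent commit: 7 years ago | Open issues: 3 | Language: R
A minimal benchmark of various tools (statistical software, databases etc.) for working with tabular data of moderately large sizes (interactive data analysis).

Bestconf
Stars: 53 | Most recent commit: 2 years ago | Open issues: 2 | License: apache-2.0 | Language: Java
A tool automatically improving the performance of large-scale systems by finding better configuration settings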