Awesome Open Source
Search results for spark data analysis
46 search results found
Alluxio
⭐
6,612
Alluxio, data orchestration for analytics and machine learning in the cloud
Spark Py Notebooks
⭐
1,515
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Optimus
⭐
1,438
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Scriptis
⭐
767
Scriptis is for interactive data analysis, with script development (SQL, PySpark, HiveQL), task submission (Spark, Hive), UDF, function and resource management, and intelligent diagnosis.
Data Science With Ruby
⭐
664
Practical Data Science with Ruby based tools.
Wedatasphere
⭐
624
WeDataSphere is a financial grade, one-stop big data platform suite.
Onedal
⭐
584
oneAPI Data Analytics Library (oneDAL)
Complete Life Cycle Of A Data Science Project
⭐
499
Complete-Life-Cycle-of-a-Data-Science-Project
Popmon
⭐
461
Monitor the stability of a Pandas or Spark dataframe ⚙︎
Zat
⭐
414
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Data Science
⭐
269
Projects and awesome list for all Data Science fields
Setl
⭐
173
A simple Spark-powered ETL framework that just works 🍺
Visions
⭐
166
Type System for Data Analysis in Python
Big Data Mapreduce Course
⭐
135
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
Ml Resource
⭐
110
A concise resource repository for machine learning
Spark R Notebooks
⭐
109
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Udacity Data Engineer Nanodegree
⭐
64
Classwork projects and homework from the Udacity Data Engineering Nanodegree
Apachespark
⭐
59
This repository teaches Databricks concepts through examples. It covers the important topics a data engineer needs in real-world work, using PySpark and Spark SQL for development. The course ends with a few case studies.
Spotify Song Recommendation Ml
⭐
52
UC Berkeley team's submission for RecSys Challenge 2018
Learning Spark Lightning Fast Big Data Analysis
⭐
47
Learning Spark: Lightning-Fast Big Data Analysis reading notes
Big Data Analysis With Scala And Spark
⭐
45
My submissions for the Coursera MOOC "Big Data Analysis with Scala and Spark" given by EPFL.
Architect_big_data_solutions_with_spark
⭐
42
code, labs and lectures for the course
Ides
⭐
32
Intelligent Data Exploration Service, a one-stop Data + AI data solution!
Sparkdataset
⭐
28
Instant search for and access to many datasets in PySpark.
Berkeleyx Apache Spark Labs
⭐
17
Rastercube
⭐
15
rastercube is a python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)
Python_moztelemetry
⭐
13
Spark bindings for Mozilla Telemetry
Sparkfastdataanalysis
⭐
12
Study notes for 《Spark 快速大数据分析》 (Learning Spark: Lightning-Fast Big Data Analysis)
Taller_sparkr
⭐
12
SparkR workshop for the Jornadas de Usuarios de R (R Users Conference)
Greenplum Spark Connector
⭐
11
Example of using greenplum-spark connector
Biananes
⭐
11
Scalable fMRI Data Analysis
Easterbunny
⭐
11
EasterBunny data analysis
Chicago Taxi Trips Analysis
⭐
10
Analysis of City Of Chicago Taxi Trip Dataset Using AWS EMR, Spark, PySpark, Zeppelin and Airbnb's Superset
Douyu Danmu Spark
⭐
10
Scrape hosts' danmu data from Douyu TV live streams and run the corresponding statistical analysis with Spark and other big data technologies.
Spark Pipeline
⭐
9
Example End-to-End Data Pipeline with Apache Spark from Data Analysis to Data Product
Spark Tss
⭐
9
Spark Time Series Set data analysis
Rcoboldi
⭐
9
R COBOL DI (Data Integration) Package : Import COBOL CopyBook data files directly into R as properly structured data frames.
Anova_in_pyspark
⭐
8
Custom one-way ANOVA implementation using PySpark
Analysis
⭐
8
Repo of approaches to practical data science problems, including notebook demos and working scripts | #DS | #analysis
Big Data Analysis With Python
⭐
8
Combine Spark and Python to process large datasets and unlock the power of parallel computing and machine learning
Traffic Data Analysis With Apache Spark Based On Mobile Robot Data
⭐
7
Mobile robot data were analyzed with Apache Spark to produce five statistical results: travel time, waiting time, average speed, occupancy, and density.
Sparklyr.flint
⭐
7
Sparklyr extension making Flint time series library functionalities (https://github.com/twosigma/flint) easily accessible through R
Spark Data Analysis Projects
⭐
6
A collection of data analysis projects done using PySpark via Jupyter notebooks.
Youtubedataanalysis
⭐
5
Large Scale data analysis on Youtube Dataset using Spark and Hadoop
Online Retail Data Analysis
⭐
5
Analysis of the publicly available sales transactions of a webshop selling all-occasion gifts
Thoughtful Data Science
⭐
5
Thoughtful Data Science, published by Packt
Microsoft Big Data Scientist And Ai
⭐
5
Microsoft Big Data, Data Scientist, and AI
Dynamic Streaming
⭐
5
Interactive data analysis with Spark streaming session code repo
Related Searches
Scala Spark (3,279)
Python Data Analysis (2,363)
Python Spark (2,053)
Jupyter Notebook Data Analysis (1,768)
Java Spark (1,587)
Apache Spark (1,207)
Spark Hadoop (1,188)
Jupyter Notebook Spark (1,151)
Spark Kafka (985)
Machine Learning Data Analysis (941)
Copyright 2018-2024 Awesome Open Source. All rights reserved.