Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for jupyter notebook pyspark
jupyter-notebook
x
pyspark
x
157 search results found
Machine Learning
⭐
2,607
🌎 machine learning tutorials (mainly in Python3)
Spark Py Notebooks
⭐
1,515
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Sparkmagic
⭐
1,272
Jupyter magics and kernels for working with remote Spark clusters
Pyspark Tutorial
⭐
959
PySpark-Tutorial provides basic algorithms using PySpark
Kuwala
⭐
610
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data science models and products with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) High-resolution demograp
Learningpyspark
⭐
409
Code base for the Learning PySpark book (in preparation)
Spark Syntax
⭐
391
This is a repo documenting the best practices in PySpark.
Miscellaneous
⭐
382
Includes notes on Apache Spark, Spark for Physics, Jupyter notebook examples for Spark, Oracle and other DB systems.
Gather Deployment
⭐
347
Gathers Python deployment, infrastructure and practices.
Spark Standalone Cluster On Docker
⭐
311
Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. ⚡
Learning Pyspark
⭐
294
Code repository for Learning PySpark by Packt
Spark Jupyter Aws
⭐
255
A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
Pyspark Tutorials
⭐
233
Code snippets and tutorials for working with social science data in PySpark
Data_science_blogs
⭐
232
A repository to keep track of all the code that I end up writing for my blog posts.
Sql Data Analysis And Visualization Projects
⭐
200
SQL data analysis & visualization projects using MySQL, PostgreSQL, SQLite, Tableau, Apache Spark and pySpark.
Azure Cosmosdb Spark
⭐
194
Apache Spark Connector for Azure Cosmos DB
Cloud Dataproc
⭐
173
Cloud Dataproc: Samples and Utils
Hunter
⭐
170
A threat hunting / data analysis environment based on Python, Pandas, PySpark and Jupyter Notebook.
Spark Practice
⭐
153
Apache Spark (PySpark) Practice on Real Data
Geopyspark
⭐
151
GeoTrellis for PySpark
Pyspark Pictures
⭐
149
Learn the pyspark API through pictures and simple examples
Nyc Transport
⭐
144
A Unified Database of NYC transport (subway, taxi/Uber, and citibike) data.
Repo 2019
⭐
135
BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, Tensorflow, Mathematics
Handyspark
⭐
129
HandySpark - bringing pandas-like capabilities to Spark dataframes
Mastering Big Data Analytics With Pyspark
⭐
118
Mastering Big Data Analytics with PySpark, Published by Packt
Ai Deployment
⭐
116
关注AI模型上线、模型部署
Spark Df Profiling
⭐
115
Create HTML profiling reports from Apache Spark DataFrames
Spark R Notebooks
⭐
109
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Machinelearning
⭐
106
Machine learning for beginner(Data Science enthusiast)
Dataproc Templates
⭐
103
Dataproc templates and pipelines for solving simple in-cloud data tasks
Spark With Python
⭐
98
Fundamentals of Spark with Python (using PySpark), code examples
Medium Articles
⭐
97
Repo for all my code on the articles I post on medium
Big Data Engineering Coursera Yandex
⭐
91
Big Data for Data Engineers Coursera Specialization from Yandex
Bitcoin Value Predictor
⭐
90
[NOT MAINTAINED] Predicting Bit coin price using Time series analysis and sentiment analysis of tweets on bitcoin
Kdd Cup 99 Spark
⭐
87
PySpark solution to the KDDCup99
Pyspark Predictive Maintenance
⭐
85
Predictive Maintenance using Pyspark
Pyspark Tutorial
⭐
82
Jupyter notebooks for pyspark tutorials given at the university
Anovos
⭐
78
Anovos - An Open Source Library for Scalable feature engineering Using Apache-Spark
Python Spark Streaming
⭐
73
Jupyterlab Sparkmonitor
⭐
72
JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook
Mmtf Pyspark
⭐
64
Methods for the parallel and distributed analysis and mining of the Protein Data Bank using MMTF and Apache Spark.
Nsl Kdd
⭐
63
PySpark solution to the NSL-KDD dataset: https://www.unb.ca/cic/datasets/nsl.html
W2v
⭐
62
Word2Vec models with Twitter data using Spark. Blog:
Sparkml
⭐
61
Spark ML with pyspark
Spark
⭐
60
Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References
Pysparkgeoanalysis
⭐
60
🌐 Interactive Workshop on GeoAnalysis using PySpark
Big_data
⭐
55
Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.
Pyspark Setup Guide
⭐
54
A guide for setting up Spark + PySpark under Ubuntu linux
Mmtf Workshop 2018
⭐
53
Structural Bioinformatics Training Workshop & Hackathon 2018
Spark Training
⭐
52
Repository used for Spark Trainings
Mlflow Spark Summit 2019
⭐
52
MLFlow Spark Summit 2019 Presentation
Learn Data Munging
⭐
37
Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc.
Machine Learning With Pyspark
⭐
37
Source Code for 'Machine Learning with PySpark' by Pramod Singh
Azure Databricks
⭐
37
Azure Databricks - Advent of 2020 Blogposts
Getting_started_with_pyspark
⭐
34
Materials for class Getting Started with Pyspark
Spark Studyclub
⭐
31
Grupo de Estudios de Apache Spark organizado por la comunidad Data Engineering Latam
Mongo Spark Jupyter
⭐
29
Docker environment that spins up MongoDB replica set, Spark, and Jupyter Lab. Example code uses PySpark and the MongoDB Spark Connector.
Spark Tree Plotting
⭐
29
A simple tool for plotting Spark ML's Decision Trees
Sparkdataset
⭐
28
Instant search for and access to many datasets in Pyspark.
Sparkdltrigger
⭐
28
Repo for the article "Machine Learning Pipelines with Modern Big DataTools for High Energy Physics"
Decorators4ds
⭐
27
Useful decorators every Data Scientist should know
Odsc_india_2018
⭐
26
My presentation at ODSC India 2018 about Deep Learning with Apache Spark
Mmtf Genomics
⭐
25
Methods for mapping genomic data onto 3D protein structure.
Courses
⭐
25
Just the stuff from the faculty (homework, projects, lectures)
Spark Fundamentals
⭐
24
Elevate big data skills with Apache Spark's core concepts and examples
Springboard Data Science Immersive
⭐
23
Detecting Malicious Url Machine Learning
⭐
23
Data Science Learning Paths
⭐
22
Practical data science courses - from basic to intermediate
Spark For Data Engineers
⭐
22
Apache Spark for data engineers
Pyspark Setup Demo
⭐
21
Demo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks
Spark Tdd Example
⭐
20
A simple Spark TDD example
Lasagna
⭐
20
A Docker Compose template that builds a interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive Metastore, Trino and Kafka
Coffee_boat
⭐
19
☕⛵WIP PySpark dependency management
Ds30_5
⭐
18
Data Science in 30 Minutes #5: Spark
Spark And Mllib Projects
⭐
18
This repository contains Spark, MLlib, PySpark and Dataframes projects
Reddit Streaming
⭐
18
streaming eight subreddits from reddit api using kafka producer & spark structured streaming.
Pyspark_dl_pipeline
⭐
17
Nyc Taxi Analysis
⭐
17
Analyzing 200 GB of NYC taxi dataset.
Pyspark For Data Processing
⭐
16
Code for my presentation: Using PySpark to Process Boat Loads of Data
Big Data Course
⭐
14
Practice course on Big Data
Pipeasy Spark
⭐
14
an easy way to define preprocessing data pipeline (similar to sklean-pandas but for Spark ML)
Gee
⭐
14
Pytorch implementation of GEE: A Gradient-based Explainable Variational Autoencoder for Network Anomaly Detection
Python_tutorials
⭐
14
Practical Python 3 tutorials
Hdinsight Pyspark Cntk Integration
⭐
14
Instructions and examples for installing CNTK on an HDInsight cluster and running CNTK-Pyspark applications from Jupyter notebooks.
Gentropy
⭐
13
Open Targets python framework for post-GWAS analysis
Workshops
⭐
13
CSCAR workshop material
Pyspark Ml Examples
⭐
13
Spark ML Tutorial and Examples for Beginners
Docker Datascience Ultimate
⭐
13
Customized Jupyter Spark Docker images with everything you need
Nyc_taxi_trip_duration
⭐
13
Develop ML models predict taxi trip duration in NYC. Ranked : Top 6% | RMSLE : 0.377 (Kaggle) | #DS
Netflix Recommender System
⭐
13
ITCS 6190 : Cloud Computing for Data Analysis project. Movie Recommendation Engine for Netflix Data with custom functions implementation and library usage.
Workshop Spark
⭐
13
Código para workshops Spark com ambiente de desenvolvimento em docker
Dataquest
⭐
12
Data Science Massive Open Online Course: All the code, notes and supplementary materials generated during the course of my data scientific learning.
Dagster Graph Project
⭐
12
Repo demonstrating a Dagster pipeline to generate Neo4j Graph
Cheetsheet
⭐
12
Machine learning cheetsheets
Machinelearningsamples Iris
⭐
12
Iris Sample
Pyspark For Beginners
⭐
12
PySpark for Beginners by Packt Pyblishing
Relk
⭐
12
RELK -- The Research Elastic Stack (Kafka, Beats, Zookeeper, Logstash, ElasticSearch, Kibana, Spark, & Jupyter -- All in Docker)
Pyai Github 2024
⭐
11
《 Python人工智能编程实践(2024年度版)》全书数据和开源代码【出版中】
Mmtf Proteomics
⭐
11
Methods for mapping proteomics data on 3D protein structure.
Pyspark_amld2019
⭐
11
Workshop materials for AMLD2019 on PySpark.
Related Searches
Python Jupyter Notebook (12,976)
Jupyter Notebook Machine Learning (8,463)
Jupyter Notebook Dataset (6,824)
Jupyter Notebook Deep Learning (6,566)
Jupyter Notebook Tensorflow (4,771)
Jupyter Notebook Convolutional Neural Networks (4,218)
Jupyter Notebook Neural (3,926)
Jupyter Notebook Pytorch (3,877)
Jupyter Notebook Classification (3,847)
Jupyter Notebook Data Science (3,734)
1-100 of 157 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.