Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for spark pandas
pandas
x
spark
x
37 search results found
Data Science Ipython Notebooks
⭐
25,668
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Ibis
⭐
3,404
The flexibility of Python with the scale and performance of modern SQL.
Koalas
⭐
3,291
Koalas: pandas API on Apache Spark
Fugue
⭐
1,821
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
Delta Sharing
⭐
654
An open protocol for secure data sharing
Python Data Science Cheatsheet
⭐
590
Python数据科学速查表
Eat_pyspark_in_10_days
⭐
534
pyspark🍒🥭 is delicious,just eat it!😋😋
Traceml
⭐
490
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
Popmon
⭐
461
Monitor the stability of a Pandas or Spark dataframe ⚙︎
Zat
⭐
414
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Datacompy
⭐
339
Pandas and Spark DataFrame comparison for humans and more!
Sparklingpandas
⭐
338
Sparkling Pandas
Data Science
⭐
269
Projects and awesome list for all Data Science fields
Visions
⭐
166
Type System for Data Analysis in Python
Cape Dataframes
⭐
162
Privacy transformations on Spark and Pandas dataframes backed by a simple policy language.
Handyspark
⭐
129
HandySpark - bringing pandas-like capabilities to Spark dataframes
Data Science Tutorials
⭐
124
Python Tutorials for Data Science
Pypmml
⭐
64
Python PMML scoring library
Mylearningnotes
⭐
58
Because its never late to start taking notes and 'public' it...
Prosto
⭐
53
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Realtime Data Analytics Using Spark
⭐
39
Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc
Learn Data Munging
⭐
37
Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc.
Sageworks
⭐
36
SageWorks: An easy to use Python API for creating and deploying SageMaker Models
Dx
⭐
34
Data Explorer for Python
Yaetos
⭐
32
Write data & AI pipelines in (SQL, Spark, Pandas) and deploy to the cloud, simplified
Aws Glue Docker
⭐
22
🐋 Docker image for AWS Glue Spark/Python
Pytd
⭐
17
Treasure Data Driver for Python
Football Manager
⭐
15
Data Analysis as a Football Manager
Dcos Jupyterlab Service
⭐
11
JupyterLab Notebook for Mesosphere DC/OS
Pyspark Dataframe Made Easy
⭐
10
pyspark dataframe made easy
Data_ai_for_all
⭐
10
Data Analysis, Analytics, Science, AI & ML, LLM etc.
Entitymatchingmodel
⭐
9
Entity Matching Model solves the problem of matchi large datasets.
Realtime Recommender
⭐
8
Spark-Kafka Realtime recommender Engine.
Italian Sentiment Analysis With Spark
⭐
7
Application of Sentiment Analysis of Italian tweet with Python and Spark
Improved Spark Viz
⭐
6
🐼 WIP Improved visualizations in Spark
Pyspark_pandas
⭐
6
Pyspark + pandas. This may get merged into the SparklingPandas project.
Alpine Python3 Numpy Pandas Sparkcontainer Spark Submit
⭐
5
Using python3.6 alpine base image adds java,pandas, numpy,pyspark and spark as rundeps. This image can be used as container image when you run spark-submit on k8.
Related Searches
Python Pandas (4,272)
Jupyter Notebook Pandas (3,288)
Scala Spark (3,279)
Python Spark (2,053)
Java Spark (1,587)
Pandas Numpy (1,382)
Apache Spark (1,207)
Spark Hadoop (1,188)
Jupyter Notebook Spark (1,151)
Pandas Matplotlib (1,026)
1-37 of 37 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.