Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for pandas pyspark
pandas
x
pyspark
x
27 search results found
Ibis
⭐
3,404
The flexibility of Python with the scale and performance of modern SQL.
Eat_pyspark_in_10_days
⭐
534
pyspark🍒🥭 is delicious,just eat it!😋😋
Pandapy
⭐
483
PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)
Datacompy
⭐
339
Pandas and Spark DataFrame comparison for humans and more!
Sparklingpandas
⭐
338
Sparkling Pandas
Hunter
⭐
170
A threat hunting / data analysis environment based on Python, Pandas, PySpark and Jupyter Notebook.
Handyspark
⭐
129
HandySpark - bringing pandas-like capabilities to Spark dataframes
Pypmml
⭐
64
Python PMML scoring library
Cuallee
⭐
56
A data quality acceleration library to get data sets verified in a friendly interface
Learn Data Munging
⭐
37
Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc.
Spark_app_twitter
⭐
36
A data engineering project (Twitter monitor app)
Farsante
⭐
26
Fake Pandas / PySpark DataFrame creator
Data Science Learning Paths
⭐
22
Practical data science courses - from basic to intermediate
Dataquest
⭐
12
Data Science Massive Open Online Course: All the code, notes and supplementary materials generated during the course of my data scientific learning.
Pyai Github 2024
⭐
11
《 Python人工智能编程实践(2024年度版)》全书数据和开源代码【出版中】
Ml Kaggle Github 2022
⭐
11
《 Python机器学习及实践:从零开始通往Kaggle竞赛之路(2022年度版)》全书数据和开源代码
Pywrangler
⭐
11
Advanced data wrangling for python
Pyspark Dataframe Made Easy
⭐
10
pyspark dataframe made easy
Data Engineering
⭐
9
Common data manipulations in different languages and frameworks.
Improved Spark Viz
⭐
6
🐼 WIP Improved visualizations in Spark
Pyspark_pandas
⭐
6
Pyspark + pandas. This may get merged into the SparklingPandas project.
Machine_learning_cheatsheets
⭐
6
Cheatsheets
⭐
6
This repo contains all the cheatsheets that I found Important.
Data_engineer_end2end
⭐
5
End-to-end data engineer project
Alpine Python3 Numpy Pandas Sparkcontainer Spark Submit
⭐
5
Using python3.6 alpine base image adds java,pandas, numpy,pyspark and spark as rundeps. This image can be used as container image when you run spark-submit on k8.
Tiktok Analytics Aggregator
⭐
5
Collects Titktok profile data on demand. Created a web UI to interact with data collect. Leveraged Pandas, Pyspark, MongoDb, Flask, API
Phom_tseries
⭐
5
Persistent homology for time series data; presentation and examples with Dionysus, Ripser, Pandas and PySpark
Related Searches
Python Pandas (4,272)
Jupyter Notebook Pandas (3,288)
Pandas Numpy (1,382)
Pandas Matplotlib (1,026)
Data Science Pandas (948)
Machine Learning Pandas (799)
Spark Pyspark (773)
Pandas Dataframe (737)
Python Pyspark (689)
1-27 of 27 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.