Fugue

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
Alternatives To Fugue
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Data Science Ipython Notebooks25,668
6 months ago34otherPython
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Ibis3,40424293 months ago68December 10, 2023157apache-2.0Python
The flexibility of Python with the scale and performance of modern SQL.
Koalas3,2911167 months ago47October 19, 2021112apache-2.0Python
Koalas: pandas API on Apache Spark
Fugue1,821233 months ago125November 09, 202334apache-2.0Python
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
Delta Sharing65473 months ago33December 02, 202374apache-2.0Scala
An open protocol for secure data sharing
Python Data Science Cheatsheet590
6 years ago2
Python数据科学速查表
Eat_pyspark_in_10_days534
2 years ago1Python
pyspark🍒🥭 is delicious,just eat it!😋😋
Traceml49045122 months ago10November 25, 20216apache-2.0Python
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
Popmon46129 months ago36July 18, 202315mitPython
Monitor the stability of a Pandas or Spark dataframe ⚙︎
Zat41413 months ago11January 26, 202310mitJupyter Notebook
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Alternatives To Fugue
Select To Compare


Alternative Project Comparisons
Popular Spark Projects
Popular Pandas Projects
Popular Data Processing Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Python
Machine Learning
Sql
Spark
Pandas
Distributed Systems
Distributed Computing