Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Ibis | 3,404 | | 24 | 29 | 3 months ago | 68 | December 10, 2023 | 157 | apache-2.0 | Python |
The flexibility of Python with the scale and performance of modern SQL. | ||||||||||
Hopsworks | 1,041 | | | | 3 months ago | 1 | September 11, 2019 | 12 | agpl-3.0 | Java |
Hopsworks - Data-Intensive AI platform with a Feature Store | ||||||||||
Devops Python Tools | 709 | | | | 4 months ago | | | 37 | mit | Python |
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc. | ||||||||||
Pysparkling | 253 | | 7 | 1 | a year ago | 69 | November 13, 2022 | 9 | other | Python |
A pure Python implementation of Apache Spark's RDD and DStream interfaces. | ||||||||||
Zeppelin Notebooks | 206 | | | | 5 years ago | | | 6 | | Shell |
Gallery of Apache Zeppelin notebooks | ||||||||||
Spark With Python | 98 | | | | 4 years ago | | | | mit | Jupyter Notebook |
Fundamentals of Spark with Python (using PySpark), code examples | ||||||||||
Big Data Engineering Coursera Yandex | 91 | | | | a year ago | | | 4 | mit | Jupyter Notebook |
Big Data for Data Engineers Coursera Specialization from Yandex | ||||||||||
Cluster Pack | 44 | | 1 | 4 | 4 months ago | 39 | November 07, 2023 | 5 | apache-2.0 | Python |
A library on top of either pex or conda-pack to make your Python code easily available on a cluster | ||||||||||
Apache Spark Docker | 21 | | | | 2 years ago | | | 2 | apache-2.0 | VBA |
Dockerizing an Apache Spark Standalone Cluster | ||||||||||
Spark Hdfs On Kubernetes | 18 | | | | 6 years ago | | | | apache-2.0 | Shell |