Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Data Science Ipython Notebooks | 25,668 | | | | 6 months ago | | | 34 | other | Python |
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines. | ||||||||||
Dev Setup | 5,802 | | | | 2 years ago | | | 34 | other | Python |
macOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools like Vim, Sublime Text, Bash, iTerm, Python data analysis, Spark, Hadoop MapReduce, AWS, Heroku, JavaScript web development, Android development, common data stores, and dev-based OS X defaults. | ||||||||||
Seldon Server | 1,420 | | | | 4 years ago | 44 | June 28, 2017 | 26 | apache-2.0 | Java |
Machine Learning Platform and Recommendation Engine built on Kubernetes | ||||||||||
Aws Glue Samples | 1,334 | | | | 6 months ago | | | 37 | mit-0 | Python |
AWS Glue code samples | ||||||||||
Devops Python Tools | 709 | | | | 3 months ago | | | 37 | mit | Python |
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc. | ||||||||||
Flintrock | 627 | | 4 | | 5 months ago | 14 | November 27, 2023 | 36 | apache-2.0 | Python |
A command-line tool for launching Apache Spark clusters. | ||||||||||
Aws Glue Libs | 568 | | | | 9 months ago | | | 96 | other | Python |
AWS Glue Libraries are additions and enhancements to Spark for ETL operations. | ||||||||||
Data Engineering Interview Questions | 554 | | | | 7 months ago | | | | | |
More than 2,000 data engineering interview questions. | ||||||||||
Spark Redshift | 514 | | 4 | 1 | 4 years ago | 10 | November 01, 2016 | 134 | apache-2.0 | Scala |
Redshift data source for Apache Spark | ||||||||||
Agile_data_code_2 | 435 | | | | a year ago | | | 7 | mit | Jupyter Notebook |
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition |