Haas

The easiest way to launch a Hadoop cluster in the cloud
Alternatives To Haas
Data Science Ipython Notebooks
Stars: 25,668; Most Recent Commit: 7 months ago; Open Issues: 34; License: other; Language: Python
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

Mrjob
Stars: 2,584; Downloads / Repos Using This / Packages Using This (undelimited in source): 1122; Most Recent Commit: 2 years ago; Total Releases: 62; Latest Release: December 15, 2021; Open Issues: 211; License: other; Language: Python
Run MapReduce jobs on Hadoop or Amazon Web Services

Devops Bash Tools
Stars: 2,224; Most Recent Commit: 3 months ago; Open Issues: 5; License: mit; Language: Shell
1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux..

Nagios Plugins
Stars: 1,119; Most Recent Commit: 2 months ago; Open Issues: 71; License: other; Language: Python
450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...

Devops Python Tools
Stars: 709; Most Recent Commit: 4 months ago; Open Issues: 37; License: mit; Language: Python
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.

Flintrock
Stars: 627; Downloads / Repos Using This / Packages Using This (column unclear in source): 4; Most Recent Commit: 5 months ago; Total Releases: 14; Latest Release: November 27, 2023; Open Issues: 36; License: apache-2.0; Language: Python
A command-line tool for launching Apache Spark clusters.

Aws Glue Libs
Stars: 568; Most Recent Commit: 9 months ago; Open Issues: 96; License: other; Language: Python
AWS Glue Libraries are additions and enhancements to Spark for ETL operations.

Data Engineering Interview Questions
Stars: 554; Most Recent Commit: 7 months ago
More than 2000+ Data engineer interview questions.

Spark Redshift
Stars: 514; Downloads / Repos Using This / Packages Using This (undelimited in source): 41; Most Recent Commit: 4 years ago; Total Releases: 10; Latest Release: November 01, 2016; Open Issues: 134; License: apache-2.0; Language: Scala
Redshift data source for Apache Spark

Cloudbreak
Stars: 348; Most Recent Commit: 3 months ago; Open Issues: 41; License: apache-2.0; Language: Java
CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and artificial intelligence functionality along with secure user access and data governance features.


Ruby, Amazon Web Services, Cloud Computing, Hadoop