Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Ignite | 4,626 | 15 | 3 | 2 months ago | 36 | May 04, 2023 | 729 | apache-2.0 | Java | |
Apache Ignite | ||||||||||
Mrjob | 2,584 | 112 | 2 | a year ago | 62 | December 15, 2021 | 211 | other | Python | |
Run MapReduce jobs on Hadoop or Amazon Web Services | ||||||||||
Nagios Plugins | 1,118 | 25 days ago | 71 | other | Python | |||||
450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc... | ||||||||||
Cloudbreak | 348 | 2 months ago | 41 | apache-2.0 | Java | |||||
CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and artificial intelligence functionality along with secure user access and data governance features. | ||||||||||
Elasticluster | 334 | 3 | 8 months ago | 12 | October 22, 2014 | 182 | gpl-3.0 | Python | ||
Create clusters of VMs on the cloud and configure them with Ansible. | ||||||||||
Maas | 299 | 2 months ago | other | Python | ||||||
Official MAAS repository mirror (may be out of date). Development happens in Launchpad (https://git.launchpad.net/maas/). | ||||||||||
Hadoop Connectors | 278 | 22 | 58 | 2 months ago | 597 | November 03, 2023 | 63 | apache-2.0 | Java | |
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform. | ||||||||||
Weathertop | 226 | 3 years ago | 5 | mit | JavaScript | |||||
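Day-to-day notes from studying J2EE and Linux components, suited to readers who want to learn or review the fundamentals. Planned content currently includes design patterns, Springboot, and SpringCloud, as well as open-source Linux components such as Redis, Kafka, Nginx, ElasticSearch, Hadoop, and Zookeeper. | ||||||||||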
Presto | 93 | 6 years ago | null | apache-2.0 | Java | |||||
Teradata Distribution of Presto -- A Distributed SQL Query Engine for Big Data | ||||||||||
Data Pipeline | 79 | 10 years ago | 2 | apache-2.0 | Python | |||||
Data Pipeline is a tool for running data-loading pipelines. It is an open-source App Engine app that users can extend to suit their own needs. Out of the box it loads files from a source, transforms them, and outputs the result (for example, by writing to a file or loading into a data analysis tool). It is designed to be modular and to support various sources, transformation technologies, and output types. Transformations can be chained together to form complex pipelines. |