Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Spydra | 132 | 2 years ago | 20 | December 08, 2020 | 12 | apache-2.0 | Java | |||
Ephemeral Hadoop clusters using Google Compute Platform | ||||||||||
Bdutil | 114 | 4 years ago | 32 | apache-2.0 | Shell | |||||
[DEPRECATED] Script used to manage Hadoop and Spark instances on Google Compute Engine | ||||||||||
Solutions Google Compute Engine Cluster For Hadoop | 81 | 6 years ago | 8 | apache-2.0 | Python | |||||
This sample app will get up and running quickly with a Hadoop cluster on Google Compute Engine. For more information on running Hadoop on GCE, read the papers at https://cloud.google.com/resources/. | ||||||||||
Data Pipeline | 79 | 10 years ago | 2 | apache-2.0 | Python | |||||
Data pipeline is a tool to run Data loading pipelines. It is an open sourced app engine app that users can extend to suit their own needs. Out of the box it will load files from a source, transform them and then output them (output might be writing to a file or loading them into a data analysis tool). It is designed to be modular and support various sources, transformation technologies and output types. The transformations can be chained together to form complex pipelines. | ||||||||||
Compute Hadoop Java Python | 28 | 9 years ago | 1 | apache-2.0 | Python | |||||
This software demonstrates one way to create and manage a cluster of Hadoop nodes running on Google Compute Engine. | ||||||||||
Hive Bigquery Storage Handler | 19 | 9 months ago | 8 | apache-2.0 | Java | |||||
Hive Storage Handler for interoperability between BigQuery and Apache Hive | ||||||||||
Solutions Apache Hive And Pig On Google Compute Engine | 19 | 6 years ago | apache-2.0 | Shell | ||||||
This sample app will get up and running quickly with Hive and/or Pig on a Hadoop cluster on Google Compute Engine. For more information on running Hadoop on GCE, read the papers at https://cloud.google.com/resources/. | ||||||||||
Nodejs Dataproc | 15 | 1 | 9 months ago | 39 | May 18, 2022 | apache-2.0 | ||||
This repository is deprecated. All of its content and history has been moved to googleapis/google-cloud-node. | ||||||||||
Java Dataproc | 13 | 13 | 8 | 9 months ago | 99 | September 16, 2021 | 3 | apache-2.0 | ||
This library has moved to https://github.com/googleapis/google-cloud-java/tree/main/java-dataproc. | ||||||||||
Big Data Architecture | 10 | 9 years ago | ||||||||
国外互联网公司大数据技术架构研究 |