Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Airbyte | 12,918 | 11 | 3 months ago | 311 | December 08, 2023 | 5,111 | other | Python | ||
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted. | ||||||||||
Professional Services | 2,635 | 4 months ago | 41 | apache-2.0 | Python | |||||
Common solutions and tools developed by Google Cloud's Professional Services team. This repository and its contents are not an officially supported Google product. | ||||||||||
Gcp Variant Transforms | 114 | 2 years ago | 110 | apache-2.0 | Python | |||||
GCP Variant Transforms | ||||||||||
Kubernetes Bigquery Python | 106 | 4 years ago | 5 | apache-2.0 | Python | |||||
Example Kubernetes app that shows how to build a 'pipeline' to stream data into BigQuery. Uses Redis or Google Cloud PubSub | ||||||||||
Airbyte_serverless | 83 | 5 months ago | mit | Python | ||||||
Airbyte made simple (no UI, no database, no cluster) | ||||||||||
Dlp Dataflow Deidentification | 80 | 5 months ago | 15 | apache-2.0 | Java | |||||
Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP | ||||||||||
Data Pipeline | 79 | 10 years ago | 2 | apache-2.0 | Python | |||||
Data pipeline is a tool to run Data loading pipelines. It is an open sourced app engine app that users can extend to suit their own needs. Out of the box it will load files from a source, transform them and then output them (output might be writing to a file or loading them into a data analysis tool). It is designed to be modular and support various sources, transformation technologies and output types. The transformations can be chained together to form complex pipelines. | ||||||||||
Cloudrun Tutorial | 76 | a year ago | 3 | apache-2.0 | C# | |||||
A tutorial showing some of the features of Cloud Run | ||||||||||
Prism | 70 | 3 months ago | 2 | apache-2.0 | Python | |||||
Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python. | ||||||||||
Bigquery | 62 | a year ago | 11 | Jupyter Notebook | ||||||
BigQuery import and processing pipelines |