Awesome Open Source
Search results for amazon web services glue
Aws Glue Samples
⭐
1,334
AWS Glue code samples
Aws Glue Libs
⭐
568
AWS Glue Libraries are additions and enhancements to Spark for ETL operations.
Piicatcher
⭐
215
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub
Aws Tutorial Code
⭐
196
AWS tutorial code.
Aws Etl Orchestrator
⭐
185
A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Aws Glue Data Catalog Client For Apache Hive Metastore
⭐
184
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog
Data Engineering For Aws Immersion Day
⭐
153
Lab Instructions for Data Engineering Immersion Day
Aws Glue Schema Registry
⭐
109
AWS Glue Schema Registry Client library provides serializers / de-serializers for applications to integrate with AWS Glue Schema Registry Service. The library currently supports Avro, JSON and Protobuf data formats. See https://docs.aws.amazon.com/glue/latest/dg/schema- to get started.
Datajob
⭐
99
Build and deploy a serverless data pipeline on AWS with no effort.
Sagemaker Explaining Credit Decisions
⭐
87
Amazon SageMaker Solution for explaining credit decisions.
Amazon Deequ Glue
⭐
74
Automated data quality suggestions and analysis with Deequ on AWS Glue
Aws Glue Data Catalog Replication Utility
⭐
70
Replication utility for AWS Glue Data Catalog
Aws Utility Meter Data Analytics Platform Cn
⭐
66
Aws Dbs Refarch Datalake
⭐
47
Reference Architectures for Datalakes on AWS
Fhir Works On Aws Interface
⭐
38
The interface for the FHIR Works on AWS framework. This package is the glue that allows communication to flow between components
Workshop
⭐
30
BigData-JAWS study sessions and events
Gromit
⭐
28
The glue that bonds AWS, terraform and Github Actions.
Aws Glue Catalog Sync Agent For Hive
⭐
28
Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog
Dbcat
⭐
27
Data Catalog for Databases and Data Warehouses
Serverless Glue
⭐
24
A plugin for the Serverless Framework that deploys AWS Glue jobs and triggers
Aws Glue Databrew Jupyter Extension
⭐
23
Aws Glue Docker
⭐
22
🐋 Docker image for AWS Glue Spark/Python
Terraform Aws Kinesis Firehose
⭐
21
This code creates a Kinesis Firehose in AWS to send CloudWatch log data to S3.
Terraglue
⭐
21
Providing an easy way to deploy a Glue job in any AWS account using Terraform
Aws_glue_etl_docker
⭐
20
Helper library to run AWS Glue ETL scripts in a Docker container for local testing and development in a Jupyter notebook
Amazon S3 Step Functions Ingestion Orchestration
⭐
19
Design pattern for orchestrating an incremental data ingestion pipeline using AWS Step Functions from an on-premises location into an Amazon S3 data lake bucket
Fdiworkshop
⭐
18
Damons Data Lake
⭐
18
All the code related to building my own data lake
Serverless_data_pipeline_example
⭐
17
Build and Deploy A Serverless Data Pipeline on AWS
Amazon Redshift Commands Using Aws Glue
⭐
17
Use an AWS Glue Python Shell job to connect to your Amazon Redshift cluster and execute a SQL script stored in Amazon S3.
Analyzing Reddit Sentiment With Aws
⭐
16
Learn how to use Kinesis Firehose, AWS Glue, S3, and Amazon Athena by streaming and analyzing Reddit comments in real time. 100-200 level tutorial.
Aws Glue Table Versions Cleanup Utility
⭐
15
Clickstream Producer For Apache Kafka
⭐
14
Glue Sneaql Demo
⭐
13
Amazon Personalize Data Conversion Pipeline
⭐
13
Sample data conversion pipeline for importing data into Amazon Personalize.
Stim
⭐
13
Speeding up development with glue that brings tools together
Glutil
⭐
13
Utilities for managing AWS Glue/Athena tables and partitions stored in S3
Prestorials
⭐
13
Tutorials and examples of how to deploy Presto and connect it to different data sources
Athena Glue Quicksight Demo
⭐
12
Source code for the post, 'Getting Started with Data Analysis on AWS, using S3, Glue, Amazon Athena, and QuickSight'
Aws Glue Test Data Generator
⭐
12
AWS Glue Configurable Test Data Generator for S3 Data Lakes and DynamoDB
Terraform Aws Glue
⭐
12
Terraform modules for provisioning and managing AWS Glue resources
Amazon Forecast Automation
⭐
11
Athena Cloudtrail Partitioner
⭐
11
Automate the daily partitioning of your CloudTrail bucket in Athena
Building A Data Lake With Aws Glue And Amazon S3
⭐
10
Visualize Cur Using Glue Es
⭐
10
This solution provides a serverless way to visualize AWS Cost and Usage Reports using AWS Glue and Amazon Elasticsearch Service.
S3 Selectable
⭐
10
S3 Select over Glue Table data on S3
Sc Gaws
⭐
9
Glue code to wrap around AWS and do useful things in Go
Quicksightathena01
⭐
9
Amazon QuickSight and Amazon Athena workshop. The workshop focuses on ingesting data into Athena, combining it with other data sources, and visualizing it in QuickSight.
Aws Swf Fluent Php
⭐
9
Glue code around aws-sdk-php to allow fluent workflow definitions
Terraform Aws Glue Dev Endpoint
⭐
9
Terraform code to create, update or delete AWS Glue dev endpoint(s)
End To End Ml Application
⭐
8
Build your own Machine Learning application with Amazon SageMaker, AWS Glue and Amazon API Gateway
Terraform Aws Cur
⭐
8
Terraform module for creating Cost and Usage Reports complete with Glue and Athena to make CUR data available to e.g. QuickSight.
Data Engineering Onboarding Starter
⭐
8
This repository contains a 10-step program to enter the world of data engineering
Sparksnake
⭐
8
Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR
Tech Radar
⭐
7
RIO Technology Radar
Aws Etl
⭐
7
An ETL application on AWS that uses general open sales and customer data, available at https://github.com/camposvinicius/data/blob/main/A (a zipped file with several CSV files inside, to which transformations are applied).
Gluezeppelin
⭐
7
A Docker container that encapsulates the setup required to run a local Zeppelin server against an AWS Glue dev endpoint.
Glue Enrich Cost And Usage
⭐
7
Glue Python Shell Job that adds AWS Organizations account tags to Cost and Usage Reports. You can submit feedback & requests for changes by submitting issues in this repo or by making proposed changes & submitting a pull request.
Aws Glue Monorepo Style
⭐
7
Example of deploying AWS Glue jobs and workflows with Terraform in monorepo style. The code supports a miniseries of articles about AWS Glue and Python.
Glue Devcontainer
⭐
7
Glue VSCode devcontainer setup
Aws Insurancelake Etl
⭐
7
This solution helps you deploy ETL processes and data storage resources to create an Insurance Lake using Amazon S3 buckets for storage, AWS Glue for data transformation, and AWS CDK Pipelines. It is originally based on the AWS blog Deploy data lake ETL jobs using CDK Pipelines, and complements the InsuranceLake Infrastructure project
Reinvent2018_aim416
⭐
6
AIM416 workshop material for AWS re:Invent 2018
Spark Glue Data Catalog
⭐
6
Apache Spark build compatible with AWS Glue Data Catalog.
Aws_glueetl_workshop
⭐
6
AWS_GlueETL_workshop
Pandasglue
⭐
5
Productivity for your Data Lake
Data Analytics For Businfo
⭐
5
Shows an effective way to correct bus arrival information using data analytics built on AWS serverless services such as Kinesis Data Streams, Kinesis Data Firehose, S3, and Lambda.
Docker_spark_history_ui
⭐
5
A Dockerised version of the Spark History Server that gives access to metrics in the Spark UI from a log generated by AWS Glue
Aws Glue Docker
⭐
5
Dockerfile to run AWS Glue Python scripts locally
Ml End To End Workshop
⭐
5
End to End machine learning process
Aws Glue Sbt Quickstart
⭐
5
Example of how to set SBT up for local development of AWS Glue Scripts
Spark Eks
⭐
5
Examples and custom spark images for working with the spark-on-k8s operator on AWS
Zeppelin Glue
⭐
5
docker-compose project for easier local AWS Glue development
Copyright 2018-2024 Awesome Open Source. All rights reserved.