Awesome Open Source
Search results for python aws glue
22 search results found
Aws Sdk Pandas
⭐
3,813
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatch Logs, DynamoDB, EMR, Secrets Manager, PostgreSQL, MySQL, SQL Server and S3 (Parquet, CSV, JSON and Excel).
Piicatcher
⭐
215
Scan databases and data warehouses for PII. Tag tables and columns in data catalogs like Amundsen and DataHub.
Awesome Chalice
⭐
208
AWS Chalice is a framework for building Python serverless applications. With Chalice, you can build and manage HTTPS APIs, create web apps using popular front-end toolkits, and provide the backend for cross-platform desktop and mobile apps developed with Qt for Python.
Dataall
⭐
196
A modern data marketplace that makes collaboration among diverse users (such as business users, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
Athena Glue Service Logs
⭐
111
Glue scripts for converting AWS Service Logs for use in Athena
Streamlit Application Deployment On Aws
⭐
45
Streamlit EDA Dashboard Powered by AWS Cloud
Amazon Athena Cross Account Catalog
⭐
28
🌉 Reference implementation for granting cross-account AWS Glue Data Catalog access from Amazon Athena
Aws Glue Docker
⭐
22
🐋 Docker image for AWS Glue Spark/Python
Aws Glue Schema Registry Python
⭐
19
Use the AWS Glue Schema Registry in Python projects.
Covid 19 Data Engineering Pipeline
⭐
19
A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation and CDK, deployable via GitHub Actions.
Lakecli
⭐
18
A CLI to manage and monitor permissions in AWS Lake Formation
Serverless_data_pipeline_example
⭐
17
Build and Deploy A Serverless Data Pipeline on AWS
Analyzing Reddit Sentiment With Aws
⭐
16
Learn how to use Kinesis Firehose, AWS Glue, S3, and Amazon Athena by streaming and analyzing Reddit comments in real time. 100-200 level tutorial.
Rheoceros
⭐
15
Cloud-based AI / ML workflow and data application development framework
Data Engineering Onboarding Starter
⭐
8
This repository contains a 10-step program to enter the world of Data Engineering.
Transactional Datalake Using Apache Iceberg On Aws Glue
⭐
7
Stream CDC into an Amazon S3 data lake in Apache Iceberg format with AWS Glue Streaming and DMS
Aws Glue Monorepo Style
⭐
7
Example of AWS Glue jobs and workflow deployment with Terraform in monorepo style. The code here supports the miniseries of articles about AWS Glue and Python.
Glue Devcontainer
⭐
7
Glue VSCode devcontainer setup
Aws Compliancemachinedontstop
⭐
6
Proof-of-value Terraform scripts that use Amazon Web Services (AWS) Security, Identity & Compliance services to support your AWS account security posture.
Aws Glue Streaming Etl With Apache Iceberg
⭐
5
Streaming ETL job cases in AWS Glue that integrate Iceberg to create an in-place updatable data lake on Amazon S3
Aws Glue Crawler Utilities
⭐
5
This repository has a collection of utilities for Glue Crawlers. These utilities come in the form of AWS CloudFormation templates or AWS CDK applications.
Data_engineer_end2end
⭐
5
End-to-end data engineer project
Related Searches
Python Django (28,897)
Python Docker (14,113)
Python Machine Learning (14,099)
Python Jupyter Notebook (12,976)
Python Html (10,924)
Python Amazon Web Services (7,946)
Python Pandas (6,193)
Python Shell (5,055)
Python Data Science (4,679)
Python Cloud Computing (4,600)
Copyright 2018-2024 Awesome Open Source. All rights reserved.