Awesome Open Source
Search results for python data catalog
30 search results found
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Intake is a lightweight package for finding, investigating, loading and disseminating data.
🐳 The stupidly simple CLI workspace for your data warehouse.
Recap tracks and transform schemas across your whole application.
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub
An intake plugin for parsing an Earth System Model (ESM) catalog and loading assets into xarray datasets.
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
Commons code used by the Data Catalog connectors, and links for the connectors sample code.
Datacatalog Connectors Rdbms
Sample code with integration between Data Catalog and RDBMS data sources.
Datacatalog Tag Engine
Tag Engine lets you automate the process of creating and populating metadata tags with Google Cloud's Data Catalog. Tag Engine is licensed under the Apache 2 license terms. Please make sure to read, understand and agree to the terms of the LICENSE and CONTRIBUTING files before proceeding.
Data catalog for everything in your company
End-to-end DataOps platform deployed by Terraform.
Open-source metadata collector based on ODD Specification
An end-to-end data lineage tool, detects table dependencies from SQL statements.
Datacatalog Connectors Bi
Sample code with integration between Data Catalog and BI data sources.
Datacatalog Connectors Hive
Sample code with integration between Data Catalog and Hive data source.
Datacatalog Tag Manager
Python package to manage Google Cloud Data Catalog tags, loading metadata from external sources -- currently supports the CSV file format
Build a data catalog by running a single call with reading privileges
articat: data artifact catalog
A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable front end that's just HTML.
Registry of data portals, catalogs, data repositories including data catalogs dataset and catalog description standard
A Python package to centralize some Google Cloud Data Catalog scripts, this repo contains commands like bulk CSV operations that help leverage Data Catalog features.
An Intake catalog for distributing open energy system data liberated by Catalyst Cooperative.
Gcp Datacatalog Python
Python samples to help Data Citizens who work with Google Cloud Data Catalog
Polar Eo Database
Polar Earth Observation Database of satellite sensors
Intake Nested Yaml Catalog
Supports a single YAML file hierarchical catalog to organize datasets and avoid a data swamp.
An analytics engineering sandbox focusing on real estates prices in Cook County, IL
Datacatalog Fileset Enricher
A Python package to enrich Google Cloud Data Catalog Fileset Entries with tags.
Datacatalog Custom Model Manager
Python package to load user-specified metadata models into Google Cloud Data Catalog, comprising Custom Entries, Tag Templates, and Tags
Datacatalog Tag Template Processor
A package to manage Google Cloud Data Catalog Tag Template scripts.
Python Dataset (14,792)
Python Docker (14,113)
Python Amazon Web Services (8,175)
Python Django (8,165)
Python Google (6,420)
Python Search (5,943)
Python Json (5,654)
Python Database (5,586)
Python Csv (5,078)
Python Cloud Computing (4,744)
1-30 of 30 search results
Follow Us On Twitter
Copyright 2018-2023 Awesome Open Source. All rights reserved.