Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for data warehouse
data-warehouse
x
152 search results found
Awesome Bigdata
⭐
12,800
A curated list of awesome big data frameworks, ressources and other awesomeness.
Gpdb
⭐
6,099
Greenplum Database - Massively Parallel PostgreSQL for Analytics. An open-source massively parallel data platform for analytics, machine learning and AI.
Materialize
⭐
5,547
The data warehouse for operational workloads.
Rudder Server
⭐
3,841
Privacy and Security focused Segment-alternative, in Golang and React
Dinky
⭐
2,657
Dinky is a data development platform based on Apache Flink, enabling agile data development and deployment.
Hydra
⭐
2,427
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
Dxy Covid 19 Data
⭐
2,179
2019新型冠状病毒疫情时间序列数据仓库 | COVID-19/2019-nCoV Infection Time Series Data Warehouse
Elementary
⭐
1,721
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Cubes
⭐
1,453
Light-weight Python OLAP framework for multi-dimensional data analysis
Udacity Data Engineering Projects
⭐
1,335
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Tensorbase
⭐
1,217
TensorBase is a new big data warehousing with modern efforts.
Dlt
⭐
1,069
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Hue
⭐
1,031
Open source SQL Query Assistant service for Databases/Warehouses
Bigquery Utils
⭐
994
Useful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Brickhouse
⭐
961
Hive UDF's for the data warehouse
Scratchdb
⭐
956
Scratch is an open-source alternative to BigQuery, Redshift, and Snowflake. Runs on Clickhouse.
Optimus
⭐
707
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Vulcan Sql
⭐
570
Open-source Analytical Data API Framework for data apps. It turns SQL queries into RESTful APIs in no time!
Modern Data Warehouse Dataops
⭐
521
DataOps for the Modern Data Warehouse on Microsoft Azure. https://aka.ms/mdw-dataops.
Bigfunctions
⭐
490
Supercharge BigQuery with BigFunctions
Automate Dv
⭐
456
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
Indexr
⭐
422
An open-source columnar data format designed for fast & realtime analytic with big data.
Domainmod
⭐
419
DomainMOD is an open source application written in PHP & MySQL used to manage your domains and other internet assets in a central location. DomainMOD also includes a Data Warehouse framework that allows you to import your web server data so that you can view, export, and report on your live data.
Versatile Data Kit
⭐
389
One framework to develop, deploy and operate data workflows with Python and SQL.
Data Engineering Projects
⭐
322
Personal Data Engineering Projects
Yuniql
⭐
292
Free and open source schema versioning and database migration made natively with .NET/6. NEW THIS MAY 2022! v1.3.15 released!
Puppetdb
⭐
292
Centralized Puppet Storage
Datawarehouse
⭐
254
从数据仓库到用户画像,从数据建设到数据应用
Intermine
⭐
228
A powerful open source data warehouse system
Geomancer
⭐
194
Automated feature engineering for geospatial data
Mobydq
⭐
175
🐳 Tool to automate data quality checks on data pipelines
Dataplane
⭐
171
Dataplane is an Airflow inspired unified data platform with additional data mesh and RPA capability to automate, schedule and design data pipelines and workflows. Dataplane is written in Golang with a React front end.
Analytics Readings
⭐
155
Readings for Analytics Engineers
Transformalize
⭐
153
Configurable Extract, Transform, and Load
Titan
⭐
152
Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake data warehouse.
Cueobserve
⭐
144
Timeseries Anomaly detection and Root Cause Analysis on data in SQL data warehouses and databases
Amazon Serverless Datalake Workshop
⭐
123
A workshop demonstrating the capabilities of S3, Athena, Glue, Kinesis, and Quicksight.
Analytics
⭐
109
Analytics - Open source data warehouse and reporting for Nextcloud
Cloudberrydb
⭐
104
Cloudberry Database - Next generation unified database for Analytics and AI
Simple_dbt_project
⭐
96
Code for dbt tutorial
Bulker
⭐
92
Service for bulk-loading data to databases with automatic schema management (Redshift, Snowflake, BigQuery, ClickHouse, Postgres, MySQL)
Airbyte_serverless
⭐
83
Airbyte made simple (no UI, no database, no cluster)
Ddbt
⭐
72
Dom's Data Build Tool
Appdynamics.dexter
⭐
68
Turn your APM data store into a Data Warehouse with advanced reporting, including entities, configuration, metrics, flowmaps, events, snapshots and call graph flame graphs
Beneath
⭐
64
Beneath is a serverless real-time data platform ⚡️
Metamapper
⭐
60
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
Mara Schema
⭐
57
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
Etl_with_python
⭐
57
ETL with Python - Taught at DWH course 2017 (TAU)
Mitzu
⭐
50
Mitzu is an open-source product analytics tool that queries directly the data warehouse
Dataligo
⭐
47
A library to accelerate ML and ETL pipeline by connecting all data sources
Datasphere Content
⭐
45
Use sample content to explorer SAP Datasphere. The downloads contain sample data as CSV files, but could also include model / metadata information. See the README files for details.
Couchwarehouse
⭐
45
Data warehouse for CouchDB
Pgwarehouse
⭐
45
Easily sync your Postgres database to a Snowflake, ClickHouse, or DuckDB warehouse.
Virtual Data Warehouse
⭐
44
The Virtual Data Warehouse is a code generation and template management tool. It is part of the data solution automation ecosystem - the 'engine' for data solution automation.
Accio
⭐
43
Accio - Query Your Data Warehouse Like Exploring One Big View.
Data Solution Framework
⭐
42
A library for data warehouse and data integration pattern and architecture documentation.
Alphasql
⭐
39
AlphaSQL provides Integrated Type and Schema Check and Parallelization for SQL file set mainly for BigQuery
Mcw Template Cloud Workshop
⭐
37
Official Microsoft Cloud Workshop Template
Sharpetl
⭐
36
Write ETL using your favorite SQL dialects
Space
⭐
36
Unified storage framework for the entire machine learning lifecycle
Ixmp
⭐
34
The ix modeling platform for integrated and cross-cutting scenario analysis
Cloud Data Lake
⭐
34
Data lake, data warehouse on GCP
Insights.js
⭐
33
Real user monitoring
Data Warehouse Automation Metadata Schema
⭐
33
Generic interface exchange format for Data Warehouse Automation and ETL generation.
Dpm
⭐
29
Data Package Manager: Generate code libraries tailored to specific datasets
Real Time Data Warehouse
⭐
29
Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
Hive Cube
⭐
27
Data self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org
Dbt Firebolt
⭐
26
The dbt adapter for Firebolt
History
⭐
26
Download and warehouse historical trading data
Pyomop
⭐
25
Python package for managing OHDSI clinical data models.
Elasticflow
⭐
25
ElasticFlow(伊塔)是一个开源弹性流数据交换系统,支持在任意类型数据端之间通过简单配置就可
Arthur Redshift Etl
⭐
25
ELT Code for your Data Warehouse
Tweetsolaping
⭐
24
implementing an end-to-end tweets ETL/Analysis pipeline.
Bimlflex Community
⭐
24
Community-focused content to supplement working with BimlFlex.
Redshift Ruby Tutorial
⭐
23
Using AWS Redshift and Ruby to setup your data warehouse
Cdc_audit
⭐
23
change data capture via audit tables and triggers for mysql.
Varify
⭐
23
Clinical DNA Sequencing Analysis and Data Warehouse
Olap Cube
⭐
22
is an hypercube of data
Guzhenping Blog
⭐
20
写点一路的风景,都很普通,主要还是留给自己。请访问:http://guzhenping.com
Datacatalog Connectors Hive
⭐
19
Sample code with integration between Data Catalog and Hive data source.
Listof
⭐
16
📜 Simple and flexible application to manage configuration data aka lists of values.
Rules Based Modeling Engine
⭐
16
一款基于规则的可视化模型构建引擎。支持指标定义,规则定义,多数据源接入,RESTful API 查询
Ceds Data Warehouse
⭐
16
Modeled for longitudinal storage and reporting of P-20W data, the Common Education Data Standards (CEDS) Data Warehouse implements star schema data warehouse normalization techniques for improved query performance.
App Fastdata
⭐
16
VoltDB Click Stream Processing Example.
Intelli Swift Core
⭐
16
Distributed, Column-oriented storage, Realtime analysis, High performance Database
Mondrian Server
⭐
16
Mondrian 8 OLAP server with XMLA endpoint and Saiku in a self-contained .war file, configured though a single .properties file
Analytics Cloud Datasphere Community Content
⭐
15
Download content packages for SAP Analytics Cloud and SAP Datasphere. Find technical samples, best practices or business scenarios. Packages contain data models, visualisations and sample data (if applicable).
Online_store
⭐
15
End to end data engineering project
Google Sheets Etl
⭐
15
Live import all your Google Sheets to your data warehouse
Ds4fnp
⭐
15
Data Stacks For Fun & Nonprofits!
Data Engineering Mta Turnstile
⭐
14
Data Engineering - Metropolitan Transportation Authority (MTA) Subway Data Analysis
Transmart Core
⭐
13
Core components and documentation of the tranSMART platform. https://i2b2transmart.org/
Content Data Api
⭐
13
Data warehouse that stores content and content metrics to help content owners measure and improve content on GOV.UK
Glue Sneaql Demo
⭐
13
Amazon Redshift Modernize Dw
⭐
13
Can you set up a data warehouse and create a dashboard in under 60 minutes? In this workshop, we show you how with Amazon Redshift, a fully managed cloud data warehouse that provides first-rate performance at the lowest cost for queries across your data warehouse and data lake. Learn the steps and best practices for deploying your data warehouse in your organization. Also, learn how to query petabytes of data in your data warehouse and exabytes of data, without loading or moving, in your Amazon
Cortana Intelligence Customer360
⭐
13
This repository contains instructions and code to deploy a customer 360 profile solution on Azure stack using the Cortana Intelligence Suite.
Backend_learning_notes
⭐
13
后端学习笔记,本项目存放了一些我阅读有关的技术类的书籍和部分源码阅读的笔记整理。 涉及范围包括后端开发中的计算机学科基础知识、高级语言的基础知识、源码阅读笔记、数据库知识、数据挖掘知 :-D
Objects Go
⭐
13
Segment Go library for Objects API
Cobra Policytool
⭐
13
Manage Apache Atlas and Ranger configuration for your Hadoop environment.
Data Brewery
⭐
12
Data Brewery is an ETL (Extract-Transform-Load) program that connect to many data sources (cloud services, databases, ...) and manage data warehouse workflow.
1-100 of 152 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.