Awesome Open Source
Search results for sql data engineering
57 search results found
Mage Ai
⭐
6,324
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
Risingwave
⭐
5,799
The distributed streaming database. Engineered to offer the simplest and most cost-efficient way for stream processing and management.
Data Engineer Handbook
⭐
5,650
This is a repo with links to everything you'd ever want to learn about data engineering
Cloudquery
⭐
5,380
The open source high performance data integration platform built for developers.
Sql Translator
⭐
3,842
SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.
Evidence
⭐
2,776
Business intelligence as code: build fast, interactive data visualizations in pure SQL and Markdown.
Data Diff
⭐
2,707
Compare tables within or across databases
Quadratic
⭐
2,485
Quadratic | Data Science Spreadsheet with Python & SQL
Data Science Roadmap
⭐
2,445
Data Science Roadmap from A to Z
Qsv
⭐
2,079
CSVs sliced, diced & analyzed.
Awesome Opensource Data Engineering
⭐
1,331
An Awesome List of Open-Source Data Engineering Projects
Data Engineering Wiki
⭐
934
The best place to learn data engineering. Built and maintained by the data engineering community.
Sqlmesh
⭐
931
SQLMesh is a data transformation framework that brings the benefits of DevOps to data teams. It enables data scientists, analysts, and engineers to efficiently run and deploy data transformations written in SQL or Python.
Blaze
⭐
784
A blazing-fast query execution engine that speaks the Apache Spark language and has Arrow-DataFusion at its core.
Dataform
⭐
757
Dataform is a framework for managing SQL based data operations in BigQuery, Snowflake, and Redshift
Data Engineering Interview Questions
⭐
554
More than 2,000 data engineer interview questions.
Automate Dv
⭐
456
A free-to-use dbt package for creating and loading Data Vault 2.0 compliant data warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
Versatile Data Kit
⭐
389
One framework to develop, deploy and operate data workflows with Python and SQL.
Yuniql
⭐
292
Free and open source schema versioning and database migration made natively with .NET/6. NEW THIS MAY 2022! v1.3.15 released!
Cuelake
⭐
266
Use SQL to build ELT pipelines on a data lakehouse.
Jupysql
⭐
261
Better SQL in Jupyter. 📊
Snowpark Python
⭐
215
Snowflake Snowpark Python API
Dbt Trino
⭐
172
The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)
Dbt Sqlserver
⭐
170
dbt adapter for SQL Server and Azure SQL
Sayn
⭐
117
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Datachecks
⭐
117
Open Source Data Quality Monitoring.
Movalytics Data Warehouse
⭐
116
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
Patterns Devkit
⭐
101
Data pipelines from re-usable components
Flowman
⭐
85
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines.
Dataengineeringpilipinas
⭐
80
Data Engineering Pilipinas is a community for data engineers, data analysts, data scientists, developers, AI / ML engineers, and users of closed and open source data tools and methods / techniques in the Philippines. Data Engineering Pilipinas is a PyData group.
Beneath
⭐
64
Beneath is a serverless real-time data platform ⚡️
Apachespark
⭐
59
This repository teaches Databricks concepts through examples. It covers the important topics a data engineer needs in practice, using PySpark and Spark SQL for development. The course ends with a few case studies.
Awesome Data Science Resources
⭐
51
Resources about data science, machine learning, deep learning, data engineering, and SQL.
Mz Hack Day 2022
⭐
51
Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!
Work At Olist Data
⭐
41
Apply for a job at Olist's Data Team: https://olist.gupy.io/
Ibmdataengineeringcoursera
⭐
40
IBM Data Engineering Courses from Coursera
Prefect Dataplatform
⭐
37
Example repository showing how to build a data platform with Prefect, dbt and Snowflake
Databricks Certified Data Engineer Associate Questions
⭐
37
This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.
Debussy_concert
⭐
29
Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and pipelines.
Stairlight
⭐
25
A data lineage tool that detects table dependencies from rendered SQL statements.
Jobanalytics_and_search
⭐
22
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Airflow Docker
⭐
19
This is my Apache Airflow Local development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and workflows.
Airflowetl
⭐
16
Blog post on ETL pipelines with Airflow
Big Data Engineering
⭐
15
Data Engineering Mta Turnstile
⭐
14
Data Engineering - Metropolitan Transportation Authority (MTA) Subway Data Analysis
Sheetwork
⭐
14
A handy package to load Google Sheets to your database right from the CLI and with easy configuration via YAML files.
Data Engineering
⭐
12
A project portfolio to accompany my resume
Data Paths
⭐
11
Pydbtools
⭐
10
Python version of dbtools
Greatex
⭐
10
A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in Airflow.
Data Pipeline With Dbt Using Airflow On Gcp
⭐
10
This project demonstrates how to build and automate an ETL pipeline using DAGs in Airflow and load the transformed data into BigQuery. Tools used in this project include Astro, dbt, GCP, Airflow, and Metabase.
Data Engineering
⭐
9
This is an all-in-one repository for data engineers, ideal for beginners and interview preparation. It uses Python as the main programming language and incorporates MySQL, MongoDB, and Docker.
Data Engineering
⭐
9
Common data manipulations in different languages and frameworks.
Faizs Data Portofolio
⭐
9
This documentation is like a quick snapshot of my project in the data field, showing off my skills and know-how in this area.
Preludio
⭐
8
Preludio is a data transformation language based on PRQL.
Data Engineering Interviews
⭐
7
Data engineering interviews Q&A for data community by data community
Babbling.fish
⭐
6
My personal blog about Data Engineering. Powered by Gatsby.
Data Engineer Portfolio
⭐
6
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
Data Careers Handbook 2024
⭐
6
Data Career Handbook for all
Workspace
⭐
5
This repository provides containerized applications and microservices for the Information Systems and Databases Course @ Instituto Superior Técnico
Chartai
⭐
5
A Streamlit powered GPT-3 Application that allows you to chat with tabular data. In addition to AI Chart creation, insights are given too.
Providence
⭐
5
Apply Data Engineering to Personal Finance
Copyright 2018-2024 Awesome Open Source. All rights reserved.