Awesome Open Source
Search results for python delta lake
15 search results found
Delta Rs
⭐
1,634
A native Rust library for Delta Lake, with bindings into Python
Dbldatagen
⭐
234
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) can generate large simulated or synthetic data sets for tests, POCs, and other uses in Databricks environments, including Delta Live Tables pipelines.
Amazon Sagemaker Local Mode
⭐
220
Amazon SageMaker Local Mode Examples
Mack
⭐
188
Delta Lake helper methods in PySpark
Lakehouse Engine
⭐
154
The Lakehouse Engine is a configuration-driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows, and utilities for Data Products.
Faker Cli
⭐
61
Command-line interface to quickly generate fake CSV and JSON data
Apachespark
⭐
59
This repository helps you learn Databricks concepts through examples. It covers the important topics that come up in real-world work as a data engineer, using PySpark and Spark SQL for development. The course ends with a few case studies.
Deltalakereader
⭐
45
Read Delta tables without any Spark
Edc Mod1 Exercise Igti
⭐
42
Module 1 exercises - Bootcamp EDC - IGTI 2021
Dask Deltatable
⭐
34
A Delta Lake reader for Dask
Building Data Lakehouse
⭐
32
Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data
Pysparkcheatsheet
⭐
30
PySpark Cheatsheet
Spark Movies Etl
⭐
21
Spark data pipeline that ingests and transforms movie ratings data.
Olh
⭐
19
Open-source lakehouse stack
Db2ixf
⭐
10
db2ixf is a python package with a CLI that simplifies the parsing and processing of IBM Integration eXchange Format (IXF) files.
Financial Data Project In Azure
⭐
8
Free High-Quality Financial Data in Azure
Cdk Emrserverless With Delta Lake
⭐
8
This construct builds the elements you need to quickly launch an EMR Serverless application. After submitting the EMR Serverless job, you can also launch an EMR notebook via a cluster template to check the output of the EMR Serverless application.
Emr Serverless Spark Delta Lake 2.0
⭐
8
A quick example for Delta Lake running on AWS EMR Serverless Spark
Spark Databricks
⭐
6
🔥 Master Apache Spark & Databricks! Dive into a world of big data with exclusive insights from Udemy courses, personal notes, and practical guides. Whether you're starting out or scaling new heights in data engineering, this is your ultimate resource hub! 🌟🚀
Spark Structured Streaming Kafka
⭐
5
Spark Structured Streaming + Kafka + Delta pipeline.
Waterbear
⭐
5
Automated provisioning of an industry Lakehouse with enterprise data model
Delta Buddy
⭐
5
Introducing Delta-Buddy: Your ultimate Delta Lake companion! 🚀 Streamline your data journey with an AI-powered chatbot. Ask Delta-Buddy anything about your Delta Lake.
Lakeapi
⭐
5
API for distributing Data Lake Data
Copyright 2018-2024 Awesome Open Source. All rights reserved.