Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for spark databricks
databricks
x
spark
x
42 search results found
Redash
⭐
24,479
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Synapseml
⭐
4,967
Simple and Distributed Machine Learning
Sqlglot
⭐
4,652
Python SQL Parser and Transpiler
Spark
⭐
1,963
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Dbldatagen
⭐
234
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
Azure Event Hubs Spark
⭐
225
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Azure Cosmosdb Spark
⭐
194
Apache Spark Connector for Azure Cosmos DB
Lakehouse Engine
⭐
154
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
Variantspark
⭐
121
machine learning for genomic variants
Jupyterlab Integration
⭐
72
DEPRECATED: Integrating Jupyter with Databricks via SSH
Apachespark
⭐
59
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We will be using pyspark & sparksql for the development. At the end of the course we also cover few case studies.
Architect_big_data_solutions_with_spark
⭐
42
code, labs and lectures for the course
Databricks
⭐
41
Databricks Platform - Architecture, Security, Automation and much more!!
Azure Databricks
⭐
37
Azure Databricks - Advent of 2020 Blogposts
Pyjaws
⭐
36
PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows
Johnsnowlabs
⭐
35
Gateway into the John Snow Labs Ecosystem
Delta Go
⭐
26
Native Delta Lake Implementation in Go
Databricks Notebooks
⭐
22
Collection of Sample Databricks Spark Notebooks ( mostly for Azure Databricks )
Data Lineage Databricks To Purview
⭐
16
A proof of concept of how to integrate Spark Lineage in Azure Purview
Databricks Kube Operator
⭐
14
A Kubernetes operator to enable GitOps style deploys for Databricks resources
Storeitemdemand
⭐
13
(117th place - Top 26%) Deep learning using Keras and Spark for the "Store Item Demand Forecasting" Kaggle competition.
Octopufs
⭐
11
OctopuFS library helps managing cloud storage, ADLSgen2 specifically. It allows you to operate on files (moving, copying, setting ACLs) in very efficient manner. Designed to work on databricks, but should work on any other platform as well.
Azure Databricks Log4j To Appinsights
⭐
11
Connect your Spark Databricks clusters Log4J output to the Application Insights Appender
Mobile_trends_using_spark_and_dash
⭐
11
Asynchronous, classic OOP on the Spark engine with a light front-end
Spark Excel
⭐
10
A Spark data source for reading Microsoft Excel files
Pyspark Dataframe Made Easy
⭐
10
pyspark dataframe made easy
Timeseriesgan
⭐
9
GANs for Time series analysis (Synthetic data generation, anomaly detection and interpolation), Hypertuning using Optuna, MLFlow and Databricks
Pysparklyr
⭐
9
Extension to {sparklyr} that allows you to interact with Spark & Databricks Connect
Freeza Offset
⭐
9
Spark stream consumption commit in kafka consumer group
Sparkitecture
⭐
9
A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.
Dlt With Debug
⭐
8
A lightweight helper utility which allows developers to do interactive pipeline development by having a unified source code for both DLT run and Non-DLT interactive notebook run.
Ready2019_aa_ai_200
⭐
8
A Beginner's Guide to Azure Databricks
Machinelearningsamples
⭐
7
MachineLearning examples using Spark MLIB and Databricks
Databrickstraining
⭐
6
Repository for Microsoft Databricks Training Events - Hosted by BlueGranite
Formacao Engenheiro De Dados Cloud E Big Data Azure Databricks
⭐
6
Formação Engenheiro de Dados Cloud e Big Data (Azure & DataBricks)
Pyspark Connectors
⭐
6
Spark Databricks
⭐
6
🔥 Master Apache Spark & Databricks! Dive into a world of big data with exclusive insights from Udemy courses, personal notes, and practical guides. Whether you're starting out or scaling new heights in data engineering, this is your ultimate resource hub! 🌟🚀
Dac
⭐
6
Databricks Admin Center
Waterbear
⭐
5
Automated provisioning of an industry Lakehouse with enterprise data model
Spark For Dummies
⭐
5
Mastering Spark 2 from the very beginning
Microsoft Big Data Scientist And Ai
⭐
5
Microsoft Big Data, Data Scientist, and AI
Az Databricks Realtime Alert System
⭐
5
Building a real-time alert monitoring pipeline that sends email notifications off of Azure Event Hubs, Azure Databricks, and a Azure Logic App
Related Searches
Scala Spark (3,279)
Python Spark (2,053)
Java Spark (1,587)
Apache Spark (1,207)
Spark Hadoop (1,188)
Jupyter Notebook Spark (1,151)
Spark Kafka (985)
Spark Streaming (817)
Spark Pyspark (812)
Shell Spark (705)
1-42 of 42 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.