Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for databricks
databricks
x
141 search results found
Redash
⭐
24,479
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Apijson
⭐
16,698
🏆 零代码、全功能、强安全 ORM 库 🚀 后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构。 🏆 A JSON Transmission Protocol and an ORM Library 🚀 provides APIs and Docs without writing any code.
Dolly
⭐
10,354
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
Synapseml
⭐
4,989
Simple and Distributed Machine Learning
Sqlglot
⭐
4,652
Python SQL Parser and Transpiler
Spark
⭐
1,963
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Delta Rs
⭐
1,634
A native Rust library for Delta Lake, with bindings into Python
Optscale
⭐
854
MLOps and FinOps platform to run ML/AI experiments and regular cloud workloads with optimal performance and cost.
Modern Data Warehouse Dataops
⭐
521
DataOps for the Modern Data Warehouse on Microsoft Azure. https://aka.ms/mdw-dataops.
Mlcraft
⭐
418
Synmetrix – open source semantic layer / Boost your LLM precision
Terraform Provider Databricks
⭐
417
Databricks Terraform Provider
Dbx
⭐
398
🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.
Mlops Platforms
⭐
308
Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...
Databricks Sdk Py
⭐
263
Databricks SDK for Python (Beta)
Dbldatagen
⭐
234
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
Azure Event Hubs Spark
⭐
225
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Nutter
⭐
225
Testing framework for Databricks notebooks
Overwatch
⭐
211
Capture deep metrics on one or all assets within a Databricks workspace
Azure Cosmosdb Spark
⭐
194
Apache Spark Connector for Azure Cosmos DB
Analytics Toolbox Core
⭐
185
A set of UDFs and Procedures to extend BigQuery, Snowflake, Redshift, Postgres and Databricks with Spatial Analytics capabilities
Cicd Templates
⭐
166
Manage your Databricks deployments and CI with code.
Dbt Databricks
⭐
165
A dbt adapter for Databricks.
Lakehouse Engine
⭐
154
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
Terraform Databricks Examples
⭐
148
Examples of using Terraform to deploy Databricks resources
Stowage
⭐
147
Bloat-free, no BS cloud storage SDK.
Variantspark
⭐
121
machine learning for genomic variants
Ucx
⭐
112
Your best companion for upgrading to Unity Catalog. UCX will guide you, the Databricks customer, through the process of upgrading your account, groups, workspaces, jobs etc. to Unity Catalog.
Databricks Sql Python
⭐
105
Databricks SQL Connector for Python
Azure.databricks.cicd.tools
⭐
96
Tools for Deploying Databricks Solutions in Azure
Cli
⭐
82
Databricks CLI
Jupyterlab Integration
⭐
72
DEPRECATED: Integrating Jupyter with Databricks via SSH
Spark
⭐
65
Open Source D-APM (Data-Application Performance Monitoring) for Apache Spark
Azure Databricks Client
⭐
59
Client library for Azure Databricks
Apachespark
⭐
59
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We will be using pyspark & sparksql for the development. At the end of the course we also cover few case studies.
Dstoolkit Mlops Databricks
⭐
58
ML Ops Accelerator: Databricks & Azure Machine Learning Unification
Databricks Api
⭐
57
A simplified, autogenerated API client interface using the databricks-cli package
Databricks_helpers
⭐
54
🧱 A collection of supplementary utilities and helper notebooks to perform admin tasks on Databricks
Mlflow Tracking Server
⭐
51
MLFLow Tracking Server based on Docker and AWS S3
Testing_bi_engine
⭐
45
TPC-H_SF10
Blackbricks
⭐
44
Black for Databricks notebooks
Azure Databricks Mlops Mlflow
⭐
44
Azure Databricks MLOps sample for Python based source code using MLflow without using MLflow Project.
Prefect Databricks
⭐
43
Prefect integrations for interacting with Databricks.
Architect_big_data_solutions_with_spark
⭐
42
code, labs and lectures for the course
Databricks Sdk Go
⭐
41
Databricks SDK for Go
Databricks
⭐
41
Databricks Platform - Architecture, Security, Automation and much more!!
Azure Databricks
⭐
37
Azure Databricks - Advent of 2020 Blogposts
Databricks Certified Data Engineer Associate Questions
⭐
37
This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.
Pyjaws
⭐
36
PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows
Johnsnowlabs
⭐
35
Gateway into the John Snow Labs Ecosystem
Terraform Azure Data
⭐
35
Terraform script to deploy almost all Azure Data Services
Databricks Grafana
⭐
34
Grafana Databricks integration allowing direct connection to Databricks to query and visualize Databricks data in Grafana.
Databricks Sql Go
⭐
31
Golang database/sql driver for Databricks SQL.
Databricks Rest Client
⭐
29
Delta Oms
⭐
28
DeltaOMS is a solution that help build a centralized repository of Delta Transaction logs and associated operational metrics/statistics for your Delta Lakehouse
Dash Dbx Sql
⭐
28
Simple Dash app demonstrating connection to Databricks via the Python SQL connector
Tf_azure_deployment
⭐
28
Azure Deployments using Terraform
Delta Go
⭐
26
Native Delta Lake Implementation in Go
Ml Azuredatabricks
⭐
24
Collection of Machine Learning Examples for Azure Databricks
Lhbench
⭐
24
Lakehouse storage system benchmark
Pace
⭐
24
Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you to programatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery, with definitions imported from Collibra, Datahub, ODD and the like.
Splunk Integration
⭐
23
Databricks Add-on for Splunk
Databricksconnectdocker
⭐
23
Docker Images with Databricks Connect Ready to go
Databricks Notebooks
⭐
22
Collection of Sample Databricks Spark Notebooks ( mostly for Azure Databricks )
Databricks Sdk Java
⭐
21
Databricks SDK for Java
Sqlalchemy Databricks
⭐
21
SQLAlchemy dialect for Databricks
Databricks Dbapi
⭐
19
DBAPI and SQLAlchemy dialect for Databricks Workspace and SQL Analytics clusters
Databricks.vsts.tools
⭐
18
VSTS Deployment Tasks for Databricks Objects
Terraform Provider Databricks
⭐
17
Terraform Databricks provider
Data Lineage Databricks To Purview
⭐
16
A proof of concept of how to integrate Spark Lineage in Azure Purview
Awesome Dolly
⭐
16
A curated list of Databricks' Dolly implementations, documentation, and use cases
Databricks Streamlit Demo
⭐
15
Demo of Streamlit application with Databricks SQL Endpoint
Sandbox
⭐
14
Experimental and low-maturity scripts
Databricks Kube Operator
⭐
14
A Kubernetes operator to enable GitOps style deploys for Databricks resources
Db Rocket
⭐
13
Keep your local python scripts installed and in sync with a databricks notebook. Shortens the feedback loop to develop projects using a hybrid environment.
Dbt_datawaves
⭐
13
Datawaves data models for Ethereum built using dbt
Azure Databricks Sdk Python
⭐
13
[archived] A Python SDK for the Azure Databricks REST API 2.0
Storeitemdemand
⭐
13
(117th place - Top 26%) Deep learning using Keras and Spark for the "Store Item Demand Forecasting" Kaggle competition.
Astro Provider Databricks
⭐
12
Orchestrate your Databricks notebooks in Airflow and execute them as Databricks Workflows
Demo Realtime Data Warehousing
⭐
12
Streaming data pipelines for real-time data warehousing. Includes fully managed connectors (PostgreSQL CDC, Snowflake).
Octopufs
⭐
11
OctopuFS library helps managing cloud storage, ADLSgen2 specifically. It allows you to operate on files (moving, copying, setting ACLs) in very efficient manner. Designed to work on databricks, but should work on any other platform as well.
Mobile_trends_using_spark_and_dash
⭐
11
Asynchronous, classic OOP on the Spark engine with a light front-end
Azure Databricks Log4j To Appinsights
⭐
11
Connect your Spark Databricks clusters Log4J output to the Application Insights Appender
Databricks Sdk R
⭐
11
Databricks SDK for R (Experimental)
Free Resources Books Papers
⭐
10
Books and Papers in Mathematics, Econometrics, Machine Learning, Finance etc for different levels that can be useful for Data Scientists, Developers and everyone whoo is interesting in STEM.
Batcomputer
⭐
10
A working example of DevOps & operationalisation applied to Machine Learning and AI
Pyspark Dataframe Made Easy
⭐
10
pyspark dataframe made easy
Spark Excel
⭐
10
A Spark data source for reading Microsoft Excel files
Devopsfordatabricks
⭐
10
Are you like me , a Senior Data Scientist, wanting to learn more about how to approach DevOps, specifically when you using Databricks (workspaces, notebooks, libraries etc) ? Set up using @Azure @Databricks
Apache Airflow Providers Transfers
⭐
10
Databricks Kubernetes Online Inference Poc
⭐
10
End-to-end proof of concept showing core MLOps practices to develop, deploy and monitor a machine learning model for online inference scenarios using Databricks and Kubernetes on Microsoft Azure.
Artificial Data Generator
⭐
9
Pipelines for generating large volumes of anonymous artificial data that share some of the characteristics of real NHS data
Timeseriesgan
⭐
9
GANs for Time series analysis (Synthetic data generation, anomaly detection and interpolation), Hypertuning using Optuna, MLFlow and Databricks
Freeza Offset
⭐
9
Spark stream consumption commit in kafka consumer group
Sondesh
⭐
9
Metadata Comparison Toolkit. As of now, V-1.0.0 only consists Comparison of two DDL file ( .sql ) or two DDL statement.
Gift
⭐
9
Gold Idea First Templates covering data, analytics and visualization.
Ondemandmlflowtrainandserve
⭐
9
A solution for on-demand training and serving of Machine Learning models, using Azure Databricks and MLflow
Learn Databricks
⭐
9
Notebooks to learn Databricks Lakehouse Platform
Terraform Databricks Workspace Management
⭐
9
Terraform module for Databricks Workspace Management: https://registry.terraform.io/providers/databricks
Terraform
⭐
9
Databricks Terraform
Pysparklyr
⭐
9
Extension to {sparklyr} that allows you to interact with Spark & Databricks Connect
1-100 of 141 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.