Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for spark data quality
data-quality
x
spark
x
7 search results found
Deequ
⭐
3,044
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Zingg
⭐
828
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Traceml
⭐
493
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
Datavines
⭐
275
Know your data better!Datavines is Next-gen Data Observability Platform, support metadata manage and data quality.
Whylogs Java
⭐
179
Profile and monitor your ML data pipeline end-to-end
Lakehouse Engine
⭐
154
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
Soda Spark
⭐
49
Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
Data Flare
⭐
21
Data quality control tool built on spark and deequ
Hands On Great Expectations With Spark
⭐
12
How to evaluate the Quality of your Data with Great Expectations and Spark.
Huemul Bigdatagovernance
⭐
10
Huemul BigDataGovernance, es una framework que trabaja sobre Spark, Hive y HDFS. Permite la implementación de una estrategia corporativa de dato único, basada en buenas prácticas de Gobierno de Datos. Permite implementar tablas con control de Primary Key y Foreing Key al insertar y actualizar datos utilizando la librería, Validación de nulos, largos de textos, máximos/mínimos de números y fechas, valores únicos y valores por default. También permite clasificar los campos en aplicabilidad de der
Data Quality Monitoring
⭐
10
Data Quality Monitoring Tool
Related Searches
Scala Spark (3,279)
Python Spark (2,053)
Java Spark (1,587)
Apache Spark (1,207)
Spark Hadoop (1,188)
Jupyter Notebook Spark (1,151)
Spark Kafka (985)
Spark Streaming (817)
Spark Pyspark (812)
Spark Big Data (571)
1-7 of 7 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.