Awesome Open Source
Search results for scala data engineering
22 search results found
Data Engineering Howto
⭐
2,949
A list of useful resources to learn Data Engineering from scratch
Metarank
⭐
1,949
A low-code machine learning personalized ranking service for articles, listings, search results, and recommendations that boosts user engagement. A friendly Learn-to-Rank engine.
Feathr
⭐
1,886
Feathr – A scalable, unified data and AI engineering platform for enterprise
Every Single Day I Tldr
⭐
311
A daily digest of the articles or videos I've found interesting, that I want to share with you.
Geni
⭐
268
A Clojure dataframe library that runs on Spark
Setl
⭐
173
A simple Spark-powered ETL framework that just works 🍺
Spark Alchemy
⭐
169
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Flowman
⭐
85
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines.
Gallia Core
⭐
79
A schema-aware Scala library for data transformation
Waimak
⭐
73
Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
Us Stock Prediction Using Ml And Spark
⭐
35
Predict stock price based on financial news feeds
Scala Ql
⭐
30
Statically typed query DSL for Scala.
Spark Distcp
⭐
18
A re-implementation of Hadoop DistCP in Apache Spark
Bridgefour
⭐
16
Bridge Four is a simple, functional, effectful, single-leader, multi-worker distributed compute system optimized for embarrassingly parallel workloads.
Data Brewery
⭐
12
Data Brewery is an ETL (Extract-Transform-Load) program that connects to many data sources (cloud services, databases, ...) and manages data warehouse workflows.
Akka Lift Ml
⭐
12
Akka HTTP service for serving Spark machine learning models
Fake Data Pipeline
⭐
10
Data Generators -> Kafka -> Spark Streaming -> PostgreSQL -> Grafana
Huemul Bigdatagovernance
⭐
10
Huemul BigDataGovernance is a framework that runs on Spark, Hive, and HDFS. It supports implementing a corporate single-source-of-data strategy based on data governance best practices. It lets you implement tables with Primary Key and Foreign Key checks when inserting and updating data through the library, along with validation of nulls, text lengths, minimum/maximum values for numbers and dates, unique values, and default values. It also lets you classify fields according to the applicability of…
Flink Example
⭐
6
Flink Example
Sparklyclean
⭐
6
Optimal distributed data deduplication and supervised learning pipeline using Apache Spark
Opensnowcat Collector
⭐
6
OpenSnowcat Collector, an open source fork of Snowplow (Apache 2.0 License)
Data Engineer Portfolio
⭐
6
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
Data System Design
⭐
5
System Design, Solution Architecture, Data Systems Practice
Opensnowcat Enrich
⭐
5
OpenSnowcat Enricher (Apache 2.0 License)
Copyright 2018-2024 Awesome Open Source. All rights reserved.