Awesome Open Source

Programming Languages

Search results for hadoop spark sql

13 search results found

Kyuubi ⭐ 1,849

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Hadoop_study ⭐ 817

定期更新Hadoop生态圈中常用大数据组件文档重心依次为: Flink Solr Sparksql ES Scala Kafka Hbase/phoenix Redis Kerberos (项目包含hadoop思维导图印象笔记 Scala版本简单demo 常用工具类去敏后的train code 持续更新!!!)

Useractionanalyzeplatform ⭐ 810

电商用户行为分析大数据平台

Bigdata Playground ⭐ 154

A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL

Focusbigdata ⭐ 89

【大数据成神之路学习路径+面经+简历】

Apachespark ⭐ 59

This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We will be using pyspark & sparksql for the development. At the end of the course we also cover few case studies.

Big_data ⭐ 55

Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.

Datapipelines Essentials Python ⭐ 45

Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations

Paraflow ⭐ 36

A real-time analytical system for ID-associated data

Sparkdemo ⭐ 34

spark全示例代码(java、scala) Spark most full instance code DEMO (java、scala)

Hadooplearning ⭐ 25

全套大数据基础学习教程，包含最基础的centos、maven。大数据主要包含hdfs、mr、yarn

Movies Analytics In Spark And Scala ⭐ 24

Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.

Bigdatatutorial ⭐ 18

bigdatatutorial

Huemul Bigdatagovernance ⭐ 10

Huemul BigDataGovernance, es una framework que trabaja sobre Spark, Hive y HDFS. Permite la implementación de una estrategia corporativa de dato único, basada en buenas prácticas de Gobierno de Datos. Permite implementar tablas con control de Primary Key y Foreing Key al insertar y actualizar datos utilizando la librería, Validación de nulos, largos de textos, máximos/mínimos de números y fechas, valores únicos y valores por default. También permite clasificar los campos en aplicabilidad de der

Imooc Sparksql ⭐ 10

SparkSQL慕课网日志分析及可视化展示

Kyuubi Docker ⭐ 9

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Bigdata Examples ⭐ 9

bigdata examples about spark and flink

Run TPC-DS against different databases including Hive, Spark SQL and IBM BigSQL

Easynotes ⭐ 6

EasyNotes（简记）- sync with gitbook.

Distributable_docker_sql_on_hadoop ⭐ 6

Toy Hadoop cluster combining various SQL-on-Hadoop variants

Related Searches

Java Hadoop (2,117)

Spark Hadoop (1,188)

Hadoop Hdfs (1,082)

Hadoop Mapreduce (851)

Shell Hadoop (766)

Python Hadoop (761)

Hadoop Hive (709)

Scala Hadoop (479)

Hadoop Hbase (464)

Docker Hadoop (452)

1-13 of 13 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.