Awesome Open Source
Search results for hive
1,138 search results found
Cube
⭐
16,806
📊 Cube — The Semantic Layer for Building Data Applications
Apijson
⭐
16,586
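🏆 A zero-code, full-featured, highly secure ORM library 🚀 Backend APIs and docs require no code, and the frontend (client) customizes the data and structure of the returned JSON. 🏆 A JSON Transmission Protocol and an ORM Library 🚀 provides APIs and Docs without writing any code.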
Bigdata Notes
⭐
14,872
A beginner's guide to big data ⭐
Doris
⭐
11,243
Apache Doris is an easy-to-use, high performance and unified analytics database.
Trino
⭐
9,118
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
God Of Bigdata
⭐
8,483
Focused on big data study and interview preparation; the path to big data mastery starts here. Flink/Spark/Hadoop/HBase/Hive.
Beehive
⭐
5,936
A flexible event/agent & automation system with lots of bees 🐝
Hive
⭐
5,222
Apache Hive
Iceberg
⭐
5,179
Apache Iceberg
Sqlglot
⭐
4,652
Python SQL Parser and Transpiler
Hive
⭐
3,859
Lightweight and blazing fast key-value database written in pure Dart.
Sql Generator
⭐
3,346
🔨 Generates structured SQL statements from JSON; built with Vue3 + TypeScript + Vite + Ant Design + MonacoEditor. A simple project (heavy on logic, light on UI), well suited for practice.
Linkis
⭐
3,224
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
Dataspherestudio
⭐
2,860
DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Airpal
⭐
2,758
Web UI for PrestoDB.
Pypykatz
⭐
2,577
Mimikatz implementation in pure Python
Bigdataguide
⭐
2,355
Big data learning from scratch, including study videos and interview materials for each stage of learning
Parquet Mr
⭐
2,296
Apache Parquet
Szt Bigdata
⭐
2,055
Shenzhen Metro big data passenger flow analysis system 🚇🚄🌟
Quicksql
⭐
1,939
A Flexible, Fast, Federated (3F) SQL Analysis Middleware for Multiple Data Sources
Drill
⭐
1,856
Apache Drill is a distributed MPP query layer for self-describing data
Kyuubi
⭐
1,849
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Secor
⭐
1,828
Secor is a service implementing Kafka log persistence
Pyhive
⭐
1,617
Python interface to Hive and Presto. 🐝
Querybook
⭐
1,615
Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.
Metacat
⭐
1,518
Mongo Hadoop
⭐
1,511
MongoDB Connector for Hadoop
Movie_recommend
⭐
1,441
A Spark-based movie recommendation system, including a crawler project, a website, an admin backend, and a Spark recommendation system
Carbondata
⭐
1,401
High performance data store solution
Bigdata Growth
⭐
1,256
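A big data knowledge repository covering data warehouse modeling, real-time computing, big data, data middle platforms, system design, Java, algorithms, and more.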
Taier
⭐
1,220
Taier is a big data development platform for submission, scheduling, operation and maintenance, and indicator information display
Docs4dev
⭐
1,144
Documentation for commonly used backend frameworks, with Chinese translations, covering the Spring family (Spring, Spring Boot, Spring Cloud, Spring Security, Spring Session), big data (Apache Hive, HBase, Apache Flume), logging (Log4j2, Logback), HTTP servers (NGINX, Apache), Python, databases (OpenTSDB, MySQL, Pos
Elephant Bird
⭐
1,100
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
Addax
⭐
1,034
Addax is a versatile open-source ETL tool that can seamlessly transfer data between various RDBMS and NoSQL databases, making it an ideal solution for data migration.
Awesome Hadoop
⭐
987
A curated list of amazingly awesome Hadoop and Hadoop ecosystem resources
Queryparser
⭐
985
Parsing and analysis of Vertica, Hive, and Presto SQL.
Brickhouse
⭐
961
Hive UDFs for the data warehouse
Stream Reactor
⭐
960
A collection of open source Apache 2.0 Kafka Connectors maintained by Lenses.io.
Docker Hive
⭐
918
Hadoop_study
⭐
817
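Regularly updated documentation for commonly used big data components in the Hadoop ecosystem. Focus areas, in order: Flink, Solr, Sparksql, ES, Scala, Kafka, Hbase/phoenix, Redis, Kerberos. (The project includes Hadoop mind maps, Evernote notes, simple Scala demos, common utility classes, and desensitized train code; continuously updated!)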
Datacap
⭐
793
DataCap is integrated software for data transformation, integration, and visualization. It supports a variety of data sources, file types, big-data-related databases, relational databases, NoSQL databases, etc. The software can manage multiple data sources and perform various operations and conversions on the data under each source ...
Scriptis
⭐
767
Scriptis is for interactive data analysis with script development (SQL, Pyspark, HiveQL), task submission (Spark, Hive), UDF, function, resource management and intelligent diagnosis.
Nessie
⭐
762
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
Impyla
⭐
718
Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)
Hive Json Serde
⭐
706
Read - Write JSON SerDe for Apache Hive.
Yauaa
⭐
701
Yet Another UserAgent Analyzer
Coral
⭐
680
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
Sql Metadata
⭐
672
Uses tokenized query returned by python-sqlparse and generates query metadata
Nginx Lua Anti Ddos
⭐
649
An Anti-DDoS script to protect Nginx web servers using Lua, with an HTML/JavaScript-based authentication puzzle inspired by Cloudflare's "I am under attack" mode. An Anti-DDoS authentication page: protect yourself from every attack type. All Layer 7 Attacks, Mitigating Historic Attacks, DoS, DoS Implications, DDoS, All Brute Force Attacks, Zero day exploits, Social Engineering, Rainbow Tables, Password Cracking Tools, Password Lists, Dictionary Attacks, Time Delay, Any Hosting Provider, Any CMS or Custom Website, Unlimi
Blinkdb
⭐
625
BlinkDB: Sub-Second Approximate Queries on Very Large Data.
Wedatasphere
⭐
624
WeDataSphere is a financial grade, one-stop big data platform suite.
Yanagishima
⭐
584
Web UI for Trino, Hive and SparkSQL
Data Engineering Interview Questions
⭐
554
2000+ data engineer interview questions.
Tadpolefordbtools
⭐
512
Hivemall
⭐
508
Scalable machine learning library for Apache Hive/Spark/Pig
Drone
⭐
500
🍰 The missing library manager for Android Developers
Spiderman
⭐
498
A general-purpose distributed crawler framework based on scrapy-redis
Gis Tools For Hadoop
⭐
495
The GIS Tools for Hadoop are a collection of GIS tools for spatial analysis of big data.
Moonbox
⭐
487
Moonbox is a DVtaaS (Data Virtualization as a Service) Platform
Hadoop Ansible
⭐
416
Ansible playbook that installs a Hadoop cluster, with HBase, Hive, Presto for analytics, and Ganglia, Smokeping, Fluentd, Elasticsearch and Kibana for monitoring and centralized log indexing.
Python Registry
⭐
408
Pure Python parser for Windows Registry hives.
Regripper3.0
⭐
397
RegRipper3.0
Connectors
⭐
383
This library allows Scala and Java-based projects (including Apache Flink, Apache Hive, Apache Beam, and PrestoDB) to read from and write to Delta Lake.
Zdh_web
⭐
379
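A big data collection and extraction platform. zdh_web is the visual management platform for the zdh series of services, covering data collection, scheduling, permissions, and approvals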
Datafaker
⭐
377
Datafaker is a large-scale test data and flow test data generation tool. Datafaker fakes data and inserts it into various data sources. A test data generation tool
Bigdata
⭐
358
💎🔥 Big data study notes
Bnn
⭐
358
Bee detection TensorFlow conv net for a Raspberry Pi on the side of a hive
Big_data_architect_skills
⭐
353
Skills a big data architect should master
Flutter_data
⭐
352
Seamlessly manage persistent data in your Flutter apps
Trendingtopics
⭐
351
Rails app for tracking trends in server logs - powered by the Cloudera Hadoop Distribution on EC2
Hive
⭐
344
Ethereum end-to-end test harness
Spatial Framework For Hadoop
⭐
343
The Spatial Framework for Hadoop allows developers and data scientists to use the Hadoop data processing system for spatial data analysis.
Tutorials
⭐
321
StreamSets Tutorials
Hive
⭐
317
Fast. Scalable. Powerful. The Blockchain for Web3
Hive Testbench
⭐
315
Incubator Hivemall
⭐
308
Mirror of Apache Hivemall (incubating)
Woothee
⭐
304
User-Agent parser/classifier for multiple languages
Rsbi Pom
⭐
302
睿思BI: data dashboards, open source business intelligence, data visualization system
Skype Clone
⭐
302
Making a fully functional Skype clone in Flutter.
Sql Scripts
⭐
291
100+ SQL Scripts - PostgreSQL, MySQL, Google BigQuery, MariaDB, AWS Athena. DevOps / DBA / Analytics / performance engineering. Google BigQuery ML machine learning classification.
Transport
⭐
288
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Apache Hive, and Presto.
Cdh Twitter Example
⭐
284
Example application for analyzing Twitter data using CDH - Flume, Oozie, Hive
Jupysql
⭐
261
Better SQL in Jupyter. 📊
Facebook Hive Udfs
⭐
259
Facebook's Hive UDFs
Helicalinsight
⭐
256
Helical Insight software is the world's first Open Source Business Intelligence framework, which helps you make sense of your data and make well-informed decisions.
Reair
⭐
254
ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.
Waggle Dance
⭐
252
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
Hive Jdbc Uber Jar
⭐
252
Hive JDBC "uber" or "standalone" jar based on the latest Apache Hive version
Hiverunner
⭐
252
An Open Source unit test framework for Hive queries based on JUnit 4 and 5
Omniduct
⭐
247
A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (including HDFS, Hive, Presto, MySQL, etc).
Hive
⭐
237
API driven OpenShift cluster provisioning and management
Hadoop Tutorials Examples
⭐
228
Source, data and tutorials of the blog post video series of Hue, the Web UI for Hadoop.
Ecency Mobile
⭐
228
Ecency Mobile - reimagined social blogging, contribute and get rewarded (for Android and iOS)
Bigdata_docker
⭐
226
Big Data Ecosystem Docker
Flink Notes
⭐
223
Flink study notes
Regipy
⭐
222
Regipy is an OS-independent Python library for parsing offline registry hives
Gohive
⭐
217
Go driver for Apache Hive
Hive Third Functions
⭐
211
Some useful custom Hive UDFs, especially array, JSON, math, and string functions.
Emr Dynamodb Connector
⭐
210
Implementations of open source Apache Hadoop/Hive interfaces which allow for ingesting data from Amazon DynamoDB
Hadoop Docker
⭐
210
A Docker-based Hadoop development and test environment, including Hadoop, Hive, HBase, and Spark
Related Searches
Hadoop Hive (703)
Java Hive (697)
Spark Hive (529)
1-100 of 1,138 search results