Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Dockerfiles | 1,171 | | | | 5 months ago | | | 15 | mit | Shell |
50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian, Fedora, Ubuntu, Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak | ||||||||||
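Hadoop_study | 817 | | | | 2 years ago | | | 21 | | Java |
Regularly updated documentation for commonly used big data components in the Hadoop ecosystem. Focus areas, in order of priority: Flink, Solr, Sparksql, ES, Scala, Kafka, Hbase/phoenix, Redis, Kerberos. (The project includes Hadoop mind maps, Evernote notes, simple Scala demos, common utility classes, and desensitized train code. Continuously updated!) | ||||||||||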
Devops Python Tools | 709 | | | | 3 months ago | | | 37 | mit | Python |
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc. | ||||||||||
Spark Solr | 440 | 21 | 2 | a year ago | 102 | June 29, 2023 | 57 | apache-2.0 | Scala | |
Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ. | ||||||||||
Sparkler | 401 | | | | a year ago | | | 55 | apache-2.0 | Java |
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark. | ||||||||||
Logisland | 106 | 2 | 34 | a year ago | 12 | January 24, 2023 | 183 | other | Java | |
Scalable stream processing platform for advanced real-time analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink is on the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of ready-to-use processors, data sources, and sinks is available. | ||||||||||
Soda | 64 | | | | 4 years ago | | | 5 | apache-2.0 | Scala |
Solr Dictionary Annotator (Microservice for Spark) | ||||||||||
Searchhub | 43 | | | | 6 years ago | | | 35 | other | Python |
Fusion demo app searching open-source project data from the Apache Software Foundation | ||||||||||
Almaren Framework | 28 | 3 | 5 months ago | 51 | October 12, 2023 | 3 | apache-2.0 | Scala | ||
The Almaren Framework provides a simplified, consistent, minimalistic layer over Apache Spark while still allowing you to take advantage of native Apache Spark features. You can still combine it with standard Spark code. | ||||||||||
Wasp | 25 | 15 | 7 months ago | 25 | September 14, 2023 | 4 | other | Scala | ||
WASP is a framework for building complex real-time big data applications. It relies on a kind of Kappa/Lambda architecture, mainly leveraging Kafka and Spark. If you need to ingest huge amounts of heterogeneous data and analyze it through complex pipelines, this is the framework for you. |