Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Poseidon | 1,543 | 6 years ago | 9 | bsd-3-clause | Go | |||||
A search engine which can hold 100 trillion lines of log data. | ||||||||||
Cloud Computing Search Engine | 7 | 10 years ago | Java | |||||||
A cloud-based web search engine computing Hadoop MapReduce on Amazon EC2 consisting of crawler, indexer, PageRank. | ||||||||||
Search 1047 | 4 | 8 years ago | apache-2.0 | Java | ||||||
A simple search engine based on Nutch and Hadoop. | ||||||||||
Guse | 3 | 2 years ago | Java | |||||||
Search Engine based on Hadoop MapReduce | ||||||||||
Pi202202 Alako Backend | 2 | 7 months ago | 12 | Shell | ||||||
This repository contains all the files related to project's back-end and search algorithm. | ||||||||||
Simple Search Engine With Hadoop | 2 | 7 years ago | Java | |||||||
A simple search engine with crawler, word breaker , inverted sorting list and home page. | ||||||||||
Map Reduce Inverted Index | 2 | 4 years ago | Java | |||||||
Creating an Inverted Index of words occurring in a large set of documents extracted from web pages using Hadoop MapReduce and Google Dataproc | ||||||||||
Nypost_searchengine | 1 | 4 years ago | TypeScript | |||||||
Crawled and stored metadata of web pages using multithreaded crawler. Used GCP Hadoop cluster to create inverted index. Developed custom page rank algorithm and exposed RESTful APIs with spellchecker and autocomplete features. | ||||||||||
Information Retrieval From Wikipedia Dataset | 1 | 7 years ago | Java | |||||||
search engine for wikipedia | ||||||||||
Search_engine Hadoop Spark | 1 | 5 years ago | Jupyter Notebook | |||||||
Build a simple search engine for structured data |
波塞冬,是希腊神话中的海神,在这里是寓意着海量数据的主宰者。
Poseidon 系统是一个日志搜索平台,可以在数百万亿条、数百PB大小的日志数据中快速分析和检索特定字符串。 360公司是一个安全公司,在追踪 APT(高级持续威胁)事件时,经常需要在海量的历史日志数据中检索某些信息, 例如某个恶意样本在某个时间段内的活动情况。在 Poseidon 系统出现之前,都是写 Map/Reduce 计算任务在 Hadoop 集群中做计算, 一次任务所需的计算时间从数小时到数天不等,大大制约了 APT 事件的追踪效率。 Poseidon 系统就是为了解决这个需求,能在几秒钟内从数百万亿条规模的数据集中找出我们需要的数据,大大提高工作效率; 同时,这些数据不需要额外存储,仍然存放在Hadoop集群中,节省了大量存储和计算资源。该系统可以应用于任何结构化或非结构化海量(从万亿到千万亿规模)数据的查询检索需求。
这里存放的是数据生成工具
目前仅仅用来存放该项目中用到的 protobuf
定义
存放了相关的技术文档。
这里存放的是各个HTTP微服务服务的程序