Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
Hakrawler	4,120			3 months ago	11	February 22, 2021	9	gpl-3.0	Go
Simple, fast web crawler designed for easy, quick discovery of endpoints and assets within a web application
Trafilatura	2,447		66	3 months ago	39	November 29, 2023	66	gpl-3.0	Python
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Discv4 Dns Lists	63			3 months ago			2

Rec A Sketch	42			7 years ago				mit	JavaScript
content discovery... IN 3D
Crawlkit	23	6	5	7 years ago	34	May 23, 2016	1	mit	JavaScript
A crawler based on Phantom. Allows discovery of dynamic content and supports custom scrapers.
Domain_discovery_tool_deprecated	23			7 years ago			21		JavaScript
Seed acquisition tool to bootstrap focused crawlers
Block Crawler	21			6 years ago			3	mit	JavaScript
🕸️ discovery tool for legally restricted or censored HTTP resources (code 451 / RFC7725)
Ndcrawl	19			7 years ago			1	mit	Python
CDP/LLDP Network Discovery Crawler via Python/Netmiko
Content Discovery Hit Lists	11			7 years ago				gpl-3.0	Roff
This repository contains hit lists to use for web application content discovery.
Nutch Indexer Discovery	9			6 years ago			1		Java
Watson Discovery Service indexing plugin for Apache Nutch

Alternatives To Hakrawler

Select To Compare

Hakrawler ⭐ 4,120

Simple, fast web crawler designed for easy, quick discovery of endpoints and assets within a web application

total releases 11most recent commit 3 months ago

Trafilatura ⭐ 2,447

Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments

dependent packages 66total releases 39most recent commit 3 months ago

Discv4 Dns Lists ⭐ 63

most recent commit 3 months ago

Rec A Sketch ⭐ 42

content discovery... IN 3D

most recent commit 7 years ago

Crawlkit ⭐ 23

A crawler based on Phantom. Allows discovery of dynamic content and supports custom scrapers.

dependent packages 5total releases 34most recent commit 7 years ago

Domain_discovery_tool_deprecated ⭐ 23

Seed acquisition tool to bootstrap focused crawlers

most recent commit 7 years ago

Block Crawler ⭐ 21

🕸️ discovery tool for legally restricted or censored HTTP resources (code 451 / RFC7725)

most recent commit 6 years ago

Ndcrawl ⭐ 19

CDP/LLDP Network Discovery Crawler via Python/Netmiko

most recent commit 7 years ago

Content Discovery Hit Lists ⭐ 11

This repository contains hit lists to use for web application content discovery.

most recent commit 7 years ago

Nutch Indexer Discovery ⭐ 9

Watson Discovery Service indexing plugin for Apache Nutch

most recent commit 6 years ago

Suggest An Alternative To hakrawler

Alternative Project Comparisons

Hakrawler vs Trafilatura

Hakrawler vs Discv4 Dns Lists

Hakrawler vs Rec A Sketch

Hakrawler vs Crawlkit

Hakrawler vs Domain_discovery_tool_deprecated

Hakrawler vs Block Crawler

Hakrawler vs Ndcrawl

Hakrawler vs Content Discovery Hit Lists

Hakrawler vs Nutch Indexer Discovery

Popular Discovery Projects

Applied Ml ⭐ 24,828

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

most recent commit 6 months ago

Theharvester ⭐ 10,249

E-mails, subdomains and names Harvester - OSINT

total releases 2latest release February 04, 2019most recent commit 18 days ago

Rengine ⭐ 6,446

reNgine is an automated reconnaissance framework for web applications with a focus on highly configurable streamlined recon process via Engines, recon data correlation and organization, continuous monitoring, backed by a database, and simple yet intuitive User Interface. reNgine makes it easy for penetration testers to gather reconnaissance with minimal configuration and with the help of reNgine's correlation, it just makes recon effortless.

most recent commit 3 months ago

Discovery ⭐ 5,474

☀️ Nepxion Discovery is a solution for Spring Cloud with blue green, gray, route, limitation, circuit breaker, degrade, isolation, tracing, dye, failover, active 蓝绿灰度发布、路由、限流、熔断、降级、隔离、追踪、流量染色、故障转移、多活

dependent packages 1total releases 438latest release March 20, 2023most recent commit 3 months ago

Taptargetview ⭐ 5,278

An implementation of tap targets from the Material Design guidelines for feature discovery.

total releases 3latest release July 09, 2021most recent commit 9 months ago

Popular Crawler Projects

Scrapy ⭐ 49,918

Scrapy, a fast high-level web crawling & scraping framework for Python.

dependent packages 445total releases 96latest release September 18, 2023most recent commit 3 months ago

Lux ⭐ 24,752

👾 Fast and simple video download library and CLI tool written in Go

dependent packages 8total releases 40latest release November 06, 2023most recent commit 25 days ago

Colly ⭐ 21,902

Elegant Scraper and Crawler Framework for Golang

dependent packages 328total releases 22latest release March 08, 2022most recent commit a month ago

Easyspider ⭐ 20,149

A visual no-code/code-free web crawler/spider易采集：一个可视化浏览器自动化测试/数据采集/爬虫软件，可以无代码图形化

most recent commit 23 days ago

Proxy_pool ⭐ 19,442

Python ProxyPool for web spider

most recent commit 4 months ago

Popular Data Processing Categories