Awesome Open Source

Programming Languages

Wmirror

wmirror allows you to download any website from the Internet to a local directory, building recursively all directories, getting HTML, images, and other files from the server to your computer.

Categories > Data Processing > Crawler

Suggest Alternative

Stars

11

License

gpl-3.0

Most Recent Commit

2 years ago

Programming Language

Shell

Categories

Data Processing > Crawler

Applications > Wget

Alternatives To Wmirror

Project Name	Stars	Downloads	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
Awesome Datahoarding	892				7 months ago			4
List of data-hoarding related tools
Archivebot	328				5 months ago			169	mit	Python
ArchiveBot, an IRC bot for archiving websites
Bitextor	260				7 months ago			4	gpl-3.0	Python
Bitextor generates translation memories from multilingual websites
Google Group Crawler	213				2 years ago			6		Shell
[Deprecated] Get (almost) original messages from google group archives. Your data is yours.
Authority Data	83				3 months ago			1	gpl-3.0	Python
官方权威数据：统计年签，统计公报，互联网行业报告，工信部数据，ICT报告等 Official authoritative data (Chinese)
Fetchurls	79				2 years ago			1	mit	Shell
A bash script to spider a site, follow links, and fetch urls (with built-in filtering) into a generated text file.
Wget Lua	72				4 months ago			10	gpl-3.0	C
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Wmirror	11				2 years ago				gpl-3.0	Shell
wmirror allows you to download any website from the Internet to a local directory, building recursively all directories, getting HTML, images, and other files from the server to your computer.
Metagoofeel	10				a year ago				mit	Shell
Web crawler and downloader based on GNU Wget.
Pywebquery	10				12 years ago					Python
a jquery liked pythonic web crawler library ,it's based on BeautifulSoup and wget

Alternatives To Wmirror

Select To Compare

Awesome Datahoarding ⭐ 892

List of data-hoarding related tools

most recent commit 7 months ago

Archivebot ⭐ 328

ArchiveBot, an IRC bot for archiving websites

most recent commit 5 months ago

Bitextor ⭐ 260

Bitextor generates translation memories from multilingual websites

most recent commit 7 months ago

Google Group Crawler ⭐ 213

[Deprecated] Get (almost) original messages from google group archives. Your data is yours.

most recent commit 2 years ago

Authority Data ⭐ 83

官方权威数据：统计年签，统计公报，互联网行业报告，工信部数据，ICT报告等 Official authoritative data (Chinese)

most recent commit 3 months ago

Fetchurls ⭐ 79

A bash script to spider a site, follow links, and fetch urls (with built-in filtering) into a generated text file.

most recent commit 2 years ago

Wget Lua ⭐ 72

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

most recent commit 4 months ago

wmirror allows you to download any website from the Internet to a local directory, building recursively all directories, getting HTML, images, and other files from the server to your computer.

most recent commit 2 years ago

Metagoofeel ⭐ 10

Web crawler and downloader based on GNU Wget.

most recent commit a year ago

Pywebquery ⭐ 10

a jquery liked pythonic web crawler library ,it's based on BeautifulSoup and wget

most recent commit 12 years ago

Suggest An Alternative To WMIRROR

Alternative Project Comparisons

Wmirror vs Awesome Datahoarding

Wmirror vs Archivebot

Wmirror vs Bitextor

Wmirror vs Google Group Crawler

Wmirror vs Authority Data

Wmirror vs Fetchurls

Wmirror vs Wget Lua

Wmirror vs Metagoofeel

Wmirror vs Pywebquery

Popular Crawler Projects

Scrapy ⭐ 49,918

Scrapy, a fast high-level web crawling & scraping framework for Python.

dependent packages 445total releases 96latest release September 18, 2023most recent commit 3 months ago

pypi Scrapy} Downloads

👾 Fast and simple video download library and CLI tool written in Go

dependent packages 8total releases 40latest release November 06, 2023most recent commit 22 days ago

Colly ⭐ 21,902

Elegant Scraper and Crawler Framework for Golang

dependent packages 328total releases 22latest release March 08, 2022most recent commit a month ago

Easyspider ⭐ 20,149

A visual no-code/code-free web crawler/spider易采集：一个可视化浏览器自动化测试/数据采集/爬虫软件，可以无代码图形化

most recent commit 20 days ago

Proxy_pool ⭐ 19,442

Python ProxyPool for web spider

most recent commit 4 months ago

Popular Wget Projects

Archivebox ⭐ 19,721

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

dependent packages 1total releases 26latest release November 04, 2023most recent commit 8 days ago

pypi archivebox} Downloads

Workflow ⭐ 12,038

C++ Parallel Computing and Asynchronous Networking Framework

most recent commit 3 months ago

yq is a portable command-line YAML, JSON, XML, CSV, TOML and properties processor

dependent packages 77total releases 132latest release December 04, 2023most recent commit 3 months ago

Iscript ⭐ 4,923

各种脚本 -- 关于虾米 xiami.com, 百度网盘 pan.baidu.com, 115网盘 115.com, 网易音乐 music.163.com, 百度音乐 music.baidu.com, 360网盘/云盘 yunpan.cn, 视频解析 flvxz.com, bt torrent ↔ magnet, ed2k 搜索, tumblr 图片下载, unzip

most recent commit a year ago

Gdown ⭐ 3,659

Google Drive Public File/Folder Downloader (curl/wget fails due to the security notice).

dependent packages 401total releases 84latest release March 25, 2023most recent commit 3 months ago

pypi gdown} Downloads

Popular Data Processing Categories

Jupyter Notebook

Related Searches

Get A Weekly Email With Trending Projects For These Categories

No Spam. Unsubscribe easily at any time.

Crawler

Wget

Privacy | About | Terms | Follow Us On Twitter

Downloads, Dependent Repos, Dependent Packages, Total Releases, Latest Releases data powered by Libraries.io.

Copyright 2018-2024 Awesome Open Source. All rights reserved.