Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for webarchive
webarchive
x
45 search results found
Downloadnet
⭐
3,578
💾 DownloadNet - All content you browse online available offline. Search through the full-text of all pages in your browser history. ⭐️ Star to support our work!
Pywb
⭐
1,259
Core Python Web Archiving Toolkit for replay and recording of web archives
Replayweb.page
⭐
574
Serverless replay of web archives directly in the browser
Oldweb Today
⭐
218
Browse emulated browsers connected to old web sites in your browser!
Warcio
⭐
173
Streaming WARC/ARC library for fast web archive IO
Squidwarc
⭐
163
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
Vandal
⭐
144
Navigator for Web Archive
Aut
⭐
128
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Cdx_toolkit
⭐
121
A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine
Archivespark
⭐
118
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Browsertrix Cloud
⭐
113
Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!
Archivefuzz
⭐
69
Hunt down the secrets from the WebArchives for Fun and Profit
Node Warc
⭐
62
Parse And Create Web ARChive (WARC) files with node.js
Warclight
⭐
47
A Rails engine supporting the discovery of web archives.
Webhackurls
⭐
34
Simple python OSINT tool for urls recon thanks to the waybackmachine.
Chatnoir Resiliparse
⭐
33
A robust web archive analytics toolkit
Warcworker
⭐
33
A dockerized, queued high fidelity web archiver based on Squidwarc
Robustlinks
⭐
32
Links on the web break all the time, robustify them!
Gogetcrawl
⭐
29
Extract web archive data using Wayback Machine and Common Crawl
Quickcacheandarchivesearch
⭐
27
Quick Cache and Archive search buttons
Web Snap
⭐
25
Create "perfect" snapshots of web pages
Python Webarchive
⭐
24
Create WebKit/Safari .webarchive files on any platform
Notebooks
⭐
18
Various examples of notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archives Unleashed Toolkit.
Bookmark Archiver
⭐
18
🗄 Save an archived copy of websites from Pocket/Pinboard/Bookmarks/RSS. Outputs HTML, PDFs, and more...
Archive Query Log
⭐
18
📜 The Archive Query Log.
Ark
⭐
18
🚢 A self-hosted, personal archival application
Seeder
⭐
15
Seeder - Czech webarchive curating tool and public site
Shaman.dokan.warc
⭐
15
Mounts WARC files on Windows
Mixnode Warcreader Php
⭐
14
Read Web ARChive (WARC) files in PHP.
Mementoembed
⭐
13
A service that provides archive-aware oEmbed-compatible embeddable surrogates (social cards, thumbnails, etc.) for archived web pages (mementos).
Yggo
⭐
12
YGGo! Distributed Web Search Engine
Auk
⭐
11
Rails application for the Archives Unleashed Cloud.
Docker Aut
⭐
11
Docker image for the Archives Unleashed Toolkit
Ukwa Manage
⭐
10
Shepherding our web archives from crawl to access.
Devilfish
⭐
9
A utility for simultaneously creating full-page PDF snapshots and web archives of web pages in DEVONthink Pro.
Raintale
⭐
8
A Python utility for publishing a social media story built from archived web pages to multiple services.
Hadoopconcatgz
⭐
7
A Splitable Hadoop InputFormat for Concatenated GZIP Files and *.(w)arc.gz
Zotero Robust Links Extension
⭐
6
Create Robust Links from within Zotero
Cdx Summary
⭐
6
Summarize web archive capture index (CDX) files.
Webarchive To Singlefile
⭐
6
This command line converts .webarchive file to resources embed .html file
Veidemann Harvester
⭐
5
Aiu
⭐
5
A library for interacting with web archive collections at Archive-It, Trove, Pandora, and more.
Warcprotocol
⭐
5
Parser for WARC (aka WebArchive) files
Ukwa Gsheets Utils
⭐
5
Add-On for Google Sheets to help those working with web archives.
Rss Link Database
⭐
5
Bookmarked archived links
1-45 of 45 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.