Awesome Open Source
Search results for warc internet archiving
4 search results found
ArchiveBox
⭐ 20,008
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Forum Dl
⭐ 26
Scrape posts and threads from forums, news aggregators, and mail archives; export to JSONL, mailbox, or WARC.
Internet Archiving Talk
⭐ 8
🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.
DigestBox
⭐ 7
DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.
Related Searches
Python Warc (95)
Archive Warc (87)
Warc Web Archiving (32)
Archivebox Internet Archiving (11)
Html Warc (9)
Digipres Internet Archiving (8)
Python Internet Archiving (7)
Docker Internet Archiving (4)
Javascript Internet Archiving (4)
Html Internet Archiving (4)
Copyright 2018-2024 Awesome Open Source. All rights reserved.