Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Warcbase | 154 | | | | 7 years ago | | | 38 | | Java |
Warcbase is an open-source platform for managing and analyzing web archives | ||||||||||
Aut | 128 | | | | a year ago | 27 | November 17, 2022 | 3 | apache-2.0 | Scala |
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives. | ||||||||||
Bifrost | 98 | | | | 5 years ago | | | 4 | epl-1.0 | Clojure |
Safely archive data from Apache Kafka to S3 with no Hadoop dependencies :) | ||||||||||
Warc Hadoop | 31 | 23 | 1 | 10 years ago | 1 | May 10, 2014 | 4 | mit | Java | |
WARC (Web Archive) Input and Output Formats for Hadoop | ||||||||||
Graylog Plugin Output Webhdfs | 11 | | | | 7 years ago | | | | mit | Java |
WebHDFS Output plugin for Graylog | ||||||||||
Ukwa Manage | 10 | | | | 8 months ago | | | 54 | apache-2.0 | Jupyter Notebook |
Shepherding our web archives from crawl to access. | ||||||||||
Hawarp | 7 | | | | 8 years ago | | | 1 | apache-2.0 | Arc |
HAdoop-based Web Archive Record Processing | ||||||||||
Archive | 7 | | | | 8 years ago | | | | apache-2.0 | Java |
An archive app based on CDH, providing upload and retrieval REST API | ||||||||||
Tarfilesystem | 6 | | | | 7 years ago | | | 5 | apache-2.0 | Java |
The Tar FileSystem for Hadoop lives here |