Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Aut | 128 | 10 months ago | 27 | November 17, 2022 | 3 | apache-2.0 | Scala | |||
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives. | ||||||||||
Archivespark | 118 | 3 years ago | 7 | September 16, 2019 | 4 | mit | Scala | |||
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive. | ||||||||||
Sradbv2 | 19 | 5 years ago | 12 | R | ||||||
R Interface to the NCBI SRA metadata | ||||||||||
Notebooks | 18 | a year ago | apache-2.0 | Jupyter Notebook | ||||||
Various examples of notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archives Unleashed Toolkit. | ||||||||||
Nspark | 12 | 2 years ago | 2 | C | ||||||
Nspark dearchiver for RISC OS archives | ||||||||||
Docker Aut | 11 | a year ago | other | Dockerfile | ||||||
Docker image for the Archives Unleashed Toolkit | ||||||||||
Twut | 7 | 2 years ago | 1 | December 10, 2019 | 1 | apache-2.0 | Scala | |||
An open-source toolkit for analyzing line-oriented JSON Twitter archives with Apache Spark. |