Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Mtdata | 115 | a year ago | 21 | November 25, 2022 | 22 | apache-2.0 | Python | |||
A tool that locates, downloads, and extracts machine translation corpora | ||||||||||
Verapdf Corpus | 66 | 6 months ago | 8 | |||||||
veraPDF test corpus for ISO 19005 (PDF/A) and ISO 14289 (PDF/UA) | ||||||||||
Pdf Corpora | 60 | a year ago | cc-by-4.0 | |||||||
An index of PDF-centric corpora | ||||||||||
Dialogueact Tagger | 42 | 3 years ago | 5 | HTML | ||||||
A resource to create a multi domain Dialog Act Tagger for conversational agents using publicly available data | ||||||||||
Asp Source | 18 | a year ago | 1 | |||||||
Source stories from the African Storybook Project in Markdown format | ||||||||||
Fast_umorph | 6 | 11 years ago | 1 | C++ | ||||||
Unsupervised morphology induction with OpenFst |