Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
News Corpus | 162 | 3 years ago | apache-2.0 | |||||||
Corpus tiếng việt | ||||||||||
Ocr2text | 67 | 2 years ago | 5 | mit | Python | |||||
Convert a PDF via OCR to a TXT file in UTF-8 encoding | ||||||||||
Superalloydigger | 46 | 4 months ago | 2 | mit | Python | |||||
The functions of superalloyDigger toolkit include batch downloading documents in XML and TXT format from the Elsevier database, locating target sentences from the full text and automatically extracting triple information in the form of <material name, property specifier, value>. | ||||||||||
Free French Treebank | 26 | 8 years ago | ||||||||
free French treebank | ||||||||||
Openconvert | 19 | 2 years ago | 4 | Java | ||||||
Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA) | ||||||||||
Rare Words Finder | 5 | 12 years ago | ||||||||
Find rare words in a corpus |