Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Readtext | 112 | 5 | 4 | 3 months ago | 10 | June 03, 2023 | 30 | R | ||
an R package for reading text files | ||||||||||
Gum | 76 | 6 months ago | 6 | other | Python | |||||
Repository for the Georgetown University Multilayer Corpus (GUM) | ||||||||||
Eventstoryline | 70 | 7 months ago | 3 | other | DM | |||||
Event StoryLine Corpus - annotated data, baselines and evaluation scripts, evaluation data. | ||||||||||
Folia | 60 | 2 | 2 | 9 months ago | 93 | October 08, 2021 | 21 | gpl-3.0 | Python | |
FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for processing FoLiA is implemented as part of PyNLPl, this contains higher-level tools that use the library as well as the full documentation, validation schemas, and set definitions | ||||||||||
Craft | 58 | 2 years ago | 1 | other | Clojure | |||||
Deft_corpus | 57 | 4 years ago | 6 | other | Python | |||||
The Definition Extraction From Text corpus and relevant formatting scripts | ||||||||||
Ronec | 54 | a year ago | mit | Python | ||||||
Romanian Named Entity Corpus (RONEC) version 2.0 | ||||||||||
Broad_twitter_corpus | 52 | 2 years ago | 9 | other | Jupyter Notebook | |||||
The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors | ||||||||||
Morphorueval 2017 | 41 | 6 years ago | 13 | other | Python | |||||
Discoursegraphs | 34 | 1 | 1 | 3 years ago | 18 | March 14, 2021 | 46 | bsd-3-clause | Python | |
linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX). |