Folia

FoLiA (Format for Linguistic Annotation) is a rich XML-based annotation format for representing language resources, including corpora, with linguistic annotations. It supports a wide variety of linguistic annotation types, which makes it useful for NLP tasks and data interchange. Note that the actual Python library for processing FoLiA is implemented as part of PyNLPl; this repository contains higher-level tools that use the library, as well as the full documentation, validation schemas, and set definitions.
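To make "XML-based annotation format" concrete, here is a minimal sketch that reads tokens and part-of-speech tags from a small FoLiA-style fragment using only Python's standard library. The fragment is hand-written and heavily simplified: the element names (`w`, `t`, `pos`) and the namespace URI reflect my understanding of FoLiA and should be checked against the official documentation. The helper `read_tokens` is hypothetical and is not part of PyNLPl or of the FoLiA tools.

```python
import xml.etree.ElementTree as ET

# Assumption: the FoLiA XML namespace; verify against the official schema.
FOLIA_NS = "http://ilk.uvt.nl/folia"

# Hand-written, simplified fragment in the style of FoLiA. Real documents
# also carry metadata and annotation declarations, omitted here.
SAMPLE = """<FoLiA xmlns="http://ilk.uvt.nl/folia" xml:id="example">
  <text xml:id="example.text">
    <s xml:id="example.s.1">
      <w xml:id="example.s.1.w.1"><t>Hello</t><pos class="UH"/></w>
      <w xml:id="example.s.1.w.2"><t>world</t><pos class="NN"/></w>
    </s>
  </text>
</FoLiA>"""


def read_tokens(xml_text):
    """Return (token text, POS class) pairs for every word element."""
    root = ET.fromstring(xml_text)
    ns = {"f": FOLIA_NS}
    tokens = []
    for word in root.iterfind(".//f:w", ns):
        text = word.findtext("f:t", default="", namespaces=ns)
        pos = word.find("f:pos", ns)
        # A word may lack a POS annotation; record None in that case.
        tokens.append((text, pos.get("class") if pos is not None else None))
    return tokens


if __name__ == "__main__":
    print(read_tokens(SAMPLE))
```

Run as a script, this prints `[('Hello', 'UH'), ('world', 'NN')]`. For real FoLiA documents, the dedicated library mentioned above is the better choice, since it also handles validation and set definitions.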
Alternatives To Folia
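Entity Recognition Datasets
Stars: 1,386 | Most Recent Commit: 7 months ago | Open Issues: 7 | License: mit | Language: Python
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.

Propbank Release
Stars: 112 | Most Recent Commit: 2 years ago | Open Issues: 11 | License: cc-by-sa-4.0
The official released annotations, both in .prop pointer format and as conll files. Does not contain the source texts

Tutorialbank
Stars: 85 | Most Recent Commit: a year ago | Language: HTML

Ud_russian Syntagrus
Stars: 77 | Most Recent Commit: 6 months ago | Open Issues: 16 | License: other | Language: Perl
Russian data from the SynTagRus corpus.

Gum
Stars: 76 | Most Recent Commit: 5 months ago | Open Issues: 6 | License: other | Language: Python
Repository for the Georgetown University Multilayer Corpus (GUM)

Kwdlc
Stars: 71 | Most Recent Commit: 4 months ago | Open Issues: 12 | Language: Python
Kyoto University Web Document Leads Corpus

Annis
Stars: 67 | Downloads / Repos Using This / Packages Using This (not separable in the source): 44 | Most Recent Commit: 3 months ago | Total Releases: 45 | Latest Release: February 03, 2023 | Open Issues: 44 | License: apache-2.0 | Language: Java
ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with diverse types of annotation.

Quasar
Stars: 64 | Most Recent Commit: 6 years ago | Open Issues: 1 | License: bsd-2-clause | Language: Python
Datasets for Question Answering by Search and Reading

Nested_named_entities
Stars: 60 | Most Recent Commit: 8 months ago | Language: Python

Folia
Stars: 60 | Downloads / Repos Using This / Packages Using This (not separable in the source): 22 | Most Recent Commit: 9 months ago | Total Releases: 93 | Latest Release: October 08, 2021 | Open Issues: 21 | License: gpl-3.0 | Language: Python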