Folia Alternatives

Name: proycon/folia
Brand: proycon/folia
SKU: project/proycon/folia
Rating: 4.44 (60 reviews)

FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for processing FoLiA is implemented as part of PyNLPl, this contains higher-level tools that use the library as well as the full documentation, validation schemas, and set definitions

Categories > Machine Learning > Natural Language Processing

Suggest Alternative

Stars

Alternatives

License

gpl-3.0

Open Issues

Most Recent Commit

almost 3 years ago

Programming Language

Python

Monthly Downloads

Dependent Repos

Dependent Packages

Total Releases

Latest Release

October 08, 2021

Categories

Programming Languages > Python

Machine Learning > Natural Language Processing

Data Formats > Xml

Data Processing > Corpus

Data Formats > File Format

Site

Repo

Alternatives To proycon/folia

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
juand-r/entity-recognition-datasets	1,386	0	0	almost 3 years ago	0		7	mit	Python
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
propbank/propbank-release	112	0	0	almost 4 years ago	0		11	cc-by-sa-4.0
The official released annotations, both in .prop pointer format and as conll files. Does not contain the source texts
Yale-LILY/TutorialBank	85	0	0	over 3 years ago	0		0		HTML
UniversalDependencies/UD_Russian-SynTagRus	77	0	0	over 2 years ago	0		16	other	Perl
Russian data from the SynTagRus corpus.
amir-zeldes/gum	76	0	0	over 2 years ago	0		6	other	Python
Repository for the Georgetown University Multilayer Corpus (GUM)
ku-nlp/KWDLC	71	0	0	over 2 years ago	0		12		Python
Kyoto University Web Document Leads Corpus
korpling/ANNIS	67	4	4	over 2 years ago	45	February 03, 2023	44	apache-2.0	Java
ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with diverse types of annotation.
bdhingra/quasar	64	0	0	over 8 years ago	0		1	bsd-2-clause	Python
Datasets for Question Answering by Search and Reading
proycon/folia	60	2	2	almost 3 years ago	93	October 08, 2021	21	gpl-3.0	Python
FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for processing FoLiA is implemented as part of PyNLPl, this contains higher-level tools that use the library as well as the full documentation, validation schemas, and set definitions
nickyringland/nested_named_entities	60	0	0	almost 3 years ago	0		0		Python

Alternatives To proycon/folia

Select To Compare

juand-r/entity-recognition-datasets ⭐ 1,386

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.

dependent packages 0 total releases 0 most recent commit almost 3 years ago

propbank/propbank-release ⭐ 112

The official released annotations, both in .prop pointer format and as conll files. Does not contain the source texts

dependent packages 0 total releases 0 most recent commit almost 4 years ago

Yale-LILY/TutorialBank ⭐ 85

dependent packages 0 total releases 0 most recent commit over 3 years ago

UniversalDependencies/UD_Russian-SynTagRus ⭐ 77

Russian data from the SynTagRus corpus.

dependent packages 0 total releases 0 most recent commit over 2 years ago

amir-zeldes/gum ⭐ 76

Repository for the Georgetown University Multilayer Corpus (GUM)

dependent packages 0 total releases 0 most recent commit over 2 years ago

ku-nlp/KWDLC ⭐ 71

Kyoto University Web Document Leads Corpus

dependent packages 0 total releases 0 most recent commit over 2 years ago

korpling/ANNIS ⭐ 67

ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with diverse types of annotation.

dependent packages 4 total releases 45 most recent commit over 2 years ago

bdhingra/quasar ⭐ 64

Datasets for Question Answering by Search and Reading

dependent packages 0 total releases 0 most recent commit over 8 years ago

proycon/folia ⭐ 60

dependent packages 2 total releases 93 most recent commit almost 3 years ago downloads badge

nickyringland/nested_named_entities ⭐ 60

dependent packages 0 total releases 0 most recent commit almost 3 years ago

Suggest An Alternative To folia

Alternative Project Comparisons

proycon/folia vs Entity Recognition Datasets

proycon/folia vs Propbank Release

proycon/folia vs Tutorialbank

proycon/folia vs Ud_russian Syntagrus

proycon/folia vs Gum

proycon/folia vs Kwdlc

proycon/folia vs Annis

proycon/folia vs Quasar

proycon/folia vs Folia

proycon/folia vs Nested_named_entities

Popular Annotation Projects

akullpp/awesome-java⭐ 38,906

A curated list of awesome frameworks, libraries and software for the Java programming language.

PaddlePaddle/PaddleOCR⭐ 36,076

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

HumanSignal/label-studio⭐ 27,816

Label Studio is a multi-type data labeling and annotation tool with standardized output format

HumanSignal/labelImg⭐ 25,030

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.

wkentaro/labelme⭐ 16,037

Image annotation with Python. Supports polygon, rectangle, circle, line, point, and AI-assisted annotation.

Popular Corpus Projects

nltk/nltk⭐ 12,699

NLTK Source

brightmart/nlp_chinese_corpus⭐ 8,344

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

nl8590687/ASRT_SpeechRecognition⭐ 7,253

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

stanfordnlp/GloVe⭐ 6,480

Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings

codertimo/BERT-pytorch⭐ 5,605

Google AI 2018 BERT pytorch implementation

Popular Machine Learning Categories

Deep Learning

Machine Learning

Pytorch

Tensorflow

Natural Language Processing

Neural Network

Neural

Computer Vision

Convolutional Neural Networks

Opencv