Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for natural language processing corpora
corpora
x
natural-language-processing
x
6 search results found
Entity Recognition Datasets
⭐
1,386
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Indicnlp_catalog
⭐
487
A collaborative catalog of NLP resources for Indic languages
Corus
⭐
254
Links to Russian corpora + Python functions for loading and parsing
Lm Spanish
⭐
220
Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).
Open Korean Corpora
⭐
117
Open Korean NLP Dataset Curation for the Users All Around the Globe
Self_dialogue_corpus
⭐
86
The Self-dialogue Corpus - a collection of self-dialogues across music, movies and sports
Spanish Corpora
⭐
86
Unannotated Spanish 3 Billion Words Corpora
Ccae
⭐
59
The Official Repository for 👉 CCAE: A Corpus of Chinese-based Asian Englishes @ NLPCC 2023
Arabic News Article Classification
⭐
43
Automatic categorization of documents, consists in assigning a category to a text based on the information it contains. We'll follow different approach of Supervised Machine Learning.
Parallel Corpora Tools
⭐
39
Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
Corpusloaders.jl
⭐
31
A variety of loaders for various NLP corpora.
Awesome Cantonese Nlp
⭐
29
A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP
Gerparcor
⭐
16
German Parliamentary Corpus (GerParCor)
Textstelle
⭐
14
Textstelle is a collection of corpora for the creation of bots and other things that generate text 🤖
Opiec
⭐
12
Reading the data from OPIEC - an Open Information Extraction corpus
Lm Biomedical Clinical Es
⭐
12
Official source for Spanish pretrained biomedical and clinical language models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).
Wiki Dump Reader
⭐
10
Extract corpora from Wikipedia dumps
Potts
⭐
9
The Potsdam Twitter Sentiment Corpus
Corpus_similarity
⭐
9
Measure the similarity of text corpora for 74 languages
Awesome Swedish Nlp
⭐
6
A curated list of resources for natural language processing (NLP) in Swedish
Brat Peek
⭐
5
Framework for working with brat-annotated .ann files
Related Searches
Python Natural Language Processing (7,915)
Jupyter Notebook Natural Language Processing (4,405)
Machine Learning Natural Language Processing (3,939)
Deep Learning Natural Language Processing (2,414)
Pytorch Natural Language Processing (1,212)
Dataset Natural Language Processing (1,010)
Artificial Intelligence Natural Language Processing (1,010)
Tensorflow Natural Language Processing (909)
Javascript Natural Language Processing (843)
Natural Language Processing Chatbot (726)
1-6 of 6 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.