Corpora Alternatives

A collection of small corpuses of interesting data for the creation of bots and similar stuff.
Alternatives To Corpora
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Nltk12,69910,4962,2618 months ago59July 20, 2023268apache-2.0Python
NLTK Source
Nlp_chinese_corpus8,344
a year ago20mit
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Asrt_speechrecognition7,253
8 months ago1October 23, 2020101gpl-3.0Python
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Bert Pytorch5,605
1a year ago5October 23, 201863apache-2.0Python
Google AI 2018 BERT pytorch implementation
Tensorflow Wavenet5,362
a year ago176mitPython
A TensorFlow implementation of DeepMind's WaveNet paper
Nlp Datasets5,235
2 years ago7
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
Vespa5,1155588 months ago741November 30, 2023175apache-2.0Java
AI + Data, online. https://vespa.ai
Corpora4,757210 months ago1May 17, 201815JavaScript
A collection of small corpuses of interesting data for the creation of bots and similar stuff.
Go Fuzz4,67463509 months ago4October 19, 202356apache-2.0Go
Randomized testing for Go
Chinese Names Corpus3,719
9 months ago7apache-2.0
中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。
Alternatives To Corpora
Select To Compare


Alternative Project Comparisons
Popular Corpus Projects
Popular Data Processing Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Javascript
Corpus