Nlp Datasets

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
Alternatives To Nlp Datasets
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Nltk12,69910,4962,2612 months ago59July 20, 2023268apache-2.0Python
NLTK Source
Nlp_chinese_corpus8,344
10 months ago20mit
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Bert Pytorch5,605
18 months ago5October 23, 201863apache-2.0Python
Google AI 2018 BERT pytorch implementation
Nlp Datasets5,235
a year ago7
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
Nlp_tasks2,904
6 years agoapache-2.0
Natural Language Processing Tasks and References
Uer Py2,802
4 months ago132apache-2.0Python
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Cluedatasetsearch2,778
a year ago6Python
搜索所有中文NLP数据集,附常用英文NLP数据集
Awesome Deeplearning Resources2,739
2 months ago2mit
Deep Learning and deep reinforcement learning research papers and some codes
Trafilatura2,447662 months ago39November 29, 202366gpl-3.0Python
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Gpt2 Ml1,674
10 months ago22apache-2.0Python
GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型
Alternatives To Nlp Datasets
Select To Compare


Alternative Project Comparisons
Popular Natural Language Processing Projects
Popular Corpus Projects
Popular Machine Learning Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Natural Language Processing
Corpus