Corpus_similarity

Measure the similarity of text corpora for 74 languages
Alternatives To Corpus_similarity
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Nltk12,69910,4962,2613 months ago59July 20, 2023268apache-2.0Python
NLTK Source
Nlp_chinese_corpus8,344
a year ago20mit
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Bert Pytorch5,605
19 months ago5October 23, 201863apache-2.0Python
Google AI 2018 BERT pytorch implementation
Nlp Datasets5,235
a year ago7
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
Nlp_tasks2,904
6 years agoapache-2.0
Natural Language Processing Tasks and References
Uer Py2,802
5 months ago132apache-2.0Python
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Cluedatasetsearch2,778
a year ago6Python
搜索所有中文NLP数据集,附常用英文NLP数据集
Awesome Deeplearning Resources2,739
3 months ago2mit
Deep Learning and deep reinforcement learning research papers and some codes
Trafilatura2,447663 months ago39November 29, 202366gpl-3.0Python
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Gpt2 Ml1,674
a year ago22apache-2.0Python
GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型
Alternatives To Corpus_similarity
Select To Compare


Alternative Project Comparisons
Popular Corpus Projects
Popular Natural Language Processing Projects
Popular Data Processing Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Python
Language
Natural Language Processing
Corpus