Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Nltk | 12,699 | 10,496 | 2,261 | 8 months ago | 59 | July 20, 2023 | 268 | apache-2.0 | Python | |
NLTK Source | ||||||||||
Nlp_chinese_corpus | 8,344 | a year ago | 20 | mit | ||||||
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP | ||||||||||
Bert Pytorch | 5,605 | 1 | a year ago | 5 | October 23, 2018 | 63 | apache-2.0 | Python | ||
Google AI 2018 BERT pytorch implementation | ||||||||||
Nlp Datasets | 5,235 | 2 years ago | 7 | |||||||
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP) | ||||||||||
Nlp_tasks | 2,904 | 6 years ago | apache-2.0 | |||||||
Natural Language Processing Tasks and References | ||||||||||
Uer Py | 2,802 | 10 months ago | 132 | apache-2.0 | Python | |||||
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo | ||||||||||
Cluedatasetsearch | 2,778 | 2 years ago | 6 | Python | ||||||
搜索所有中文NLP数据集,附常用英文NLP数据集 | ||||||||||
Awesome Deeplearning Resources | 2,739 | 8 months ago | 2 | mit | ||||||
Deep Learning and deep reinforcement learning research papers and some codes | ||||||||||
Trafilatura | 2,447 | 66 | 8 months ago | 39 | November 29, 2023 | 66 | gpl-3.0 | Python | ||
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments | ||||||||||
Gpt2 Ml | 1,674 | a year ago | 22 | apache-2.0 | Python | |||||
GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型 |