Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Chinese Names Corpus | 3,719 | 4 months ago | 7 | apache-2.0 | ||||||
中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。 | ||||||||||
Uer Py | 2,802 | 5 months ago | 132 | apache-2.0 | Python | |||||
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo | ||||||||||
Cluedatasetsearch | 2,778 | a year ago | 6 | Python | ||||||
搜索所有中文NLP数据集,附常用英文NLP数据集 | ||||||||||
Entity Recognition Datasets | 1,386 | 6 months ago | 7 | mit | Python | |||||
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types. | ||||||||||
Company Names Corpus | 1,106 | a year ago | 3 | apache-2.0 | ||||||
公司名语料库。机构名语料库。公司简称,缩写,品牌词,企业名。可用于中文分词、机构名实体识别。 | ||||||||||
Bertweet | 542 | 4 months ago | mit | Python | ||||||
BERTweet: A pre-trained language model for English Tweets (EMNLP-2020) | ||||||||||
Ner Lstm | 528 | 5 years ago | 12 | Python | ||||||
Named Entity Recognition using multilayered bidirectional LSTM | ||||||||||
Korpora | 500 | 3 | 2 years ago | 7 | January 11, 2021 | 28 | cc-by-4.0 | Python | ||
Korean corpus repository | ||||||||||
Chinese Nlp Corpus | 378 | 3 years ago | 1 | Python | ||||||
Collections of Chinese NLP corpus | ||||||||||
Turkish Bert | 364 | a year ago | 11 | Python | ||||||
Turkish BERT/DistilBERT, ELECTRA and ConvBERT models |