Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Nltk | 12,699 | 10,496 | 2,261 | 8 months ago | 59 | July 20, 2023 | 268 | apache-2.0 | Python | |
NLTK Source | ||||||||||
Nlp_chinese_corpus | 8,344 | a year ago | 20 | mit | ||||||
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP | ||||||||||
Asrt_speechrecognition | 7,253 | 8 months ago | 1 | October 23, 2020 | 101 | gpl-3.0 | Python | |||
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统 | ||||||||||
Bert Pytorch | 5,605 | 1 | a year ago | 5 | October 23, 2018 | 63 | apache-2.0 | Python | ||
Google AI 2018 BERT pytorch implementation | ||||||||||
Tensorflow Wavenet | 5,362 | a year ago | 176 | mit | Python | |||||
A TensorFlow implementation of DeepMind's WaveNet paper | ||||||||||
Nlp Datasets | 5,235 | 2 years ago | 7 | |||||||
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP) | ||||||||||
Vespa | 5,115 | 5 | 58 | 8 months ago | 741 | November 30, 2023 | 175 | apache-2.0 | Java | |
AI + Data, online. https://vespa.ai | ||||||||||
Corpora | 4,757 | 2 | 10 months ago | 1 | May 17, 2018 | 15 | JavaScript | |||
A collection of small corpuses of interesting data for the creation of bots and similar stuff. | ||||||||||
Go Fuzz | 4,674 | 6 | 350 | 9 months ago | 4 | October 19, 2023 | 56 | apache-2.0 | Go | |
Randomized testing for Go | ||||||||||
Chinese Names Corpus | 3,719 | 9 months ago | 7 | apache-2.0 | ||||||
中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。 |