Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Uer Py | 2,802 | 6 months ago | 132 | apache-2.0 | Python | |||||
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo | ||||||||||
Atap | 367 | a year ago | 13 | apache-2.0 | Python | |||||
Code for Applied Text Analysis with Python | ||||||||||
Gutenberg Dammit | 108 | 5 years ago | 8 | Python | ||||||
I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this | ||||||||||
Gutenberg Poetry Corpus | 83 | 6 years ago | 2 | Jupyter Notebook | ||||||
A corpus of poetry from Project Gutenberg | ||||||||||
Gutenberg | 74 | 2 years ago | 2 | gpl-3.0 | Python | |||||
Pipeline to generate the Standardized Project Gutenberg Corpus | ||||||||||
Gwordlist | 68 | a year ago | 2 | Shell | ||||||
All the words from Google Books, sorted by frequency | ||||||||||
Video_music_book_datasets | 57 | 3 years ago | 1 | mit | ||||||
NLP NER datasets video/music/book bio | ||||||||||
Gutenberg Http | 54 | 5 years ago | 2 | apache-2.0 | Python | |||||
A HTTP interface to the Project Gutenberg corpus. | ||||||||||
Book Names Corpus | 45 | 3 years ago | apache-2.0 | |||||||
图书名语料库。含部分电影、游戏名称。 | ||||||||||
Proiel Treebank | 31 | a year ago | 2 | |||||||
Official releases of the PROIEL treebank of ancient Indo-European languages |