Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Gse | 2,352 | 14 | 21 | 7 months ago | 82 | January 16, 2023 | 12 | apache-2.0 | Go | |
Efficient multilingual NLP and text segmentation in Go; supports English, Chinese, Japanese, and other languages. | ||||||||||
Scattertext | 2,131 | 8 | 2 | 10 months ago | 148 | April 18, 2023 | 22 | apache-2.0 | Python | |
Beautiful visualizations of how language differs among document types. | ||||||||||
Budou | 1,135 | 1 | 2 years ago | 36 | November 07, 2019 | 6 | apache-2.0 | Python | ||
Budou is an automatic organizer tool for beautiful line breaking in CJK (Chinese, Japanese, and Korean). | ||||||||||
Ginza | 676 | 12 | 9 months ago | 19 | September 25, 2023 | 11 | mit | Python | ||
A Japanese NLP library based on Universal Dependencies, using spaCy as its framework. | ||||||||||
Awesome Japanese Nlp Resources | 522 | | | | 5 months ago | | | | cc0-1.0 | |
A curated list of Python libraries, LLMs, dictionaries, and corpora for Japanese NLP. | ||||||||||
Japanese Pretrained Models | 479 | | | | 2 years ago | | | 3 | apache-2.0 | Python |
Code for producing Japanese pretrained models provided by rinna Co., Ltd. | ||||||||||
Nagisa | 365 | 1 | 7 | 5 months ago | 22 | July 30, 2023 | 4 | mit | Python | |
A Japanese tokenizer based on recurrent neural networks | ||||||||||
Pykakasi | 349 | 11 | 18 | 2 years ago | 49 | April 14, 2022 | 1 | gpl-3.0 | Python | |
Lightweight converter from Japanese kana-kanji sentences into kana and romaji. | ||||||||||
Fugashi | 339 | 39 | 6 months ago | 67 | August 25, 2023 | 5 | mit | C++ | ||
A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis. | ||||||||||
Jumanpp | 334 | | | | a year ago | | | 30 | apache-2.0 | C++ |
Juman++ (a Morphological Analyzer Toolkit) |