Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Gse | 2,352 | 14 | 21 | 6 months ago | 82 | January 16, 2023 | 12 | apache-2.0 | Go | |
Go efficient multilingual NLP and text segmentation; support English, Chinese, Japanese and others. | ||||||||||
Kagome | 769 | 27 | 3 months ago | 74 | September 27, 2023 | 4 | mit | Go | ||
Self-contained Japanese Morphological Analyzer written in pure Go | ||||||||||
Nagisa | 365 | 1 | 7 | 4 months ago | 22 | July 30, 2023 | 4 | mit | Python | |
A Japanese tokenizer based on recurrent neural networks | ||||||||||
Sudachipy | 318 | 2 years ago | 18 | apache-2.0 | Python | |||||
Python version of Sudachi, a Japanese tokenizer. | ||||||||||
Vibrato | 275 | 1 | 4 months ago | 11 | May 12, 2023 | 3 | apache-2.0 | Rust | ||
🎤 vibrato: Viterbi-based accelerated tokenizer | ||||||||||
Vaporetto | 206 | 3 | 6 months ago | 16 | April 01, 2023 | apache-2.0 | Rust | |||
🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer | ||||||||||
Unicopedia Plus | 144 | 8 months ago | mit | JavaScript | ||||||
Developer-oriented set of Unicode, Unihan & emoji utilities wrapped into one single app, built with Electron. | ||||||||||
Segmentation Kit | 53 | 4 years ago | 2 | mit | Perl | |||||
Speech Segmentation Toolkit using Julius | ||||||||||
Ja_sentence_segmenter | 46 | a year ago | mit | Python | ||||||
japanese sentence segmentation library for python | ||||||||||
Pyjuliusalign | 39 | 10 months ago | 13 | September 03, 2021 | 3 | other | Python | |||
One-button-press forced aligner for Japanese, using Julius. |