Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Sudachi | 684 | 3 | 8 months ago | 22 | June 23, 2023 | 20 | apache-2.0 | Java | ||
A Japanese Tokenizer for Business | ||||||||||
Uniseg | 500 | 31 | 7,628 | 3 months ago | 17 | February 21, 2023 | 2 | mit | Go | |
Unicode Text Segmentation, Word Wrapping, and String Width Calculation in Go | ||||||||||
Gkseg | 242 | | | | 11 years ago | | | 3 | other | C |
Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm | ||||||||||
Arabic Ocr | 192 | | | | 7 months ago | | | 9 | mit | Python |
OCR system for Arabic language that converts images of typed text to machine-encoded text. | ||||||||||
Ctc Segmentation | 192 | 2 | 2 years ago | 21 | October 11, 2022 | 4 | apache-2.0 | Python | ||
Segment an audio file and obtain utterance alignments. (Python package) | ||||||||||
Unicopedia Plus | 144 | | | | 8 months ago | | | | mit | JavaScript |
Developer-oriented set of Unicode, Unihan & emoji utilities wrapped into one single app, built with Electron. | ||||||||||
Nseg | 93 | 4 | 4 | 12 years ago | 9 | January 28, 2012 | mit | JavaScript | ||
Node.js Version of MMSG for Chinese Word Segmentation | ||||||||||
Cbl Js | 85 | | | | 3 years ago | | | 27 | mit | JavaScript |
JavaScript CAPTCHA solving library | ||||||||||
Scalpel | 52 | 39 | 2 | 8 years ago | 2 | December 21, 2012 | 2 | other | Ruby | |
A fast and accurate rule-based sentence segmentation tool for Ruby. | ||||||||||
Gocr | 50 | | | | a year ago | | | 3 | | Go |
OCR implementation with Golang | ||||||||||