Sombok

Unicode text segmentation package
Alternatives To Sombok
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Uniseg500317,6284 months ago17February 21, 20232mitGo
Unicode Text Segmentation, Word Wrapping, and String Width Calculation in Go
Unicode Segmentation4962,8645115 months ago21January 31, 202326otherRust
Grapheme Cluster and Word boundaries according to UAX#29 rules
Unicopedia Plus144
9 months agomitJavaScript
Developer-oriented set of Unicode, Unihan & emoji utilities wrapped into one single app, built with Electron.
Proposal Intl Segmenter118
2 years ago12HTML
Unicode text segmentation for ECMAScript
Segment74165260a year ago3December 19, 20225apache-2.0Go
A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29
Text612a year ago2June 29, 20201otherElixir
Text detection and processing for Elixir
Hyphenation487114 months ago18August 19, 20216apache-2.0Rust
Text hyphenation for Rust
Uax293567 months ago40May 26, 20231mitGo
A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split words, sentences and graphemes.
Uuseg21
4 months ago2iscOCaml
Unicode text segmentation for OCaml
Segments171392 years ago17July 08, 20226apache-2.0Python
Unicode Standard tokenization routines and orthography profile segmentation
Alternatives To Sombok
Select To Compare


Alternative Project Comparisons
Popular Segmentation Projects
Popular Unicode Projects
Popular Machine Learning Categories

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
C
Segmentation
Unicode