Wordsegmentationtm

Fast Word Segmentation with Triangular Matrix
Alternatives To Wordsegmentationtm
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Sentencepiece8,8511207872 months ago34May 02, 202332apache-2.0C++
Unsupervised text tokenizer for Neural Network-based text generation.
Pkuseg Python6,00148a year ago22June 19, 2020119mitPython
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
Subword Nmt1,93718182 years ago8December 08, 20212mitPython
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
Pythainlp90224512 months ago101November 26, 202335apache-2.0Python
Thai Natural Language Processing in Python.
Jieba Rs5855158 months ago40July 16, 20239mitRust
The Jieba Chinese Word Segmentation Implemented in Rust
Ekphrasis583
7a year ago54May 17, 202218mitPython
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Vncorenlp472
a year agootherJava
A Vietnamese natural language processing toolkit (NAACL 2018)
Nagisa365172 months ago22July 30, 20234mitPython
A Japanese tokenizer based on recurrent neural networks
Pycantonese290
10 months ago24December 28, 20215mitPython
Cantonese Linguistics and NLP
Python Wordsegment268
4 years ago8otherPython
English word segmentation, written in pure-Python, and based on a trillion-word corpus.
Alternatives To Wordsegmentationtm
Select To Compare


Alternative Project Comparisons
Popular Segmentation Projects
Popular Word Segmentation Projects
Popular Machine Learning Categories

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
C Sharp
Segmentation
Spellcheck
Word Segmentation