Friso

Friso is a high-performance Chinese tokenizer written in ANSI C and based on the MMSEG algorithm, with support for both GBK and UTF-8 charsets. Its implementation is fully modular, so it can be easily embedded in other programs such as MySQL, PostgreSQL, and PHP.
Alternatives To Friso
Friso
  Stars: 449 | Most Recent Commit: 7 months ago | Open Issues: 7 | License: apache-2.0 | Language: C

Microtokenizer
  Stars: 119 | Repos Using This: 3 | Packages Using This: 1 | Most Recent Commit: 3 years ago | Total Releases: 53 | Latest Release: September 28, 2021 | License: mit | Language: Python
  A micro Chinese word segmentation engine with comprehensive algorithm coverage | A micro tokenizer for Chinese

Berserker
  Stars: 16 | Most Recent Commit: 5 years ago | Open Issues: 3 | License: mit | Language: Python
  Berserker - BERt chineSE woRd toKenizER