Jtokenizer

A Java library for splitting text into constituent words. This can be tricky for non-trivial examples, therefore the jTokenizer package was designed to combine a set of tokenizers that range from basic whitespace tokenizers to more complex ones that deal intuitively with natural language.
Alternatives To Jtokenizer
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Elasticsearch Analysis Kuromoji Ipadic Neologd109534 years ago42January 10, 20194otherJava
Elasticsearch's Analyzer for Kuromoji with Neologd
Lucene Bo10
5 months ago6June 14, 201921apache-2.0Java
Lucene analyzer for Tibetan
Twitter Korean Tokenizer Api10
9 years ago1apache-2.0CSS
API and UI Interface for Twitter Korean tokenizer https://github.com/twitter/twitter-korean-text
Tkt Elasticsearch9
8 years ago1Java
elasticsearch plugin of twitter-korean-text for korean analyzer
Jtokenizer7
12 years agootherJava
A Java library for splitting text into constituent words. This can be tricky for non-trivial examples, therefore the jTokenizer package was designed to combine a set of tokenizers that range from basic whitespace tokenizers to more complex ones that deal intuitively with natural language.
Alternatives To Jtokenizer
Select To Compare


Alternative Project Comparisons
Popular Jar Projects
Popular Tokenizer Projects
Popular Build Tools Categories

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Java
Jar
Tokenizer