Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for unicode tokenizer
tokenizer
x
unicode
x
12 search results found
Text
⭐
1,172
Making text a first-class citizen in TensorFlow.
Jflex
⭐
523
The fast scanner generator for Java™ with full Unicode support
Tokenizer
⭐
224
Fast and customizable text tokenization library with BPE and SentencePiece support
Uax29
⭐
35
A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split words, sentences and graphemes.
Nlpt
⭐
35
Natural Language Processing Toolkit written in Go (DEPRECATED see individual packages prefixed nlpt-)
Sqlite3 Unicodesn
⭐
29
SQLite unicode full-text-search tokenizer with Snowball stemming
Unicode Tokenizer
⭐
20
Unicode Tokenizer following the Unicode Line Breaking algorithm
Pyidaungsu
⭐
19
Python library for Myanmar language
Greeb
⭐
16
Greeb is a simple Unicode-aware regexp-based tokenizer.
Tokenizer
⭐
11
Natural Language Tokenizer
Mascara
⭐
6
A natural language tokenizer
Cjk Tokenizer
⭐
5
Related Searches
Character Unicode (849)
Python Unicode (840)
Javascript Unicode (743)
Python Tokenizer (341)
C Plus Plus Unicode (337)
1-12 of 12 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.