Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Tokenizer | 224 | 15 | 5 | 9 months ago | 68 | January 11, 2023 | 2 | mit | C++ | |
Fast and customizable text tokenization library with BPE and SentencePiece support | ||||||||||
Icu Tokenizer | 13 | 2 years ago | 1 | June 18, 2020 | 1 | mit | Python | |||
ICU based universal language tokenizer |