Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Ekphrasis | 583 | 7 | 2 years ago | 54 | May 17, 2022 | 18 | mit | Python | ||
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets). | ||||||||||
Hottosns Bert | 41 | 3 years ago | 2 | other | Python | |||||
hottoSNS-BERT: 大規模SNSコーパスによる文分散表現モデル | ||||||||||
Tokenizer | 34 | 5 years ago | 4 | Python | ||||||
Tokenizer for Twitter and Reddit data | ||||||||||
Twitter Korean Tokenizer Api | 10 | 9 years ago | 1 | apache-2.0 | CSS | |||||
API and UI Interface for Twitter Korean tokenizer https://github.com/twitter/twitter-korean-text | ||||||||||
Happierfuntokenizing | 9 | 7 years ago | 1 | Python | ||||||
This code implements a basic, Twitter-aware tokenizer. | ||||||||||
Tkt Elasticsearch | 9 | 8 years ago | 1 | Java | ||||||
elasticsearch plugin of twitter-korean-text for korean analyzer |