| fastnlp/fastNLP |
2,940 |
|
0 |
2 |
about 3 years ago |
24 |
October 31, 2022 |
62 |
apache-2.0 |
Python |
| fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation. |
| kk7nc/Text_Classification |
1,621 |
|
0 |
0 |
over 3 years ago |
0 |
|
1 |
mit |
Python |
| Text Classification Algorithms: A Survey |
| roshan-research/hazm |
1,381 |
|
17 |
13 |
6 months ago |
20 |
October 01, 2023 |
12 |
mit |
Python |
| Persian NLP Toolkit |
| pemistahl/lingua-go |
1,064 |
|
0 |
15 |
over 2 years ago |
18 |
September 05, 2023 |
6 |
apache-2.0 |
Go |
| The most accurate natural language detection library for Go, suitable for short text and mixed-language text |
| cbaziotis/ekphrasis |
583 |
|
7 |
0 |
over 3 years ago |
54 |
May 17, 2022 |
18 |
mit |
Python |
| Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets). |
| abadojack/whatlanggo |
580 |
|
4 |
58 |
about 3 years ago |
2 |
March 06, 2019 |
12 |
mit |
Go |
| Natural language detection library for Go |
| open-korean-text/open-korean-text |
552 |
|
6 |
6 |
about 3 years ago |
14 |
August 07, 2018 |
13 |
apache-2.0 |
Scala |
| Open Korean Text Processor - An Open-source Korean Text Processor |
| proycon/pynlpl |
466 |
|
16 |
3 |
almost 3 years ago |
102 |
March 13, 2019 |
3 |
gpl-3.0 |
Python |
| PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation). |
| ChenghaoMou/text-dedup |
399 |
|
0 |
0 |
over 2 years ago |
0 |
|
5 |
apache-2.0 |
Jupyter Notebook |
| All-in-one text de-duplication |
| haven-jeon/PyKoSpacing |
348 |
|
0 |
0 |
over 2 years ago |
0 |
|
1 |
gpl-3.0 |
Python |
| Automatic Korean word spacing with Python |