Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Fastnlp | 2,940 | 2 | 10 months ago | 24 | October 31, 2022 | 62 | apache-2.0 | Python | ||
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation. | ||||||||||
Text_classification | 1,621 | a year ago | 1 | mit | Python | |||||
Text Classification Algorithms: A Survey | ||||||||||
Hazm | 1,091 | 17 | 13 | 9 days ago | 20 | October 01, 2023 | 12 | mit | Python | |
Persian NLP Toolkit | ||||||||||
Lingua Go | 1,064 | 15 | 2 months ago | 18 | September 05, 2023 | 6 | apache-2.0 | Go | ||
The most accurate natural language detection library for Go, suitable for short text and mixed-language text | ||||||||||
Ekphrasis | 583 | 7 | a year ago | 54 | May 17, 2022 | 18 | mit | Python | ||
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets). | ||||||||||
Whatlanggo | 580 | 4 | 58 | a year ago | 2 | March 06, 2019 | 12 | mit | Go | |
Natural language detection library for Go | ||||||||||
Open Korean Text | 552 | 6 | 6 | a year ago | 14 | August 07, 2018 | 13 | apache-2.0 | Scala | |
Open Korean Text Processor - An Open-source Korean Text Processor | ||||||||||
Pynlpl | 466 | 16 | 3 | 6 months ago | 102 | March 13, 2019 | 3 | gpl-3.0 | Python | |
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation). | ||||||||||
Text Dedup | 399 | 3 months ago | 5 | apache-2.0 | Jupyter Notebook | |||||
All-in-one text de-duplication | ||||||||||
Pykospacing | 348 | 4 months ago | 1 | gpl-3.0 | Python | |||||
Automatic Korean word spacing with Python |