Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Sling | 1,873 | 3 years ago | 1 | apache-2.0 | C++ | |||||
SLING - A natural language frame semantics parser | ||||||||||
Wikipedia2vec | 899 | 4 | 2 | 3 months ago | 31 | April 03, 2021 | 6 | other | Python | |
A tool for learning vector representations of words and entities from Wikipedia | ||||||||||
Wit | 896 | 4 months ago | 3 | other | ||||||
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages. | ||||||||||
Wordninja | 648 | 4 | 25 | a year ago | 7 | August 10, 2019 | 11 | mit | Python | |
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies. | ||||||||||
Chakin | 313 | 1 | 5 years ago | 7 | March 27, 2019 | 8 | mit | Python | ||
Simple downloader for pre-trained word vectors | ||||||||||
Adam_qas | 298 | 4 years ago | 8 | gpl-3.0 | Python | |||||
ADAM - A Question Answering System. Inspired from IBM Watson | ||||||||||
Aravec | 242 | 3 years ago | 3 | Jupyter Notebook | ||||||
AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models. | ||||||||||
Spikex | 220 | 3 years ago | 5 | May 31, 2021 | 1 | other | Python | |||
SpikeX - SpaCy Pipes for Knowledge Extraction | ||||||||||
Nlp Data Augmentation | 215 | 3 years ago | ||||||||
Data Augmentation for NLP. NLP数据增强 | ||||||||||
Wp2txt | 160 | 1 | a year ago | 29 | May 13, 2023 | 1 | mit | Ruby | ||
A command-line toolkit to extract text content and category data from Wikipedia dump files |