Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Wit | 896 | 6 months ago | 3 | other | ||||||
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages. | ||||||||||
Trankit | 693 | 2 | 6 months ago | 20 | March 26, 2022 | 24 | apache-2.0 | Python | ||
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing | ||||||||||
Xl Sum | 209 | a year ago | Python | |||||||
This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021. | ||||||||||
Text | 112 | 4 months ago | 17 | R | ||||||
Using Transformers from HuggingFace in R | ||||||||||
Fastrtext | 97 | 1 | 1 | 5 years ago | 11 | October 27, 2019 | 8 | other | C++ | |
R wrapper for fastText | ||||||||||
Lima | 92 | 6 months ago | 22 | December 22, 2022 | 49 | other | C++ | |||
The Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit. | ||||||||||
Multilingual Latent Dirichlet Allocation Lda | 73 | 2 years ago | 2 | mit | Python | |||||
A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python. | ||||||||||
Kwx | 57 | 6 months ago | 25 | January 28, 2023 | 11 | bsd-3-clause | Python | |||
BERT, LDA, and TFIDF based keyword extraction in Python | ||||||||||
Masakhane Community | 40 | a year ago | 5 | mit | ||||||
All our community docs! Start here! Lets put Africa on the NLP Map | ||||||||||
Wikirec | 18 | 6 months ago | 34 | July 09, 2022 | 8 | bsd-3-clause | Python | |||
Recommendation engine framework based on Wikipedia data |