Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Laser | 3,460 | | | | 5 months ago | 1 | November 21, 2023 | 51 | other | Jupyter Notebook |
Language-Agnostic SEntence Representations | | | | | | | | | | |
Muse | 2,844 | | | | 3 years ago | | | 71 | other | Python |
A library for Multilingual Unsupervised or Supervised word Embeddings | | | | | | | | | | |
Contextualized Topic Models | 1,141 | 4 | | | 4 months ago | 30 | November 03, 2022 | 10 | mit | Python |
A Python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021. | | | | | | | | | | |
Conceptnet Numberbatch | 1,114 | | | | 2 years ago | | | 7 | other | Python |
Bpemb | 1,068 | 15 | 86 | | 2 years ago | 13 | September 23, 2022 | 4 | mit | Python |
Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE) | | | | | | | | | | |
Text2text | 268 | | | | 4 months ago | 134 | October 21, 2023 | 27 | other | Python |
Text2Text: Crosslingual NLP/G toolkit | | | | | | | | | | |
Laserembeddings | 163 | 1 | 4 | | 2 years ago | 11 | December 12, 2021 | 3 | bsd-3-clause | Python |
LASER multilingual sentence embeddings as a pip package | | | | | | | | | | |
Mimick | 149 | | | | 4 years ago | | | | gpl-3.0 | Python |
Code for Mimicking Word Embeddings using Subword RNNs (EMNLP 2017) | | | | | | | | | | |
Vecalign | 122 | | | | a year ago | | | 5 | apache-2.0 | Python |
Improved Sentence Alignment in Linear Time and Space | | | | | | | | | | |
Text | 112 | | | | 3 months ago | | | 17 | | R |
Using Transformers from HuggingFace in R | | | | | | | | | | |