Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for machine learning multilingual
machine-learning
x
multilingual
x
18 search results found
Wit
⭐
896
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
Trankit
⭐
693
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Xl Sum
⭐
209
This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.
Text
⭐
112
Using Transformers from HuggingFace in R
Fastrtext
⭐
97
R wrapper for fastText
Lima
⭐
92
The Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.
Multilingual Latent Dirichlet Allocation Lda
⭐
73
A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.
Kwx
⭐
57
BERT, LDA, and TFIDF based keyword extraction in Python
Masakhane Community
⭐
40
All our community docs! Start here! Lets put Africa on the NLP Map
Wikirec
⭐
18
Recommendation engine framework based on Wikipedia data
Plmpapers
⭐
12
A paper list of pre-trained language models (PLMs).
Swim Ir
⭐
11
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 languages, generated using PaLM 2 and summarize-then-ask prompting.
Langdist
⭐
10
Multilingual Language Modeling Toolkit
Sentiment_analysis_multilingual_corpora
⭐
8
a generic approach to the supervised sentiment analysis of social media content in foreign languages
Acl20 Code Switching Patterns
⭐
7
Code-switching patterns can be an effective route to improve performance of downstream NLP applications: A case study of humour, sarcasm and hate speech detection
Unified_multilingual_dataset_of_emotional_human_utterances
⭐
5
A unified dataset of multilingual emotional human utterances
Semeval2022 Task8 Tonyx
⭐
5
Deep-learning system proposed by HFL for SemEval-2022 Task 8: Multilingual News Similarity
Mol
⭐
5
Multilingual Offensive Lexicon consists of the first contextual lexicon for abusive language detection, which is composed of 1,000 explicit and implicit terms and expressions with any pejorative connotation annotated with contextual information
Related Searches
Python Machine Learning (14,103)
Jupyter Notebook Machine Learning (12,247)
Machine Learning Neural Network (4,397)
Machine Learning Tensorflow (4,050)
Machine Learning Data Science (3,802)
Machine Learning Artificial Intelligence (2,074)
Machine Learning Computer Vision (1,966)
Dataset Machine Learning (1,873)
Machine Learning Pytorch (1,834)
Machine Learning Natural Language Processing (1,786)
1-18 of 18 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.