Glue X

We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that the OOD accuracy in NLP tasks needs to be paid more attention to since the significant performance decay compared to ID accuracy has been found in all settings.
Alternatives To Glue X
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Nlp_chinese_corpus8,344
a year ago20mit
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Baichuan23,527
5 months ago231apache-2.0Python
A series of large language models developed by Baichuan Intelligent Technology
Baichuan 13b2,579
10 months ago76apache-2.0Python
A 13B large language model developed by Baichuan Intelligent Technology
Deepmoji1,462
9 months ago10mitPython
State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.
Beir1,41183 months ago29July 21, 202357apache-2.0Python
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Torchmoji882
a year ago21mitPython
😇A pyTorch implementation of the DeepMoji model: state-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc
Bolt823
a year ago38mitC++
Bolt is a deep learning library with high performance and heterogeneous flexibility.
Long Range Arena635
6 months ago27apache-2.0Python
Long Range Arena for Benchmarking Efficient Transformers
Fastrag591
5 months ago1apache-2.0Python
Efficient Retrieval Augmentation and Generation Framework
Indonlu518
2 years ago5apache-2.0Jupyter Notebook
The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020)
Alternatives To Glue X
Select To Compare


Alternative Project Comparisons
Popular Natural Language Processing Projects
Popular Benchmark Projects
Popular Machine Learning Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Python
Natural Language Processing
Benchmark