Awesome Open Source
Search results for benchmark bert
18 search results found
Nlp_chinese_corpus
⭐
8,344
Large Scale Chinese Corpus for NLP
Clue
⭐
3,345
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Beir
⭐
1,411
A Heterogeneous Benchmark for Information Retrieval. Easy to use: evaluate your models across 15+ diverse IR datasets.
Indonlu
⭐
521
The first large-scale natural language processing benchmark for the Indonesian language. We provide multiple downstream tasks, pre-trained IndoBERT models, and starter code! (AACL-IJCNLP 2020)
Klue
⭐
379
📖 Korean NLU Benchmark
Fewclue
⭐
347
FewCLUE: a few-shot learning evaluation benchmark, Chinese edition
Chineseblue
⭐
212
Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)
Awesome Llm Eval
⭐
183
Awesome-LLM-Eval: a curated list of tools, datasets/benchmarks, demos, leaderboards, papers, docs, and models, mainly for the evaluation of LLMs.
Tf Metal Experiments
⭐
123
TensorFlow Metal Backend on Apple Silicon Experiments (just for fun)
Glue X
⭐
111
We use 14 datasets as OOD test data and evaluate 21 widely used models on 8 NLU tasks. Our findings confirm that OOD accuracy in NLP tasks deserves more attention, since a significant performance drop relative to ID accuracy was found in all settings.
Bert_ocr.pytorch
⭐
51
Unofficial PyTorch implementation of 2D Attentional Irregular Scene Text Recognizer
Marbert
⭐
49
UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic
Rumedbench
⭐
20
https://arxiv.org/abs/2201.06499
Karen
⭐
19
KAREN: Unifying Hatespeech Detection and Benchmarking
Indolem
⭐
19
IndoLEM is a comprehensive Indonesian NLU benchmark comprising three pillars of NLP tasks: morpho-syntax, semantics, and discourse. Presented at COLING 2020.
Pragmeval
⭐
18
Discourse Based Evaluation of Language Understanding
Filipino Text Benchmarks
⭐
13
Open-source benchmark datasets and pretrained transformer models in the Filipino language.
Sentiment_embeddings
⭐
5
A scientific benchmark and comparison of the performance of sentiment analysis models in NLP on small to medium datasets