Awesome Open Source

Programming Languages

Search results for benchmark bert

18 search results found

Nlp_chinese_corpus ⭐ 8,344

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Indonlu ⭐ 521

The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020)

📖 Korean NLU Benchmark

Fewclue ⭐ 347

FewCLUE 小样本学习测评基准，中文版

Chineseblue ⭐ 212

Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)

Awesome Llm Eval ⭐ 183

Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, learderboard, papers, docs and models, mainly for Evaluation on LLMs.

Tf Metal Experiments ⭐ 123

TensorFlow Metal Backend on Apple Silicon Experiments (just for fun)

We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that the OOD accuracy in NLP tasks needs to be paid more attention to since the significant performance decay compared to ID accuracy has been found in all settings.

Bert_ocr.pytorch ⭐ 51

Unofficial PyTorch implementation of 2D Attentional Irregular Scene Text Recognizer

UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic

Rumedbench ⭐ 20

https://arxiv.org/abs/2201.06499

KAREN: Unifying Hatespeech Detection and Benchmarking

IndoLEM is a comprehensive Indonesian NLU benchmark, comprising three pillars NLP task: morpho-syntax, semantic, and discourse. Presented in COLING 2020.

Pragmeval ⭐ 18

Discourse Based Evaluation of Language Understanding

Filipino Text Benchmarks ⭐ 13

Open-source benchmark datasets and pretrained transformer models in the Filipino language.

Sentiment_embeddings ⭐ 5

A scientific benchmark and comparison of the performance of sentiment analysis models in NLP on small to medium datasets

Related Searches

Python Benchmark (2,040)

C Plus Plus Benchmark (1,219)

Javascript Benchmark (1,165)

Golang Benchmark (1,080)

Benchmark Benchmarking (1,073)

Java Benchmark (993)

C Benchmark (902)

Benchmark Performance (776)

Python Bert (761)

Natural Language Processing Bert (674)

1-18 of 18 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.