Awesome Open Source
Search results for natural language processing benchmark
27 search results found
Nlp_chinese_corpus
⭐
8,344
Large Scale Chinese Corpus for NLP
Baichuan2
⭐
3,527
A series of large language models developed by Baichuan Intelligent Technology
Baichuan 13b
⭐
2,579
A 13B large language model developed by Baichuan Intelligent Technology
Deepmoji
⭐
1,462
State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.
Beir
⭐
1,370
A Heterogeneous Benchmark for Information Retrieval. Easy to use: evaluate your models across 15+ diverse IR datasets.
Torchmoji
⭐
882
😇 A PyTorch implementation of the DeepMoji model: state-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm, etc.
Bolt
⭐
823
Bolt is a deep learning library with high performance and heterogeneous flexibility.
Long Range Arena
⭐
635
Long Range Arena for Benchmarking Efficient Transformers
Fastrag
⭐
591
Efficient Retrieval Augmentation and Generation Framework
Indonlu
⭐
496
The first-ever vast natural language processing benchmark for the Indonesian language. We provide multiple downstream tasks, pre-trained IndoBERT models, and starter code! (AACL-IJCNLP 2020)
Indicnlp_catalog
⭐
487
A collaborative catalog of NLP resources for Indic languages
Rnnlg
⭐
476
RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.
Langtest
⭐
430
Deliver safe & effective language models
Dialoglue
⭐
256
DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue
Lm Spanish
⭐
220
Official source for Spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).
Chineseblue
⭐
212
Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)
Awesome Llm Eval
⭐
183
Awesome-LLM-Eval: a curated list of tools, datasets/benchmarks, demos, leaderboards, papers, docs and models, mainly for evaluation of LLMs.
Trustllm
⭐
164
TrustLLM: Trustworthiness in Large Language Models
Glue X
⭐
111
We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that OOD accuracy in NLP tasks deserves more attention, since significant performance decay relative to ID accuracy was found in all settings.
Vtext
⭐
110
Simple NLP in Rust with Python bindings
Tasksource
⭐
103
Datasets collection and standardization preprocessings for NLP extreme multitask learning
Segment
⭐
98
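The jieba-analysis tool for Java. (A more flexible, elegant, easy-to-use, and high-performance Java word segmentation implementation built on the jieba segmentation dictionary. Supports part-of-speech tagging.)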
Lex Glue
⭐
87
LexGLUE: A Benchmark Dataset for Legal Language Understanding in English
Optimum Transformers
⭐
71
Accelerated NLP pipelines for fast inference on CPU and GPU. Built with Transformers, Optimum and ONNX Runtime.
Indonlg
⭐
64
The first-ever vast natural language generation benchmark for Indonesian, Sundanese, and Javanese. We provide multiple downstream tasks, pre-trained IndoGPT and IndoBART models, and starter code! (EMNLP 2021)
Factchd
⭐
54
Code for the paper "FactCHD: Benchmarking Fact-Conflicting Hallucination Detection".
Text Style Transfer Benchmark
⭐
52
Text style transfer benchmark
Onnx_transformers
⭐
49
Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.
Napolab
⭐
41
A Natural Portuguese Language Benchmark (Napolab) for the evaluation of language models.
Masakhane Community
⭐
40
All our community docs! Start here! Let's put Africa on the NLP Map
Embeddings
⭐
33
Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks. The initial version of the library focuses on the Polish language.
Text Classification Cn
⭐
33
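Chinese text classification in practice, based on the Sogou news corpus, using traditional machine learning methods as well as pre-trained models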
Noisemix
⭐
27
NoiseMix - data generation for natural language
Vihos
⭐
26
Repository for the paper "ViHOS: Vietnamese Hate and Offensive Spans Detection" (EACL2023)
Image2text
⭐
24
A deep learning project to tell a story with an image or a video.
Muld
⭐
24
The Multitask Long Document Benchmark
Spacy Benchmarks
⭐
21
💫 Runtime performance comparison of spaCy against other NLP libraries
Stepgame
⭐
20
[AAAI 2022] Dataset and PyTorch code for the paper "StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts" (Oral)
Ex_lsh
⭐
20
A configurable implementation of locality-sensitive hashing in Elixir
Rumedbench
⭐
20
https://arxiv.org/abs/2201.06499
Karen
⭐
19
KAREN: Unifying Hatespeech Detection and Benchmarking
Indolem
⭐
19
IndoLEM is a comprehensive Indonesian NLU benchmark, comprising three pillars of NLP tasks: morpho-syntax, semantics, and discourse. Presented at COLING 2020.
Pragmeval
⭐
18
Discourse Based Evaluation of Language Understanding
Tinysegmenter.jl
⭐
18
Julia version of TinySegmenter, compact Japanese tokenizer
Word Benchmarks
⭐
17
Benchmarks for intrinsic word embeddings evaluation.
Nlu_benchmark_dataset
⭐
16
Benchmark datasets for Natural Language Understanding (NLU)
Swords
⭐
14
The Stanford Word Substitution (Swords) Benchmark
Benchmark Nlp
⭐
13
NLP benchmark test sentences and full results
Useb
⭐
12
Heterogeneous, Task- and Domain-Specific Benchmark for Unsupervised Sentence Embeddings used in the TSDAE paper: https://arxiv.org/abs/2104.06979.
Lepiszcze
⭐
11
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish
Paraphrasebench
⭐
11
A benchmark to test linguistic robustness.
Tecb De
⭐
10
German Text Embedding Clustering Benchmark
Entity_knowledge_in_bert
⭐
10
This repository contains the code for the CoNLL 2019 paper "Investigating Entity Knowledge in BERT with Simple Neural End-To-End Entity Linking". The code is provided as documentation for the paper and also for follow-up research.
Dpl
⭐
10
[NeurIPS 2023] Multi-fidelity hyperparameter optimization with deep power laws that achieves state-of-the-art results across diverse benchmarks.
Text Representation
⭐
10
Text representation works, such as papers, code, reviews, datasets, blogs, theses, and so on.
Dl_text_classification
⭐
10
Collection of Deep Learning Text Classification Models in Keras; Includes a GPU tutorial.
Advbench
⭐
8
Code and data of the EMNLP 2022 paper "Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversarial NLP".
Nlp Benchmark
⭐
7
NLP-Benchmark
Nlp Standards
⭐
6
🗣 Talks on NLP checklists and sheets to standardize the development pipeline for transparency and accountability @ TUMunich & TADA
Embedding Benchmark
⭐
6
Word Embedding benchmark project By Shahid Beheshti University NLP Lab
Tatoeba Mt Benchmark
⭐
5
Tatoeba machine translation benchmark and implementations of different seq2seq algorithms.
Sentiment_embeddings
⭐
5
A scientific benchmark and comparison of the performance of sentiment analysis models in NLP on small to medium datasets
Xslue
⭐
5
ACL 2021 paper "Style is NOT a single variable: Case Studies for Cross-Style Language Understanding" by Dongyeop Kang and Eduard Hovy