Awesome Open Source

Search results for natural language processing benchmark

27 search results found

Nlp_chinese_corpus ⭐ 8,344

Large Scale Chinese Corpus for NLP

Baichuan2 ⭐ 3,527

A series of large language models developed by Baichuan Intelligent Technology

Baichuan 13b ⭐ 2,579

A 13B large language model developed by Baichuan Intelligent Technology

Deepmoji ⭐ 1,462

State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Torchmoji ⭐ 882

😇 A PyTorch implementation of the DeepMoji model: a state-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm, etc.

Bolt is a deep learning library with high performance and heterogeneous flexibility.

Long Range Arena ⭐ 635

Long Range Arena for Benchmarking Efficient Transformers

Fastrag ⭐ 591

Efficient Retrieval Augmentation and Generation Framework

Indonlu ⭐ 496

The first large-scale natural language processing benchmark for the Indonesian language. We provide multiple downstream tasks, pre-trained IndoBERT models, and starter code! (AACL-IJCNLP 2020)

Indicnlp_catalog ⭐ 487

A collaborative catalog of NLP resources for Indic languages

RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.

Langtest ⭐ 430

Deliver safe & effective language models

Dialoglue ⭐ 256

DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue

Lm Spanish ⭐ 220

Official source for Spanish language models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).

Chineseblue ⭐ 212

Chinese Biomedical Language Understanding Evaluation benchmark (ChineseBLUE)

Awesome Llm Eval ⭐ 183

Awesome-LLM-Eval: a curated list of tools, datasets/benchmarks, demos, leaderboards, papers, docs, and models, mainly for the evaluation of LLMs.

Trustllm ⭐ 164

TrustLLM: Trustworthiness in Large Language Models

We use 14 datasets as OOD test data and evaluate 21 widely used models on 8 NLU tasks. Our findings confirm that OOD accuracy in NLP tasks deserves more attention, since a significant performance drop relative to ID accuracy was found in all settings.

Simple NLP in Rust with Python bindings

Tasksource ⭐ 103

Dataset collection and preprocessing standardization for extreme multitask learning in NLP

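The jieba-analysis tool for Java. (A more flexible, elegant, easy-to-use, and high-performance Java word segmentation implementation based on the jieba segmentation dictionary. Supports part-of-speech tagging.)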

Lex Glue ⭐ 87

LexGLUE: A Benchmark Dataset for Legal Language Understanding in English

Optimum Transformers ⭐ 71

Accelerated NLP pipelines for fast inference on CPU and GPU. Built with Transformers, Optimum and ONNX Runtime.

The first large-scale natural language generation benchmark for Indonesian, Sundanese, and Javanese. We provide multiple downstream tasks, pre-trained IndoGPT and IndoBART models, and starter code! (EMNLP 2021)

Code for the paper "FactCHD: Benchmarking Fact-Conflicting Hallucination Detection".

Text Style Transfer Benchmark ⭐ 52

Text style transfer benchmark

Onnx_transformers ⭐ 49

Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.

A Natural Portuguese Language Benchmark (Napolab) for the evaluation of language models.

Masakhane Community ⭐ 40

All our community docs! Start here! Let's put Africa on the NLP map

Embeddings ⭐ 33

Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks; the initial version of the library focuses on the Polish language

Text Classification Cn ⭐ 33

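Chinese text classification in practice, based on the Sogou news corpus, using traditional machine learning methods as well as pre-trained models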

Noisemix ⭐ 27

NoiseMix - data generation for natural language

Repository for the paper "ViHOS: Vietnamese Hate and Offensive Spans Detection" (EACL2023)

Image2text ⭐ 24

A deep learning project to tell a story with an image or a video.

The Multitask Long Document Benchmark

Spacy Benchmarks ⭐ 21

💫 Runtime performance comparison of spaCy against other NLP libraries

Stepgame ⭐ 20

[AAAI 2022] Dataset and PyTorch code for the paper "StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts" (Oral)

A configurable implementation of locality-sensitive hashing in Elixir

Rumedbench ⭐ 20

https://arxiv.org/abs/2201.06499

KAREN: Unifying Hatespeech Detection and Benchmarking

IndoLEM is a comprehensive Indonesian NLU benchmark covering three pillars of NLP tasks: morpho-syntax, semantics, and discourse. Presented at COLING 2020.

Pragmeval ⭐ 18

Discourse Based Evaluation of Language Understanding

Tinysegmenter.jl ⭐ 18

Julia version of TinySegmenter, compact Japanese tokenizer

Word Benchmarks ⭐ 17

Benchmarks for intrinsic word embeddings evaluation.

Nlu_benchmark_dataset ⭐ 16

Benchmark datasets for Natural Language Understanding (NLU)

The Stanford Word Substitution (Swords) Benchmark

Benchmark Nlp ⭐ 13

NLP benchmark test sentences and full results

Heterogeneous, Task- and Domain-Specific Benchmark for Unsupervised Sentence Embeddings used in the TSDAE paper: https://arxiv.org/abs/2104.06979.

Lepiszcze ⭐ 11

This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish

Paraphrasebench ⭐ 11

A benchmark to test linguistic robustness.

German Text Embedding Clustering Benchmark

Entity_knowledge_in_bert ⭐ 10

This repository contains the code for the CoNLL 2019 paper "Investigating Entity Knowledge in BERT with Simple Neural End-To-End Entity Linking". The code is provided as documentation for the paper and for follow-up research.

[NeurIPS 2023] Multi-fidelity hyperparameter optimization with deep power laws that achieves state-of-the-art results across diverse benchmarks.

Text Representation ⭐ 10

Works on text representation, such as papers, code, reviews, datasets, blogs, theses, and so on.

Dl_text_classification ⭐ 10

Collection of Deep Learning Text Classification Models in Keras; Includes a GPU tutorial.

Code and data of the EMNLP 2022 paper "Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversarial NLP".

Nlp Benchmark ⭐ 7

Nlp Standards ⭐ 6

🗣 Talks on NLP checklists and sheets to standardize the development pipeline for transparency and accountability @ TUMunich & TADA

Embedding Benchmark ⭐ 6

Word Embedding benchmark project By Shahid Beheshti University NLP Lab

Tatoeba Mt Benchmark ⭐ 5

Tatoeba machine translation benchmark and implementations of different seq2seq algorithms.

Sentiment_embeddings ⭐ 5

A scientific benchmark and comparison of the performance of sentiment analysis models in NLP on small to medium datasets

ACL 2021 paper "Style is NOT a single variable: Case Studies for Cross-Style Language Understanding" by Dongyeop Kang and Eduard Hovy
