CLUEDatasetSearch Alternatives

Search all Chinese NLP datasets, with commonly used English NLP datasets included.

Stars: 2,778
License: No license specified
Open Issues: 6
Most Recent Commit: over 3 years ago
Programming Language: Python
Dependent Repos: 0
Dependent Packages: 0
Total Releases: 0

Categories

Programming Languages > Python

Data Processing > Dataset

Machine Learning > Natural Language Processing

Community > Chinese

Data Processing > Corpus

Machine Learning > Sentiment Analysis

Web User Interface > Emotion

Machine Learning > Text Classification

Machine Learning > NER

Content Management > Knowledge Graph

Social Media > Weibo

Machine Learning > Machine Translation

Machine Learning > Text Summarization

Alternatives To CLUEbenchmark/CLUEDatasetSearch

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
sebastianruder/NLP-progress	22,082	0	0	over 2 years ago	0		52	mit	Python
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
huggingface/datasets	17,925	9	760	over 2 years ago	76	November 16, 2023	665	apache-2.0	Python
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
doccano/doccano	8,652	0	0	over 2 years ago	32	July 20, 2023	278	mit	Python
Open source annotation tool for machine learning practitioners.
brightmart/nlp_chinese_corpus	8,344	0	0	about 3 years ago	0		20	mit
Large Scale Chinese Corpus for NLP
lonePatient/awesome-pretrained-chinese-nlp-models	3,738	0	0	over 2 years ago	0		1	mit	Python
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models
pytorch/text	3,411	341	146	over 2 years ago	29	November 15, 2023	314	bsd-3-clause	Python
Models, data loaders and abstractions for language processing, powered by PyTorch
CLUEbenchmark/CLUEDatasetSearch	2,778	0	0	over 3 years ago	0		6		Python
Search all Chinese NLP datasets, with commonly used English NLP datasets included.
QData/TextAttack	2,597	0	5	over 2 years ago	46	September 11, 2023	52	mit	Python
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
PetrochukM/PyTorch-NLP	2,180	9	10	about 3 years ago	19	November 04, 2019	24	bsd-3-clause	Python
Basic Utilities for PyTorch Natural Language Processing (NLP)
github/CodeSearchNet	2,054	0	0	over 4 years ago	0		7	mit	Jupyter Notebook
Datasets, tools, and benchmarks for representation learning of code.

Popular Dataset Projects

public-apis/public-apis ⭐ 276,890

A collective list of free APIs

awesomedata/awesome-public-datasets ⭐ 57,596

A topic-centric list of HQ open datasets.

apache/superset ⭐ 56,358

Apache Superset is a Data Visualization and Data Exploration Platform

aymericdamien/TensorFlow-Examples ⭐ 43,109

TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

HumanSignal/label-studio ⭐ 27,816

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Popular Natural Language Processing Projects

huggingface/transformers ⭐ 119,240

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

d2l-ai/d2l-zh ⭐ 53,401

Dive into Deep Learning: written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.

apachecn/ailearning ⭐ 42,369

AiLearning: data analysis + hands-on machine learning + linear algebra + PyTorch + NLTK + TF2

hankcs/HanLP ⭐ 36,433

Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification

google-research/bert ⭐ 36,099

TensorFlow code and pre-trained models for BERT
