HerBERT Alternatives

Name: allegro/HerBERT

HerBERT is a BERT-based language model trained on Polish corpora using only the masked language modeling (MLM) objective, with dynamic masking of whole words.
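
As a rough illustration of what "dynamic masking of whole words" means, here is a minimal Python sketch. It is not HerBERT's actual training code. It assumes WordPiece-style tokens where a "##" prefix marks a continuation piece (HerBERT's own tokenizer may mark word boundaries differently), and the names `group_whole_words`, `mask_whole_words`, and `mask_prob` are hypothetical. It also always substitutes `[MASK]`, which simplifies the replacement scheme used in BERT-style training.

```python
import random

MASK = "[MASK]"  # assumed mask token, for illustration only


def group_whole_words(tokens):
    """Group token indices into whole words.

    Assumption: a token starting with '##' continues the previous word.
    """
    groups = []
    for i, tok in enumerate(tokens):
        if tok.startswith("##") and groups:
            groups[-1].append(i)
        else:
            groups.append([i])
    return groups


def mask_whole_words(tokens, mask_prob=0.15, rng=None):
    """Mask entire words and return (masked_tokens, labels).

    labels holds the original token at each masked position and None elsewhere.
    """
    rng = rng or random.Random()
    groups = group_whole_words(tokens)
    # Mask at least one word when possible, never more words than exist.
    n_to_mask = min(len(groups), max(1, round(len(groups) * mask_prob)))
    chosen = rng.sample(groups, n_to_mask)
    masked = list(tokens)
    labels = [None] * len(tokens)
    for group in chosen:
        for i in group:  # every piece of a chosen word is masked together
            labels[i] = tokens[i]
            masked[i] = MASK
    return masked, labels


# "Dynamic" masking: a new mask is drawn each time the sentence is seen,
# instead of being fixed once during preprocessing.
tokens = ["her", "##bert", "to", "model", "języ", "##kowy"]
for epoch in range(3):
    masked, _ = mask_whole_words(tokens, rng=random.Random(epoch))
    print(epoch, masked)
```

Because the mask is redrawn on every call, the same sentence can contribute different prediction targets in different epochs, and a word split into several pieces is never left partially visible.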

License: No license specified

Most Recent Commit: about 5 years ago

Categories

Data Processing > Corpus

Machine Learning > Language Model

Compilers > Tokenizer

Alternatives To allegro/HerBERT

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
brightmart/nlp_chinese_corpus	8,344	0	0	about 3 years ago	0		20	mit
Large Scale Chinese Corpus for NLP
codertimo/BERT-pytorch	5,605	1	0	almost 3 years ago	5	October 23, 2018	63	apache-2.0	Python
Google AI 2018 BERT pytorch implementation
CLUEbenchmark/CLUE	3,345	0	0	about 3 years ago	0		73		Python
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
VinAIResearch/BERTweet	542	0	0	over 2 years ago	0		0	mit	Python
BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)
songys/Chatbot_data	293	0	0	over 3 years ago	0		0	mit
Chatbot_data_for_Korean
yumeng5/LOTClass	231	0	0	over 4 years ago	0		0	apache-2.0	Python
[EMNLP 2020] Text Classification Using Label Names Only: A Language Model Self-Training Approach
hooshvare/parsbert	222	0	0	over 3 years ago	0		6	apache-2.0	Jupyter Notebook
🤗 ParsBERT: Transformer-based Model for Persian Language Understanding
GAIR-NLP/MathPile	192	0	0	over 2 years ago	0		2	apache-2.0	JavaScript
Generative AI for Math: MathPile
iPieter/RobBERT	180	0	0	over 2 years ago	0		15	mit	Jupyter Notebook
A Dutch RoBERTa-based language model
lopuhin/transformer-lm	155	0	0	over 5 years ago	0		8		Python
Transformer language model (GPT-2) with sentencepiece tokenizer

Popular Corpus Projects

nltk/nltk ⭐ 12,699

NLTK Source

nl8590687/ASRT_SpeechRecognition ⭐ 7,253

A Deep-Learning-Based Chinese Speech Recognition System

stanfordnlp/GloVe ⭐ 6,480

Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings

ibab/tensorflow-wavenet ⭐ 5,362

A TensorFlow implementation of DeepMind's WaveNet paper

niderhoff/nlp-datasets ⭐ 5,235

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)

Popular Language Model Projects

huggingface/transformers ⭐ 119,240

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

xtekky/gpt4free ⭐ 52,083

The official gpt4free repository | various collection of powerful language models

dair-ai/Prompt-Engineering-Guide ⭐ 40,069

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

LAION-AI/Open-Assistant ⭐ 36,197

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

tatsu-lab/stanford_alpaca ⭐ 24,846

Code and documentation to train Stanford's Alpaca models, and generate the data.
