Gpt 2 Training

Training GPT-2 on a Russian language corpus
Alternatives To Gpt 2 Training
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Corus254
9 months ago10July 24, 202366mitJupyter Notebook
Links to Russian corpora + Python functions for loading and parsing
Ted Multilingual Parallel Corpus152
8 years ago6
TED parallel Corpora is growing collection of Bilingual parallel corpora, Multilingual parallel corpora and Monolingual corpora extracted from TED talks www.ted.com for 109 world languages.
Ud_russian Syntagrus77
5 months ago16otherPerl
Russian data from the SynTagRus corpus.
Russian_news_corpus76
7 years ago1apache-2.0
Russian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ
Gpt 2 Training65
3 years ago7Python
Training GPT-2 on a Russian language corpus
Taiga_site54
4 years ago6CSS
Nerus51
9 months ago7April 09, 2020mitPython
Large silver standart Russian corpus with NER, morphology and syntax markup
Morphorueval 201741
6 years ago13otherPython
Russian Ulmfit27
4 years agoJupyter Notebook
AWD-LSTM language model trained on newspaper corpora with fast.ai
Spacy_russian_tokenizer26
5 years ago1Python
Custom Russian tokenizer for spaCy
Alternatives To Gpt 2 Training
Select To Compare


Alternative Project Comparisons
Popular Corpus Projects
Popular Russian Projects
Popular Data Processing Categories

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Python
Dataset
Corpus
Russian