Awesome Open Source
Search results for corpus multilingual
36 search results found
Laser
⭐
3,460
Language-Agnostic SEntence Representations
Chatterbot Corpus
⭐
1,219
A multilingual dialog corpus
Wordless
⭐
649
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
Awesome Spanish Nlp
⭐
324
Curated list of Linguistic Resources for doing NLP & CL on Spanish
Github Typo Corpus
⭐
289
GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors
Dl4mt C2c
⭐
145
Bible Corpus
⭐
134
A multilingual parallel corpus created from translations of the Bible.
Mldoc
⭐
132
A Corpus for Multilingual Document Classification in Eight Languages.
Bert Qa
⭐
119
BERT for question answering starting with HotpotQA
Mtdata
⭐
115
A tool that locates, downloads, and extracts machine translation corpora
Tupa
⭐
67
Transition-based UCCA Parser
Finbert
⭐
61
BERT model trained from scratch on Finnish
Cross Language Dataset
⭐
50
A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection
Flair Lms
⭐
47
Language Models for Zalando's flair library
Ewiser
⭐
40
A Word Sense Disambiguation system integrating implicit and explicit external knowledge.
Languagecodes
⭐
35
A list of languages with their codes, families, regions, etc., plus a list of multilingual corpora (with URLs).
Extractive_rc_by_runtime_mt
⭐
30
Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"
Opus 100 Corpus
⭐
27
Exquisite Corpus
⭐
25
Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.
How I Extracted Ted Talks For Parallel Corpus
⭐
22
Rectr
⭐
19
💒 Reproducible Extraction of Cross-lingual Topics using R
Asp Source
⭐
18
Source stories from the African Storybook Project in Markdown format
Corpus_dataset_for_chinese_nlp
⭐
18
Corpus datasets for Chinese NLP
Erisha
⭐
17
ERISHA is a multilingual, multispeaker expressive speech synthesis framework. It can transfer expressivity to the voice of a speaker for whom no expressive speech corpus is available.
Biobertpt
⭐
16
Biomedical and Clinical BERT for Portuguese Language
Bible Corpus Tools
⭐
12
A collection of tools for reading/processing the multilingual Bible corpus
Hatemail Corpus
⭐
11
Corpus of incoming hatemail, for linguistic analysis.
Lasertrain
⭐
11
Bert Chinese Annotation
⭐
10
Chinese annotations of the BERT code
Langdist
⭐
10
Multilingual Language Modeling Toolkit
Ntu Mc
⭐
9
Nanyang Technological University - Multilingual Corpus (STB subcorpora)
Parsing Mbert
⭐
7
Code for "Parsing with Multilingual BERT, a Small Corpus, and a Small Treebank" by Ethan C. Chau, Lucy H. Lin, and Noah A. Smith
Mc2_corpus
⭐
7
MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)
Dictorpus
⭐
7
Multilingual text corpus integrated with machine-readable dictionary (DICTionary + cORPUS).
Wikinflection
⭐
6
Generating an inflectional corpus out of Wiktionary.
Flair Pos Tagging
⭐
5
Flair Embeddings for PoS Tagging: A Multilingual Evaluation
Yorubatwi Embedding
⭐
5