Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for parallel corpus
corpus
x
parallel
x
46 search results found
Language Style Transfer
⭐
491
Bible Corpus
⭐
134
A multilingual parallel corpus created from translations of the Bible.
Bicleaner
⭐
134
Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.
Korean Parallel Corpora
⭐
129
Korean Parallel Corpus
Awesome Danish
⭐
110
A curated list of awesome resources for Danish language technology
Lingtrain Aligner
⭐
98
Lingtrain Aligner — ML powered library for the accurate texts alignment.
Small_parallel_enja
⭐
61
50k English-Japanese Parallel Corpus for Machine Translation Benchmark.
Wikipedia Parallel Titles
⭐
53
Tools for extracting parallel corpora from article titles across languages in Wikipedia
Cross Language Dataset
⭐
50
A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection
Naki
⭐
49
List of research and engineering of NLP for American Native/Indigenous Languages.
Seq2seq
⭐
49
Library to train parallel-aligned sequence data based on Keras
Parallel Corpora Tools
⭐
39
Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
Americasnlp2021
⭐
37
Textprep
⭐
33
Textprep is an analyzing tool for both parallel and non-parallel corpus and its down-stream Natural Language Processing and Machine Translation tasks. It is designed especially for logographic languages such as Chinese and Japanese.
Parsentextract
⭐
29
A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora.
Odia Nlp Resource Catalog
⭐
26
Spaced Out
⭐
26
Vocab drill using parallel corpora, plus classic spaced-repetition drill
Exquisite Corpus
⭐
25
Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.
How I Extracted Ted Talks For Parallel Corpus
⭐
22
Self Training Text Generation
⭐
18
Implementation of ICLR 2020 paper "Revisiting Self-Training for Neural Sequence Generation"
En Az Parallel Corpus
⭐
18
English-Azerbaijani parallel language corpus
Covid19 Datashare
⭐
17
A repo for sharing language resources related to the outbreak (in machine readable format)
Esquite
⭐
17
Framework para corpus paralelos | Framework for parallel corpora
Movie2paralleldb
⭐
12
Automatic parallel speech database extractor from dubbed movies
Google Ngrams
⭐
12
Shell scripts to assist downloading & processing the Google n-grams corpora
Dual Learning
⭐
11
Implementation of Dual Learning NMT & Joint Training on tensorflow
Polite Dialogue Generation
⭐
11
Code for "Polite Dialogue Generation Without Parallel Data"
Keops
⭐
11
Tool for manual evaluation of parallel sentences.
Nlp Augment
⭐
10
A collection of utilities used in exploring data augmentation of low-resource parallel corpuses.
Smtdata
⭐
9
Datasets for machine translation
Mypar
⭐
8
myPar: Myanmar Parallel Corpora for Machine Translation R&D
Sentimentator
⭐
8
Tool for sentiment analysis annotation
Nlp Sentence Compression
⭐
8
Paraphrasic Sentence Compression using Deep-Link Bilingual Phrase Alignments.
Timealign
⭐
7
Parallel corpus annotation and visualization
Babel
⭐
7
Translation without parallel corpora.
Anuvaad Parallel Corpus
⭐
6
Mtrain
⭐
6
Training automation for neural and statistical machine translation engines
Contexto
⭐
6
An open source contextual dictionary
Le Traducteur
⭐
6
A Neural Machine Translation framework built with PyTorch and AllenNLP.
Vae4vc
⭐
6
Unsuppse
⭐
5
Unsupervised parallel sentence extraction from comparable corpora
Glosbe Translation Memory Crawler
⭐
5
parallel corpora for any languages supported by glosbe.com
Parallel Sentences Identifier
⭐
5
zNLP : Identifying parallel sentences in Chinese-English comparable corpora for the BUCC 2017 Shared Task (https://comparable.limsi.fr/bucc2017/bucc2017-tas
Perfectextractor
⭐
5
Extracting present perfects (and related forms) from parallel corpora
Icealign
⭐
5
Search Tool for the Icelandic Parallel Corpus
Mwe Tools
⭐
5
A set of useful tools for use with multiword expression extraction from parallel corpora for Moses statistical machine translation system
Related Searches
Python Corpus (2,447)
Python Parallel (1,211)
C Plus Plus Parallel (1,094)
Natural Language Processing Corpus (510)
Dataset Corpus (342)
Java Corpus (308)
Language Corpus (261)
1-46 of 46 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.