Parsentextract

A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora.
Alternatives To Parsentextract
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Language Style Transfer491
3 years ago21apache-2.0Roff
Bible Corpus134
a year ago2cc0-1.0
A multilingual parallel corpus created from translations of the Bible.
Bicleaner13414 months ago35March 29, 2023gpl-3.0Python
Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.
Korean Parallel Corpora129
a year ago1
Korean Parallel Corpus
Awesome Danish110
a year agoother
A curated list of awesome resources for Danish language technology
Lingtrain Aligner98
5 months ago53November 26, 20233gpl-3.0Python
Lingtrain Aligner — ML powered library for the accurate texts alignment.
Small_parallel_enja61
5 years agoRoff
50k English-Japanese Parallel Corpus for Machine Translation Benchmark.
Wikipedia Parallel Titles53
9 years ago4Perl
Tools for extracting parallel corpora from article titles across languages in Wikipedia
Cross Language Dataset50
7 years ago1other
A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection
Naki49
3 years agogpl-3.0
List of research and engineering of NLP for American Native/Indigenous Languages.
Alternatives To Parsentextract
Select To Compare


Alternative Project Comparisons
Popular Corpus Projects
Popular Parallel Projects
Popular Data Processing Categories

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Python
Tensorflow
Parallel
Corpus
Machine Translation