The primary motivation of NeurST is to help NLP researchers get started with end-to-end speech translation (ST) and build advanced neural machine translation (NMT) models.
A full list of NeurST examples is available in the repository. Recent progress in end-to-end ST technology is presented at https://st-benchmark.github.io/.
NeurST is based on TensorFlow 2, and a PyTorch version is in progress.
March 29, 2022: Release of GigaST dataset: a large-scale speech translation corpus.
Aug 16, 2021: Release of models and results for the IWSLT 2021 offline ST and simultaneous translation tasks.
June 15, 2021: Integration of LightSeq for training speedup; see the experimental branch.
March 28, 2021: The v0.1.1 release includes instructions for weight pruning and quantization-aware training of transformer models, along with several other features. See the release notes for more details.
Dec. 25, 2020: The v0.1.0 release includes the overall design of the code structure and recipes for training end-to-end ST models. See the release notes for more details.
Model, Task, Dataset, etc. can be combined with each other. BytePS / Horovod is also supported for large-scale scenarios.
NeurST provides reference implementations of various models and benchmarks. Please see the examples for model links and NeurST benchmarks on different datasets.
Install NeurST from source:
git clone https://github.com/bytedance/neurst.git
cd neurst/
pip3 install -e .
If an ImportError occurs at runtime, install the missing package manually.
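As a rough aid for the manual step above, the following sketch checks which modules can be imported before a run starts. It is not part of NeurST: the helper name `missing_packages` and the candidate list are assumptions made for illustration, and only TensorFlow is named in this README as a dependency. It relies solely on the standard-library `importlib.util.find_spec`.

```python
import importlib.util


def missing_packages(names):
    """Return the top-level module names in `names` that cannot be imported here."""
    # find_spec returns None when no importer can locate a top-level module.
    return [name for name in names if importlib.util.find_spec(name) is None]


# Hypothetical candidate list: extend it with whatever module the ImportError names.
# Note that a module's import name can differ from its pip package name.
candidates = ["tensorflow"]

for name in missing_packages(candidates):
    print(f"Module '{name}' is not importable; install the matching package with pip3.")
```

Running this in the environment where NeurST was installed prints one line per module that still needs to be installed, and prints nothing when everything in the list is importable.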
@InProceedings{zhao2021neurst,
author = {Chengqi Zhao and Mingxuan Wang and Qianqian Dong and Rong Ye and Lei Li},
booktitle = {the 59th Annual Meeting of the Association for Computational Linguistics (ACL): System Demonstrations},
title = {{NeurST}: Neural Speech Translation Toolkit},
year = {2021},
month = aug,
}
For any questions or suggestions, please feel free to contact us: [email protected], [email protected].
We thank Bairen Yi, Zherui Liu, Yulu Jia, Yibo Zhu, Jiaze Chen, Jiangtao Feng, and Zewei Sun for their kind help.