Speech Transformer Alternatives

Name: kaituoxu/Speech-Transformer
Brand: kaituoxu/Speech-Transformer
SKU: project/kaituoxu/Speech-Transformer
Rating: 4.68 (714 reviews)

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Categories > Community > Pytorch

Suggest Alternative

Stars

714

Alternatives

License

No license specified

Open Issues

Most Recent Commit

over 3 years ago

Programming Language

Python

Dependent Repos

Dependent Packages

Total Releases

Categories

Programming Languages > Python

Machine Learning > Pytorch

Community > Chinese

Machine Learning > Attention

Machine Learning > Asr

Machine Learning > Kaldi

Machine Learning > Visdom

Repo

Alternatives To kaituoxu/Speech-Transformer

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
kaituoxu/Speech-Transformer	714	0	0	over 3 years ago	0		5		Python
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
yeyupiaoling/PPASR	701	0	0	over 2 years ago	38	September 07, 2023	1	apache-2.0	Python
基于PaddlePaddle实现端到端中文语音识别，从入门到实战，超简单的入门案例，超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型
speechio/chinese_text_normalization	570	0	0	over 3 years ago	0		4	mit	Python
Chinese text normalization for speech processing
yeyupiaoling/PaddlePaddle-DeepSpeech	536	0	0	almost 3 years ago	0		2	apache-2.0	Python
基于PaddlePaddle实现的语音识别，中文语音识别。项目完善，识别效果好。支持Windows，Linux下训练和预测，支持Nvidia Jetson开发板预测。
Ailln/cn2an	522	1	16	almost 3 years ago	56	August 21, 2023	16	mit	Python
📦 快速转化「中文数字」和「阿拉伯数字」～ (最新特性：分数，日期、温度等转化）
yeyupiaoling/Whisper-Finetune	502	0	0	over 2 years ago	0		1	apache-2.0	C
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
shibing624/parrots	318	0	0	over 2 years ago	7	November 03, 2022	5	apache-2.0	Python
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese. 中文语音识别、文字转语音，基于语音库实现，易扩展。
aishell-foundation/DaCiDian	198	0	0	about 6 years ago	0		2		Python
DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)
chenmingxiang110/Chinese-automatic-speech-recognition	157	0	0	about 3 years ago	0		3	mit	Jupyter Notebook
Chinese speech recognition
jackaduma/LAS_Mandarin_PyTorch	104	0	0	about 3 years ago	0		6	mit	Python
Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)

Alternatives To kaituoxu/Speech-Transformer

Select To Compare

kaituoxu/Speech-Transformer ⭐ 714

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

dependent packages 0 total releases 0 most recent commit over 3 years ago

yeyupiaoling/PPASR ⭐ 701

基于PaddlePaddle实现端到端中文语音识别，从入门到实战，超简单的入门案例，超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型

dependent packages 0 total releases 38 most recent commit over 2 years ago downloads badge

speechio/chinese_text_normalization ⭐ 570

Chinese text normalization for speech processing

dependent packages 0 total releases 0 most recent commit over 3 years ago

yeyupiaoling/PaddlePaddle-DeepSpeech ⭐ 536

基于PaddlePaddle实现的语音识别，中文语音识别。项目完善，识别效果好。支持Windows，Linux下训练和预测，支持Nvidia Jetson开发板预测。

dependent packages 0 total releases 0 most recent commit almost 3 years ago

Ailln/cn2an ⭐ 522

📦 快速转化「中文数字」和「阿拉伯数字」～ (最新特性：分数，日期、温度等转化）

dependent packages 16 total releases 56 most recent commit almost 3 years ago downloads badge

yeyupiaoling/Whisper-Finetune ⭐ 502

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment

dependent packages 0 total releases 0 most recent commit over 2 years ago

shibing624/parrots ⭐ 318

Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese. 中文语音识别、文字转语音，基于语音库实现，易扩展。

dependent packages 0 total releases 7 most recent commit over 2 years ago downloads badge

aishell-foundation/DaCiDian ⭐ 198

DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)

dependent packages 0 total releases 0 most recent commit about 6 years ago

chenmingxiang110/Chinese-automatic-speech-recognition ⭐ 157

Chinese speech recognition

dependent packages 0 total releases 0 most recent commit about 3 years ago

jackaduma/LAS_Mandarin_PyTorch ⭐ 104

Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)

dependent packages 0 total releases 0 most recent commit about 3 years ago

Suggest An Alternative To Speech-Transformer

Alternative Project Comparisons

kaituoxu/Speech-Transformer vs Speech Transformer

kaituoxu/Speech-Transformer vs Ppasr

kaituoxu/Speech-Transformer vs Chinese_text_normalization

kaituoxu/Speech-Transformer vs Paddlepaddle Deepspeech

kaituoxu/Speech-Transformer vs Cn2an

kaituoxu/Speech-Transformer vs Whisper Finetune

kaituoxu/Speech-Transformer vs Parrots

kaituoxu/Speech-Transformer vs Dacidian

kaituoxu/Speech-Transformer vs Chinese Automatic Speech Recognition

kaituoxu/Speech-Transformer vs Las_mandarin_pytorch

Popular Chinese Projects

iptv-org/iptv⭐ 74,798

Collection of publicly available IPTV channels from all over the world

Anduin2017/HowToCook⭐ 57,819

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Chinese only).

ElemeFE/element⭐ 53,857

A Vue.js 2.0 UI Toolkit for Web

d2l-ai/d2l-zh⭐ 53,401

《动手学深度学习》：面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

chinese-poetry/chinese-poetry⭐ 45,313

The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人，21050首词。

Popular Asr Projects

kaldi-asr/kaldi⭐ 13,453

kaldi-asr/kaldi is the official location of the Kaldi project.

PaddlePaddle/PaddleSpeech⭐ 12,641

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

NVIDIA/NeMo⭐ 9,041

NeMo: a toolkit for conversational AI

espnet/espnet⭐ 7,563

End-to-End Speech Processing Toolkit

m-bain/whisperX⭐ 7,510

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Popular Community Categories

Social

Chinese

Conference

Japanese

Feedback

Meetup

Cats

Open Data

China

Stanford