# Speech Application based on PaddleSpeech

([简体中文](./README_cn.md)|English)

This directory contains speech applications for multiple scenarios.

* audio searching - large-scale audio similarity retrieval
* audio tagging - multi-label tagging of an audio file
* automatic_video_subtitiles - generate subtitles from a video
* metaverse - 2D AR with TTS  
* punctuation_restoration - restore punctuation from raw text
* speech recognition - recognize the text in an audio file
* speech translation - end-to-end speech translation
* story talker - book reader based on OCR and TTS  
* style_fs2 - multi-style control for the FastSpeech2 model
* text_to_speech - convert text into speech