Chinese Text-to-Speech(TTS)
汉字 => ["han4", "zi4"] => .wav audio
Read this page in 简体中文
A list of all Chinese characters is obtained from 倉頡平台2012, a Chinese input software.
git clone https://github.com/junzew/HanTTS.git
cd HanTTS
pip install --user -r requires.txt
Download syllables.zip from SourceForge, and decompress under the directory HanTTS.
- Either run locally:
python main.py - Or through web interface:
cdinto theserverfolderNavigate tonpm install node app.jslocalhost:3000in a browser
- Record five tones of each pinyin listed in
mapping.json - Group them by the first letter (a,b,c,d, etc.), and save under folder
./recordingas{letter}.wav - Then run
python process.py {letter}for eachletterto split recordings into individual pinyin - Move valid .wav files from
./preto the folder./syllables
Based on the text-to-speech-sample project by @alexram1313