From d13d7becf4afc3a9682a9c5ff60c0fabd6872fae Mon Sep 17 00:00:00 2001 From: lifuchen Date: Tue, 10 Mar 2020 08:53:27 +0000 Subject: [PATCH] change the image size in transformer_tts readme --- examples/transformer_tts/README.md | 7 ++++++- 1 file changed, 6 insertions(+), 1 deletion(-) diff --git a/examples/transformer_tts/README.md b/examples/transformer_tts/README.md index 1f1922c..ce9cd5b 100644 --- a/examples/transformer_tts/README.md +++ b/examples/transformer_tts/README.md @@ -10,8 +10,13 @@ wget https://data.keithito.com/data/speech/LJSpeech-1.1.tar.bz2 tar xjvf LJSpeech-1.1.tar.bz2 ``` ## Model Architecture +
+
+
+
+TransformerTTS model architecture +
-![TransformerTTS model architecture](./images/model_architecture.jpg) The model adopts the multi-head attention mechanism to replace the RNN structures and also the original attention mechanism in [Tacotron2](https://arxiv.org/abs/1712.05884). The model consists of two main parts, encoder and decoder. We also implement the CBHG model of Tacotron as the vocoder part and convert the spectrogram into raw wave using Griffin-Lim algorithm. ## Project Structure -- GitLab