update demos readme

48700c84 · 小湉湉 · 51c092ef · 48700c84 · 48700c84

Showing 2 changed files with 19 additions and 1 deletion

demos/story_talker/README.md +5 -0

demos/style_fs2/README.md +14 -1

--- a/demos/story_talker/README.md
+++ b/demos/story_talker/README.md
 # Story Talker
+## Introduction
+Storybooks play an important role in early childhood education, but parents often don't have enough time to read them to their children. Very young children may not yet understand the Chinese characters in a storybook, and sometimes children simply want to "listen" rather than "read".
 You can use `PaddleOCR` to extract the text of a storybook and read it aloud with the `TTS` module of `PaddleSpeech`.
+## Usage
 Run the following command line to get started:
 ```
 ./run.sh
 ```
+The result is shown in our [notebook](https://github.com/PaddlePaddle/PaddleSpeech/blob/develop/docs/tutorial/tts/tts_tutorial.ipynb).
--- a/demos/style_fs2/README.md
+++ b/demos/style_fs2/README.md
 # Style FastSpeech2
-You can change the `pitch`、`duration` and `energy` of `FastSpeech2`, then get some interesting results.
+## Introduction
+[FastSpeech2](https://arxiv.org/abs/2006.04558) is a classical acoustic model for Text-to-Speech synthesis that introduces controllable speech inputs, including `phoneme duration`, `energy` and `pitch`.
+In the prediction phase, you can change these controllable variables to get some interesting results.
+For example:
+1. The `duration` control in `FastSpeech2` changes the speed of the audio while keeping the `pitch` unchanged. (In some speech tools, increasing the speed also raises the pitch, and vice versa.)
+2. When we set the `pitch` of a sentence to a mean value and set the `tones` of the phones to `1`, we get a `robot-style` timbre.
+3. When we raise the `pitch` of an adult female voice (with a fixed scale ratio), we get a `child-style` timbre.
+The `duration` and `pitch` of different phonemes in a sentence can have different scale ratios. You can set different scale ratios to emphasize or weaken the pronunciation of some phonemes.
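The controls described above can be sketched on plain Python lists. This is only an illustration, not the code behind `run.sh` or any `PaddleSpeech` API: the helper names (`scale_durations`, `scale_pitch`, `flatten_pitch`), the sample values, and the assumption that durations are frame counts and pitch values are in Hz are all hypothetical. The `tones` change used for the robot style is left out.

```python
from statistics import mean
from typing import List, Sequence, Union

# A ratio is either one number for the whole sentence or one number per phoneme.
Ratio = Union[float, Sequence[float]]


def _per_phoneme(ratio: Ratio, n: int) -> List[float]:
    """Expand a single ratio to n ratios, or check that a given list has n entries."""
    if isinstance(ratio, (int, float)):
        return [float(ratio)] * n
    ratios = [float(r) for r in ratio]
    if len(ratios) != n:
        raise ValueError(f"expected {n} ratios, got {len(ratios)}")
    return ratios


def scale_durations(durations: Sequence[int], ratio: Ratio) -> List[int]:
    """Scale per-phoneme durations (assumed to be frame counts).

    A ratio above 1 slows speech down, below 1 speeds it up.
    Pitch values are not touched, so speed changes without shifting pitch.
    """
    ratios = _per_phoneme(ratio, len(durations))
    # Keep at least one frame per phoneme so no phoneme disappears.
    return [max(1, round(d * r)) for d, r in zip(durations, ratios)]


def scale_pitch(pitch: Sequence[float], ratio: Ratio) -> List[float]:
    """Scale per-phoneme pitch (assumed Hz); a fixed ratio above 1 raises the voice."""
    ratios = _per_phoneme(ratio, len(pitch))
    return [p * r for p, r in zip(pitch, ratios)]


def flatten_pitch(pitch: Sequence[float]) -> List[float]:
    """Replace every pitch value with the sentence mean (the pitch part of the robot style)."""
    m = mean(pitch)
    return [m] * len(pitch)


# Hypothetical three-phoneme sentence.
durations = [4, 10, 6]
pitch = [180.0, 220.0, 200.0]

slower = scale_durations(durations, 2.0)                  # [8, 20, 12]
robot = flatten_pitch(pitch)                              # [200.0, 200.0, 200.0]
child = scale_pitch(pitch, 1.5)                           # [270.0, 330.0, 300.0]
emphasized = scale_durations(durations, [1.0, 1.5, 1.0])  # [4, 15, 6]
```

Passing a list of ratios instead of a single number is what allows one phoneme to be emphasized or weakened while the rest of the sentence stays unchanged.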
+## Usage
 Run the following command line to get started:
 ```
 ./run.sh