diff --git a/README.md b/README.md index 181f78cf6870a9214c97b7554f1e4e05ad093284..81ce163616f16cb33de5e567b422612f33d3a456 100644 --- a/README.md +++ b/README.md @@ -28,10 +28,23 @@ CMUdict from nltk is used to transform text into phonemes. import nltk nltk.download("cmudict") ``` + +If you have completed all the above installations, but still report an error at runtime: + +``` OSError: sndfile library not found ``` + +You need to install ```libsndfile``` using your distribution’s package manager. e.g. install via: + +``` sudo apt-get install libsndfile1 ``` + ## Supported models - [Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning](https://arxiv.org/abs/1710.07654) +- [Neural Speech Synthesis with Transformer Network](https://arxiv.org/abs/1809.08895) +- [FastSpeech: Fast, Robust and Controllable Text to Speech](https://arxiv.org/abs/1905.09263). ## Examples - [Train a deepvoice 3 model with ljspeech dataset](./parakeet/examples/deepvoice3) +- [Train a transformer_tts model with ljspeech dataset](./parakeet/examples/transformer_tts) +- [Train a fastspeech model with ljspeech dataset](./parakeet/examples/fastspeech) diff --git a/examples/fastspeech/README.md b/examples/fastspeech/README.md index bd7c4d5023be13202e89d342c25ddf64a3e0df3f..7264b804cca17c014ef65bfb6c106dc6b1edd721 100644 --- a/examples/fastspeech/README.md +++ b/examples/fastspeech/README.md @@ -3,44 +3,6 @@ Paddle fluid implementation of Fastspeech, a feed-forward network based on Trans We implement Fastspeech model in paddle fluid with dynamic graph, which is convenient for flexible network architectures. -## Installation - -### Install paddlepaddle -This implementation requires the latest develop version of paddlepaddle. You can either download the compiled package or build paddle from source. -1. Install the compiled package, via pip, conda or docker. See [**Installation Mannuals**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/index_en.html) for more details. - -2. Build paddlepaddle from source. See [**Compile From Source Code**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/compile/fromsource_en.html) for more details. Note that if you want to enable data parallel training for multiple GPUs, you should set `-DWITH_DISTRIBUTE=ON` with cmake. - -### Install parakeet -You can choose to install via pypi or clone the repository and install manually. - -1. Install via pypi. - ```bash - pip install parakeet - ``` - -2. Install manually. - ```bash - git clone - cd Parakeet/ - pip install -e . - -### Download cmudict for nltk -You also need to download cmudict for nltk, because convert text into phonemes with `cmudict`. - -```python -import nltk -nltk.download("punkt") -nltk.download("cmudict") -``` - -If you have completed all the above installations, but still report an error at runtime: - -``` OSError: sndfile library not found ``` - -You need to install ```libsndfile``` using your distribution’s package manager. e.g. install via: - -``` sudo apt-get install libsndfile1 ``` ## Dataset diff --git a/examples/transformer_tts/README.md b/examples/transformer_tts/README.md index 475161daf6e4506190e792890a3691c0ca9a83c9..ee961b5d2f573f1fca9e3257710812617b490df0 100644 --- a/examples/transformer_tts/README.md +++ b/examples/transformer_tts/README.md @@ -3,45 +3,6 @@ Paddle fluid implementation of TransformerTTS, a neural TTS with Transformer. Th We implement TransformerTTS model in paddle fluid with dynamic graph, which is convenient for flexible network architectures. -## Installation - -### Install paddlepaddle -This implementation requires the latest develop version of paddlepaddle. You can either download the compiled package or build paddle from source. -1. Install the compiled package, via pip, conda or docker. See [**Installation Mannuals**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/index_en.html) for more details. - -2. Build paddlepaddle from source. See [**Compile From Source Code**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/compile/fromsource_en.html) for more details. Note that if you want to enable data parallel training for multiple GPUs, you should set `-DWITH_DISTRIBUTE=ON` with cmake. - -### Install parakeet -You can choose to install via pypi or clone the repository and install manually. - -1. Install via pypi. - ```bash - pip install parakeet - ``` - -2. Install manually. - ```bash - git clone - cd Parakeet/ - pip install -e . - -### Download cmudict for nltk -You also need to download cmudict for nltk, because convert text into phonemes with `cmudict`. - -```python -import nltk -nltk.download("punkt") -nltk.download("cmudict") -``` - -If you have completed all the above installations, but still report an error at runtime: - -``` OSError: sndfile library not found ``` - -You need to install ```libsndfile``` using your distribution’s package manager. e.g. install via: - -``` sudo apt-get install libsndfile1 ``` - ## Dataset We experiment with the LJSpeech dataset. Download and unzip [LJSpeech](https://keithito.com/LJ-Speech-Dataset/). diff --git a/setup.py b/setup.py index ca630fce25d504b8ea8f2c076a14d318d5b3befb..1cd6e8ab159c41540ba9d61463300c4efefa726a 100644 --- a/setup.py +++ b/setup.py @@ -36,7 +36,7 @@ setup_info = dict( install_requires=[ 'numpy', 'nltk', 'inflect', 'librosa', 'unidecode', 'numba', 'tqdm', 'matplotlib', 'tensorboardX', 'tensorboard', 'scipy', - 'ruamel.yaml', 'pandas', 'sox', + 'ruamel.yaml', 'pandas', 'sox', 'soundfile', ], # Package info