>Note: Make sure current directory is `models/deep_speech_2/cloud/`
>Note:
>Make sure [PaddleCloud client](https://github.com/PaddlePaddle/cloud/blob/develop/doc/usage_cn.md#%E4%B8%8B%E8%BD%BD%E5%B9%B6%E9%85%8D%E7%BD%AEpaddlecloud) has be installed and current directory is `models/deep_speech_2/cloud/`
## Step1 Configure data set
## Step-1 Configure data set
You can configure your input data and output path in pcloud_submit.sh:
Configure your input data and output path in pcloud_submit.sh:
-`TRAIN_MANIFEST`: Absolute path of train data manifest file in local file system.This file has format as bellow:
...
...
@@ -15,19 +16,18 @@ You can configure your input data and output path in pcloud_submit.sh:
": "nor is mister ..."}
```
-`TEST_MANIFEST`: Absolute path of train data manifest file in local filesystem.This file has format like TRAIN_MANIFEST.
-`TEST_MANIFEST`: Absolute path of train data manifest file in local filesystem. This file has format like `TRAIN_MANIFEST`.
-`VOCAB_FILE`: Absolute path of vocabulary file in local filesytem.
-`MEAN_STD_FILE`: Absolute path of vocabulary file in local filesytem.
-`MEAN_STD_FILE`: Absolute path of normalizer's statistic file in local filesytem.
-`CLOUD_DATA_DIR:` Absolute path in PaddleCloud filesystem. We will upload local train data to this directory.
-`CLOUD_MODEL_DIR`: Absolute path in PaddleCloud filesystem. PaddleCloud trainer will save model to this directory.
>Note: Upload will be skipped if target file has existed in `CLOUD_DATA_DIR`.
>Note: Upload will be skipped if target file has existed in ${CLOUD_DATA_DIR}.
## Step-2 Configure computation resource
## Step2 Configure computation resource
Configure computation resource in pcloud_submit.sh:
You can configure computation resource in pcloud_submit.sh:
```
# Configure computation resource and submit job to PaddleCloud
paddlecloud submit \
...
...
@@ -44,10 +44,10 @@ You can configure computation resource in pcloud_submit.sh: