@@ -57,9 +57,6 @@ The overall network architecture of MASS is shown below, which is Transformer(Va
...
@@ -57,9 +57,6 @@ The overall network architecture of MASS is shown below, which is Transformer(Va
MASS is consisted of 6-layer encoder and 6-layer decoder with 1024 embedding/hidden size, and 4096 intermediate size between feed forward network which has two full connection layers.
MASS is consisted of 6-layer encoder and 6-layer decoder with 1024 embedding/hidden size, and 4096 intermediate size between feed forward network which has two full connection layers.
@@ -124,7 +121,8 @@ MASS script and code structure are as follow:
...
@@ -124,7 +121,8 @@ MASS script and code structure are as follow:
│ ├──all.bpe.codes // BPE codes table(this file should be generated by user).
│ ├──all.bpe.codes // BPE codes table(this file should be generated by user).
│ ├──all_en.dict.bin // Learned vocabulary file(this file should be generated by user).
│ ├──all_en.dict.bin // Learned vocabulary file(this file should be generated by user).
├── scripts
├── scripts
│ ├──run.sh // Train & evaluate model script.
│ ├──run_ascend.sh // Ascend train & evaluate model script.
│ ├──run_gpu.sh // GPU train & evaluate model script.
│ ├──learn_subword.sh // Learn BPE codes.
│ ├──learn_subword.sh // Learn BPE codes.
│ ├──stop_training.sh // Stop training.
│ ├──stop_training.sh // Stop training.
├── requirements.txt // Requirements of third party package.
├── requirements.txt // Requirements of third party package.
...
@@ -329,18 +327,24 @@ Almost all of the options and arguments needed could be assigned conveniently, i
...
@@ -329,18 +327,24 @@ Almost all of the options and arguments needed could be assigned conveniently, i
For more detailed information about the attributes, refer to the file `config/config.py`.
For more detailed information about the attributes, refer to the file `config/config.py`.
## Training & Evaluation process
## Training & Evaluation process
For training a model, the shell script `run.sh` is all you need. In this scripts, the environment variable is set and the training script `train.py` under `mass` is executed.
For training a model, the shell script `run_ascend.sh` or `run_gpu.sh` is all you need. In this scripts, the environment variable is set and the training script `train.py` under `mass` is executed.
You may start a task training with single device or multiple devices by assigning the options and run the command in bash:
You may start a task training with single device or multiple devices by assigning the options and run the command in bash: