- Download the datasets for general distill and task distill, such as GLUE.
- Prepare a pre-trained BERT model and a BERT model fine-tuned for a specific task, such as a GLUE task.
## Running the Example
### General Distill
- Set options in `src/gd_config.py`, including the loss scale, optimizer and network.
- Set options in `scripts/run_standalone_gd.sh`, including the device target, data sink config, checkpoint config and dataset. See the [dataset loading tutorial](https://www.mindspore.cn/tutorial/zh-CN/master/use/data_preparation/loading_the_datasets.html#tfrecord) for more information about the dataset and the JSON schema file.
- Run `run_standalone_gd.sh` for non-distributed general distill of the BERT-base model.
``` bash
bash scripts/run_standalone_gd.sh
```
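Before launching, it can help to confirm that the files named in the steps above are present. The following is a minimal sketch, not part of the repository: the function name `check_gd_files` is hypothetical, and it assumes the command is run from the repository root, where `src/gd_config.py` and `scripts/run_standalone_gd.sh` are expected to live.

```bash
#!/usr/bin/env bash
# Hypothetical helper (an assumption, not shipped with the repository):
# verify that the files referenced in the steps above exist.
check_gd_files() {
    local root="${1:-.}"   # repository root; defaults to the current directory
    local missing=0
    local f
    for f in src/gd_config.py scripts/run_standalone_gd.sh; do
        if [ ! -f "${root}/${f}" ]; then
            echo "missing: ${root}/${f}" >&2
            missing=1
        fi
    done
    # Return 0 when every file was found, 1 otherwise.
    return "${missing}"
}

# Example: launch general distill only when the check passes.
# check_gd_files . && bash scripts/run_standalone_gd.sh
```

The check only tests that the files exist; it does not validate the options set inside them.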
- Run `run_distribute_gd.sh` for distributed general distill of the BERT-base model.