未验证 提交 7310387f 编写于 作者: C ceci3 提交者: GitHub

add tinybert result (#621)

上级 609efce5
......@@ -8,14 +8,14 @@ BERT-base模型是一个迁移能力很强的通用语义表示模型,但是
| Task | Metric | Baseline | Result with PaddleSlim |
|:-----:|:----------------------------:|:-----------------:|:----------------------:|
| SST-2 | Accuracy | 0.93005 | 0.931193 |
| QNLI | Accuracy | 0.91781 | 0.920740 |
| CoLA | Mattehew's corr | 0.59557 | 0.601244 |
| MRPC | F1/Accuracy | 0.91667/0.88235 | 0.91740/0.88480 |
| STS-B | Person/Spearman corr | 0.88847/0.88350 | 0.89271/0.88958 |
| QQP | Accuracy/F1 | 0.90581/0.87347 | 0.90994/0.87947 |
| MNLI | Matched acc/MisMatched acc | 0.84422/0.84825 | 0.84687/0.85242 |
| RTE | Accuracy | 0.711191 | 0.718412 |
| SST-2 | Accuracy | 0.93005 | [0.931193]() |
| QNLI | Accuracy | 0.91781 | [0.920740]() |
| CoLA | Mattehew's corr | 0.59557 | [0.601244]() |
| MRPC | F1/Accuracy | 0.91667/0.88235 | [0.91740/0.88480]() |
| STS-B | Person/Spearman corr | 0.88847/0.88350 | [0.89271/0.88958]() |
| QQP | Accuracy/F1 | 0.90581/0.87347 | [0.90994/0.87947]() |
| MNLI | Matched acc/MisMatched acc | 0.84422/0.84825 | [0.84687/0.85242]() |
| RTE | Accuracy | 0.711191 | [0.718412]() |
<p align="center">
<strong>表1-1: GLUE数据集精度对比</strong>
......@@ -184,4 +184,41 @@ python -u ./run_glue_ofa.py --model_type bert \
压缩训练之后在dev上的结果如表1-1中『Result with PaddleSlim』列所示,延时情况如表1-2所示。
## 3. OFA接口介绍
TODO
OFA API介绍参考[API](https://github.com/PaddlePaddle/PaddleSlim/blob/develop/docs/zh_cn/api_cn/ofa_api.rst)
# 基于本代码对TinyBERT(L=4, D=312)进行压缩
下游任务模型是从TinyBERT官方repo转换得到。
## 1. 压缩结果
| Task | Metric | TinyBERT(L=4, D=312) | Result with OFA |
|:-----:|:----------------------------:|:--------------------:|:----------------------:|
| SST-2 | Accuracy | [0.9234]() | [0.9220]() |
| QNLI | Accuracy | [0.8746]() | [0.8720]() |
| CoLA | Mattehew's corr | [0.4961]() | [0.5048]() |
| MRPC | F1/Accuracy | [0.8998/0.8554]() | [0.9003/0.8578]() |
| STS-B | Person/Spearman corr | [0.8635/0.8631]() | [0.8717/0.8706]() |
| QQP | Accuracy/F1 | [0.9047/0.8751]() | [0.9034/0.8733]() |
| MNLI | Matched acc/MisMatched acc | [0.8256/0.8294]() | [0.8211/0.8261]() |
| RTE | Accuracy | [0.6534]() | [0.6787]() |
## 2. 启动命令
以GLUE/QQP任务为例。
```shell
export CUDA_VISIBLE_DEVICES=3
export TASK_NAME='QQP'
python -u ./run_glue_ofa.py --model_type bert \
--model_name_or_path ${PATH_OF_QQP} \
--task_name $TASK_NAME --max_seq_length 128 \
--batch_size 32 \
--learning_rate 2e-5 \
--num_train_epochs 6 \
--logging_steps 10 \
--save_steps 500 \
--output_dir ./tmp/$TASK_NAME/ \
--n_gpu 1 \
--width_mult_list 1.0 0.8333333333333334 0.6666666666666666 0.5
```
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册