Skip to content
体验新版
项目
组织
正在加载...
登录
切换导航
打开侧边栏
PaddlePaddle
PALM
提交
ea7d592a
P
PALM
项目概览
PaddlePaddle
/
PALM
通知
5
Star
3
Fork
0
代码
文件
提交
分支
Tags
贡献者
分支图
Diff
Issue
10
列表
看板
标记
里程碑
合并请求
0
Wiki
0
Wiki
分析
仓库
DevOps
项目成员
Pages
P
PALM
项目概览
项目概览
详情
发布
仓库
仓库
文件
提交
分支
标签
贡献者
分支图
比较
Issue
10
Issue
10
列表
看板
标记
里程碑
合并请求
0
合并请求
0
Pages
分析
分析
仓库分析
DevOps
Wiki
0
Wiki
成员
成员
收起侧边栏
关闭侧边栏
动态
分支图
创建新Issue
提交
Issue看板
未验证
提交
ea7d592a
编写于
4月 07, 2020
作者:
X
Xiaoyao Xi
提交者:
GitHub
4月 07, 2020
浏览文件
操作
浏览文件
下载
电子邮件补丁
差异文件
Update run.py
上级
746ce70f
变更
1
隐藏空白更改
内联
并排
Showing
1 changed file
with
13 addition
and
18 deletion
+13
-18
examples/multi-task/run.py
examples/multi-task/run.py
+13
-18
未找到文件。
examples/multi-task/run.py
浏览文件 @
ea7d592a
...
@@ -22,28 +22,25 @@ if __name__ == '__main__':
...
@@ -22,28 +22,25 @@ if __name__ == '__main__':
train_slot
=
'./data/atis/atis_slot/train.tsv'
train_slot
=
'./data/atis/atis_slot/train.tsv'
train_intent
=
'./data/atis/atis_intent/train.tsv'
train_intent
=
'./data/atis/atis_intent/train.tsv'
predict_file
=
'./data/atis/atis_slot/test.tsv'
predict_file
=
'./data/atis/atis_slot/test.tsv'
save_path
=
'./outputs/'
pred_output
=
'./outputs/predict/'
pred_output
=
'./outputs/predict/'
save_type
=
'ckpt'
pre_params
=
'./pretrain/ERNIE-v2-en-base/params'
config
=
json
.
load
(
open
(
'./pretrain/ERNIE-v2-en-base/ernie_config.json'
))
config
=
json
.
load
(
open
(
'./pretrain/ERNIE-v2-en-base/ernie_config.json'
))
input_dim
=
config
[
'hidden_size'
]
input_dim
=
config
[
'hidden_size'
]
# ----------------------- for training -----------------------
# ----------------------- for training -----------------------
# step 1-1: create readers
for training
# step 1-1: create readers
seq_label_reader
=
palm
.
reader
.
SequenceLabelReader
(
vocab_path
,
max_seqlen
,
label_map
,
seed
=
random_seed
)
seq_label_reader
=
palm
.
reader
.
SequenceLabelReader
(
vocab_path
,
max_seqlen
,
label_map
,
seed
=
random_seed
)
cls_reader
=
palm
.
reader
.
ClassifyReader
(
vocab_path
,
max_seqlen
,
seed
=
random_seed
)
cls_reader
=
palm
.
reader
.
ClassifyReader
(
vocab_path
,
max_seqlen
,
seed
=
random_seed
)
# step 1-2: load t
he training
data
# step 1-2: load t
rain
data
seq_label_reader
.
load_data
(
train_slot
,
file_format
=
'tsv'
,
num_epochs
=
None
,
batch_size
=
batch_size
)
seq_label_reader
.
load_data
(
train_slot
,
file_format
=
'tsv'
,
num_epochs
=
None
,
batch_size
=
batch_size
)
cls_reader
.
load_data
(
train_intent
,
batch_size
=
batch_size
,
num_epochs
=
None
)
cls_reader
.
load_data
(
train_intent
,
batch_size
=
batch_size
,
num_epochs
=
None
)
# step 2: create a backbone of the model to extract text features
# step 2: create a backbone of the model to extract text features
ernie
=
palm
.
backbone
.
ERNIE
.
from_config
(
config
)
ernie
=
palm
.
backbone
.
ERNIE
.
from_config
(
config
)
# step 3: register
the backbone in readers
# step 3: register
readers with ernie backbone
seq_label_reader
.
register_with
(
ernie
)
seq_label_reader
.
register_with
(
ernie
)
cls_reader
.
register_with
(
ernie
)
cls_reader
.
register_with
(
ernie
)
...
@@ -51,7 +48,7 @@ if __name__ == '__main__':
...
@@ -51,7 +48,7 @@ if __name__ == '__main__':
seq_label_head
=
palm
.
head
.
SequenceLabel
(
num_classes
,
input_dim
,
dropout_prob
)
seq_label_head
=
palm
.
head
.
SequenceLabel
(
num_classes
,
input_dim
,
dropout_prob
)
cls_head
=
palm
.
head
.
Classify
(
num_classes_intent
,
input_dim
,
dropout_prob
)
cls_head
=
palm
.
head
.
Classify
(
num_classes_intent
,
input_dim
,
dropout_prob
)
# step 5-1: create
a task t
rainer
# step 5-1: create
task trainers and multiHeadT
rainer
trainer_seq_label
=
palm
.
Trainer
(
"slot"
,
mix_ratio
=
1.0
)
trainer_seq_label
=
palm
.
Trainer
(
"slot"
,
mix_ratio
=
1.0
)
trainer_cls
=
palm
.
Trainer
(
"intent"
,
mix_ratio
=
1.0
)
trainer_cls
=
palm
.
Trainer
(
"intent"
,
mix_ratio
=
1.0
)
trainer
=
palm
.
MultiHeadTrainer
([
trainer_seq_label
,
trainer_cls
])
trainer
=
palm
.
MultiHeadTrainer
([
trainer_seq_label
,
trainer_cls
])
...
@@ -60,23 +57,21 @@ if __name__ == '__main__':
...
@@ -60,23 +57,21 @@ if __name__ == '__main__':
loss2
=
trainer_seq_label
.
build_forward
(
ernie
,
seq_label_head
)
loss2
=
trainer_seq_label
.
build_forward
(
ernie
,
seq_label_head
)
loss_var
=
trainer
.
build_forward
()
loss_var
=
trainer
.
build_forward
()
# step 6-1*:
use warmup
# step 6-1*:
enable warmup for better fine-tuning
n_steps
=
seq_label_reader
.
num_examples
*
1.5
*
num_epochs
//
batch_size
n_steps
=
seq_label_reader
.
num_examples
*
1.5
*
num_epochs
//
batch_size
warmup_steps
=
int
(
0.1
*
n_steps
)
warmup_steps
=
int
(
0.1
*
n_steps
)
sched
=
palm
.
lr_sched
.
TriangularSchedualer
(
warmup_steps
,
n_steps
)
sched
=
palm
.
lr_sched
.
TriangularSchedualer
(
warmup_steps
,
n_steps
)
# step 6-2:
create
a optimizer
# step 6-2:
build
a optimizer
adam
=
palm
.
optimizer
.
Adam
(
loss_var
,
lr
,
sched
)
adam
=
palm
.
optimizer
.
Adam
(
loss_var
,
lr
,
sched
)
# step 6-3: build backward
# step 6-3: build backward
graph
trainer
.
build_backward
(
optimizer
=
adam
,
weight_decay
=
weight_decay
)
trainer
.
build_backward
(
optimizer
=
adam
,
weight_decay
=
weight_decay
)
# step 7: fit
prepared reader and data
# step 7: fit
readers to trainer
trainer
.
fit_readers_with_mixratio
([
seq_label_reader
,
cls_reader
],
"slot"
,
num_epochs
)
trainer
.
fit_readers_with_mixratio
([
seq_label_reader
,
cls_reader
],
"slot"
,
num_epochs
)
# step 8-1*: load pretrained parameters
# step 8-1*: load pretrained model
trainer
.
load_pretrain
(
pre_params
)
trainer
.
load_pretrain
(
'./pretrain/ERNIE-v2-en-base'
)
# step 8-2*: set saver to save model
# step 8-2*: set saver to save models during training
save_steps
=
int
(
n_steps
-
batch_size
)
//
2
trainer
.
set_saver
(
save_path
=
'./outputs/'
,
save_steps
=
300
)
# save_steps = 10
trainer
.
set_saver
(
save_path
=
save_path
,
save_steps
=
save_steps
,
save_type
=
save_type
)
# step 8-3: start training
# step 8-3: start training
trainer
.
train
(
print_steps
=
print_steps
)
trainer
.
train
(
print_steps
=
10
)
编辑
预览
Markdown
is supported
0%
请重试
或
添加新附件
.
添加附件
取消
You are about to add
0
people
to the discussion. Proceed with caution.
先完成此消息的编辑!
取消
想要评论请
注册
或
登录