cluewsc.yaml 443 字节
Newer Older
C
Chang Xu 已提交
1 2 3 4 5 6 7 8
Global:
  model_dir: ./cluewsc
  model_filename: inference.pdmodel
  params_filename: inference.pdiparams
  task_name: cluewsc
  dataset: clue
  batch_size: 16
  max_seq_length: 128
C
Chang Xu 已提交
9 10 11 12
TransformerPrune:
  pruned_ratio: 0.25
HyperParameterOptimization:
Distillation:
Z
zhouzj 已提交
13
QuantPost:
C
Chang Xu 已提交
14 15 16 17
TrainConfig:
  epochs: 100
  eval_iter: 70
  learning_rate: 1.0e-5
C
ceci3 已提交
18 19 20
  optimizer_builder:
    optimizer: 
      type: AdamW
C
Chang Xu 已提交
21 22
    weight_decay: 0.01
  origin_metric: 0.8421