1. 12 12月, 2020 2 次提交
  2. 11 12月, 2020 5 次提交
    • Z
      remove run_ernie_crf.py · d3029c01
      Zeyu Chen 提交于
      d3029c01
    • S
      Fix dureader api bugs (#5021) · 33a279eb
      smallv0221 提交于
      * update lrscheduler
      
      * minor fix
      
      * add pre-commit
      
      * minor fix
      
      * Add __len__ to squad dataset
      
      * minor fix
      
      * Add dureader robust prototype
      
      * dataset implement
      
      * minor fix
      
      * fix var name
      
      * add dureader-yesno train script and dataset
      
      * add readme and fix md5sum
      
      * integrete dureader datasets
      
      * change var names: segment to mode, root to data_file
      
      * minor fix
      
      * update var name
      
      * Fix api bugs
      33a279eb
    • K
      Optimize BigruCRF example (#5017) · 03d651b4
      kinghuin 提交于
      * optimize lac
      
      * formatted
      
      * optimize lac
      
      * optimize lac
      03d651b4
    • S
      Update datasets naming style (#5014) · ad4720ec
      smallv0221 提交于
      * update lrscheduler
      
      * minor fix
      
      * add pre-commit
      
      * minor fix
      
      * Add __len__ to squad dataset
      
      * minor fix
      
      * Add dureader robust prototype
      
      * dataset implement
      
      * minor fix
      
      * fix var name
      
      * add dureader-yesno train script and dataset
      
      * add readme and fix md5sum
      
      * integrete dureader datasets
      
      * change var names: segment to mode, root to data_file
      
      * minor fix
      
      * update var name
      ad4720ec
    • S
      Add DuReader yesno and robust (#4992) · 26a0cd1e
      smallv0221 提交于
      * update lrscheduler
      
      * minor fix
      
      * add pre-commit
      
      * minor fix
      
      * Add __len__ to squad dataset
      
      * minor fix
      
      * Add dureader robust prototype
      
      * dataset implement
      
      * minor fix
      
      * fix var name
      
      * add dureader-yesno train script and dataset
      
      * add readme and fix md5sum
      
      * integrete dureader datasets
      26a0cd1e
  3. 10 12月, 2020 5 次提交
    • L
      Update perplexity, inherited from Metric (#4995) · 84a6ea83
      LiuChiachi 提交于
      * update ppl, inherited from Metric, suitable for common mt and lm task.
      
      * fix ppl bugs, delete useless argument
      
      * fix ppl bugs, return a num not a numpy array.
      84a6ea83
    • J
      Add TokenEmbedding (#4983) · e59f15a1
      Jack Zhou 提交于
      * Add TokenEmbedding
      
      * download corpus embedding data
      * load embedding data by specifying corpus name
      * extend the vocab of tokenizer from corpus embedding data
      
      * add unk token setting
      
      * modify tokenizer
      
      * add extend voacb
      
      * move jieba tokenizer and rename corpus_name->embedding_name
      
      * use bos url instead of localhost
      
      * add log when loading data
      
      * add token dot computation; add __repr__ of TokenEmbedding
      
      * add color logging
      
      * use paddlenlp.utils.log
      
      * adjust repr
      
      * update pretrained embedding table
      
      * fix padding idx
      e59f15a1
    • L
      Update paddlenlp.metrics README (#5010) · 26368f23
      LiuChiachi 提交于
      * Update paddlenlp.metrics README
      
      * Add info for Mcc
      
      * delete DuReader metric
      
      * update task name, add API col
      26368f23
    • J
      add electra pretrain and modify style of electra modeling (#4990) · f07cdf53
      jeff41404 提交于
      * add electra pretrain and modify style of electra modeling
      
      * add electra pretrain, modify style of electra modeling and fix problems of review
      
      * delete predict_classifer
      
      * modify accu to acc
      
      * add paddlenlp.metrics.glue
      f07cdf53
    • W
      use paddlenlp downloader for the tokenizer (#5003) · 8e45228d
      wawltor 提交于
      use paddlenlp downloader for the tokenizer
      8e45228d
  4. 09 12月, 2020 2 次提交
  5. 07 12月, 2020 2 次提交