1. 11 Dec, 2020 1 commit
    • Add DuReader yesno and robust (#4992) · 26a0cd1e
      smallv0221 committed
      * update lrscheduler
      
      * minor fix
      
      * add pre-commit
      
      * minor fix
      
      * Add __len__ to squad dataset
      
      * minor fix
      
      * Add dureader robust prototype
      
      * dataset implement
      
      * minor fix
      
      * fix var name
      
      * add dureader-yesno train script and dataset
      
      * add readme and fix md5sum
      
      * integrate dureader datasets
  2. 10 Dec, 2020 5 commits
    • Update perplexity, inherited from Metric (#4995) · 84a6ea83
      LiuChiachi committed
      * update ppl, inherited from Metric, suitable for common mt and lm task.
      
      * fix ppl bugs, delete useless argument
      
      * fix ppl bugs, return a num not a numpy array.
    • Add TokenEmbedding (#4983) · e59f15a1
      Jack Zhou committed
      * Add TokenEmbedding
      
      * download corpus embedding data
      * load embedding data by specifying corpus name
      * extend the vocab of tokenizer from corpus embedding data
      
      * add unk token setting
      
      * modify tokenizer
      
      * add extend vocab
      
      * move jieba tokenizer and rename corpus_name->embedding_name
      
      * use bos url instead of localhost
      
      * add log when loading data
      
      * add token dot computation; add __repr__ of TokenEmbedding
      
      * add color logging
      
      * use paddlenlp.utils.log
      
      * adjust repr
      
      * update pretrained embedding table
      
      * fix padding idx
    • Update paddlenlp.metrics README (#5010) · 26368f23
      LiuChiachi committed
      * Update paddlenlp.metrics README
      
      * Add info for Mcc
      
      * delete DuReader metric
      
      * update task name, add API col
    • add electra pretrain and modify style of electra modeling (#4990) · f07cdf53
      jeff41404 committed
      * add electra pretrain and modify style of electra modeling
      
      * add electra pretrain, modify style of electra modeling and fix problems of review
      
      * delete predict_classifer
      
      * modify accu to acc
      
      * add paddlenlp.metrics.glue
    • use paddlenlp downloader for the tokenizer (#5003) · 8e45228d
      wawltor committed
      use paddlenlp downloader for the tokenizer
  3. 09 Dec, 2020 2 commits
  4. 07 Dec, 2020 2 commits