1. 15 Dec, 2020 6 commits
    • X
      Upgrade plato2 using paddle2.0 (#5002) · b76f1591
      xiemoyuan committed
      * The first version of plato2. Not finished the network.
      
      * Update decode strategy.
      
      * Update decode strategy.
      
      * Completed the encoder and decoder, but it runs out of memory (OOM).
      
      * Completed the encoder and decoder.
      
      * backend the code.
      
      * Only completed the network of plato2 and nsp.
      
      * Completed the development, but the effect has not been verified.
      
      * Add readme and remove the code for handling PY2 and PY3.
      
      * Modify comment.
      
      * Modify readme and add images.
      
      * Delete the data folder.
      b76f1591
    • S
      update docs for ernie-tiny · 3366cf65
      Steffy-zxf committed
      update docs for ernie-tiny
      3366cf65
    • K
      add ernie_gen for the dygraph mode · b725cbd0
      kinghuin committed
      add ernie_gen for the dygraph mode
      b725cbd0
    • L
      [Transformer] Simplify transformer reader and fix TranslationDataset (#5035) · 30ccfc67
      liu zhengxi committed
      * fix translation dataset and simplify transformer reader
      30ccfc67
    • S
      update code for paddlenlp text cls example · 8c9d8f56
      Steffy-zxf committed
      update code for paddlenlp text cls example
      8c9d8f56
    • L
      Update seq2seq example (#5016) · 7fae3401
      LiuChiachi committed
      * update seq2seq, using paddlenlp
      
      * Using new paddlenlp API
      
      * update seq2seq README
      
      * wrap dev ds
      
      * delete useless comments
      
      * update predict.py
      
      * using paddlenlp.bleu
      
      * remove shard
      
      * update README, using bleu perl
      
      * delete cand
      
      * Remove tokens that make sentences longer than max_len
      
      * remove pdb
      
      * remove useless code.
      
      * update url and dataset name of vae dataset(ptb and yahoo)
      
      * update seq2seq and vae, data and README
      7fae3401
  2. 14 Dec, 2020 7 commits
  3. 13 Dec, 2020 2 commits
    • L
      Update couplet readme (#5039) · 05d5dee6
      LiuChiachi committed
      * update couplet readme
      
      * update generation example
      05d5dee6
    • L
      Add couplet examples (#5007) · 11ee20fb
      LiuChiachi committed
      * add couplet
      
      * simplify model code
      
      * simplify code
      
      * update couplet README
      
      * add pad_token to TranslationDataset, update CoupletDataset
      
      * update couplet url, add couplet generation example
      
      * update TranslationDataset
      
      * update classname to self in __init__
      
      * update README.md
      11ee20fb
  4. 12 Dec, 2020 8 commits
  5. 11 Dec, 2020 8 commits
    • Z
      remove run_ernie_crf.py · d3029c01
      Zeyu Chen committed
      d3029c01
    • N
      Add express task (#5024) · 99d39e52
      Noel committed
      * Add Express Example
      
      * Add Express Data
      
      * Add Ernie for Express Example
      
      * add the express for the paddlenlp
      Co-authored-by: Nwanghuijuan03 <wanghuijuan03@baidu.com>
      99d39e52
    • S
      Add Sentence Transformer for text matching and Add readme (#5004) · f5f9dee4
      Steffy-zxf committed
      * update docs
      
      * add sbert
      
      * add readme
      
      * update readme
      
      * update code
      f5f9dee4
    • X
      Fixed DGU typos. (#5018) · 7d80374d
      xiemoyuan committed
      7d80374d
    • S
      Fix dureader api bugs (#5021) · 33a279eb
      smallv0221 committed
      * update lrscheduler
      
      * minor fix
      
      * add pre-commit
      
      * minor fix
      
      * Add __len__ to squad dataset
      
      * minor fix
      
      * Add dureader robust prototype
      
      * dataset implement
      
      * minor fix
      
      * fix var name
      
      * add dureader-yesno train script and dataset
      
      * add readme and fix md5sum
      
      * integrate dureader datasets
      
      * change var names: segment to mode, root to data_file
      
      * minor fix
      
      * update var name
      
      * Fix api bugs
      33a279eb
    • K
      Optimize BigruCRF example (#5017) · 03d651b4
      kinghuin committed
      * optimize lac
      
      * formatted
      
      * optimize lac
      
      * optimize lac
      03d651b4
    • S
      Update datasets naming style (#5014) · ad4720ec
      smallv0221 committed
      * update lrscheduler
      
      * minor fix
      
      * add pre-commit
      
      * minor fix
      
      * Add __len__ to squad dataset
      
      * minor fix
      
      * Add dureader robust prototype
      
      * dataset implement
      
      * minor fix
      
      * fix var name
      
      * add dureader-yesno train script and dataset
      
      * add readme and fix md5sum
      
      * integrate dureader datasets
      
      * change var names: segment to mode, root to data_file
      
      * minor fix
      
      * update var name
      ad4720ec
    • S
      Add DuReader yesno and robust (#4992) · 26a0cd1e
      smallv0221 committed
      * update lrscheduler
      
      * minor fix
      
      * add pre-commit
      
      * minor fix
      
      * Add __len__ to squad dataset
      
      * minor fix
      
      * Add dureader robust prototype
      
      * dataset implement
      
      * minor fix
      
      * fix var name
      
      * add dureader-yesno train script and dataset
      
      * add readme and fix md5sum
      
      * integrate dureader datasets
      26a0cd1e
  6. 10 Dec, 2020 3 commits
    • X
      Unified the task name of DGU with paddle1.8 (#5011) · 2b2147b0
      xiemoyuan committed
      * Unified the task name with paddle1.8
      
      * fixed bug.
      2b2147b0
    • J
      Add TokenEmbedding (#4983) · e59f15a1
      Jack Zhou committed
      * Add TokenEmbedding
      
      * download corpus embedding data
      * load embedding data by specifying corpus name
      * extend the vocab of tokenizer from corpus embedding data
      
      * add unk token setting
      
      * modify tokenizer
      
      * add extend vocab
      
      * move jieba tokenizer and rename corpus_name->embedding_name
      
      * use bos url instead of localhost
      
      * add log when loading data
      
      * add token dot computation; add __repr__ of TokenEmbedding
      
      * add color logging
      
      * use paddlenlp.utils.log
      
      * adjust repr
      
      * update pretrained embedding table
      
      * fix padding idx
      e59f15a1
    • J
      add electra pretrain and modify style of electra modeling (#4990) · f07cdf53
      jeff41404 committed
      * add electra pretrain and modify style of electra modeling
      
      * add electra pretrain, modify style of electra modeling and fix problems of review
      
      * delete predict_classifer
      
      * modify accu to acc
      
      * add paddlenlp.metrics.glue
      f07cdf53
  7. 09 Dec, 2020 2 commits
  8. 08 Dec, 2020 2 commits
  9. 07 Dec, 2020 2 commits