1. 15 Dec, 2020 12 commits
  2. 14 Dec, 2020 7 commits
  3. 13 Dec, 2020 2 commits
    • Update couplet readme (#5039) · 05d5dee6
      LiuChiachi committed
      * update couplet readme
      
      * update generation example
    • Add couplet examples (#5007) · 11ee20fb
      LiuChiachi committed
      * add couplet
      
      * simplify model code
      
      * simplify code
      
      * update couplet README
      
      * add pad_token to TranslationDataset, update CoupletDataset
      
      * update couplet url, add couplet generation example
      
      * update TranslationDataset
      
      * update classname to self in __init__
      
      * update README.md
  4. 12 Dec, 2020 8 commits
  5. 11 Dec, 2020 8 commits
    • remove run_ernie_crf.py · d3029c01
      Zeyu Chen committed
    • Add express task (#5024) · 99d39e52
      Noel committed
      * Add Express Example
      
      * Add Express Data
      
      * Add Ernie for Express Example
      
      * add the express for the paddlenlp
      Co-authored-by: Nwanghuijuan03 <wanghuijuan03@baidu.com>
    • Add Sentence Transformer for text matching and Add readme (#5004) · f5f9dee4
      Steffy-zxf committed
      * update docs
      
      * add sbert
      
      * add readme
      
      * update readme
      
      * update codes
    • fixed DGU typos. (#5018) · 7d80374d
      xiemoyuan committed
    • Fix dureader api bugs (#5021) · 33a279eb
      smallv0221 committed
      * update lrscheduler
      
      * minor fix
      
      * add pre-commit
      
      * minor fix
      
      * Add __len__ to squad dataset
      
      * minor fix
      
      * Add dureader robust prototype
      
      * dataset implement
      
      * minor fix
      
      * fix var name
      
      * add dureader-yesno train script and dataset
      
      * add readme and fix md5sum
      
      * integrate dureader datasets
      
      * change var names: segment to mode, root to data_file
      
      * minor fix
      
      * update var name
      
      * Fix api bugs
    • Optimize BigruCRF example (#5017) · 03d651b4
      kinghuin committed
      * optimize lac
      
      * formatted
      
      * optimize lac
      
      * optimize lac
    • Update datasets naming style (#5014) · ad4720ec
      smallv0221 committed
      * update lrscheduler
      
      * minor fix
      
      * add pre-commit
      
      * minor fix
      
      * Add __len__ to squad dataset
      
      * minor fix
      
      * Add dureader robust prototype
      
      * dataset implement
      
      * minor fix
      
      * fix var name
      
      * add dureader-yesno train script and dataset
      
      * add readme and fix md5sum
      
      * integrate dureader datasets
      
      * change var names: segment to mode, root to data_file
      
      * minor fix
      
      * update var name
    • Add DuReader yesno and robust (#4992) · 26a0cd1e
      smallv0221 committed
      * update lrscheduler
      
      * minor fix
      
      * add pre-commit
      
      * minor fix
      
      * Add __len__ to squad dataset
      
      * minor fix
      
      * Add dureader robust prototype
      
      * dataset implement
      
      * minor fix
      
      * fix var name
      
      * add dureader-yesno train script and dataset
      
      * add readme and fix md5sum
      
      * integrate dureader datasets
  6. 10 Dec, 2020 3 commits
    • Unified the task name of DGU with paddle1.8 (#5011) · 2b2147b0
      xiemoyuan committed
      * Unified the task name with paddle1.8
      
      * fixed bug.
    • Add TokenEmbedding (#4983) · e59f15a1
      Jack Zhou committed
      * Add TokenEmbedding
      
      * download corpus embedding data
      * load embedding data by specifying corpus name
      * extend the vocab of tokenizer from corpus embedding data
      
      * add unk token setting
      
      * modify tokenizer
      
      * add extend vocab
      
      * move jieba tokenizer and rename corpus_name->embedding_name
      
      * use bos url instead of localhost
      
      * add log when loading data
      
      * add token dot computation; add __repr__ of TokenEmbedding
      
      * add color logging
      
      * use paddlenlp.utils.log
      
      * adjust repr
      
      * update pretrained embedding table
      
      * fix padding idx
    • add electra pretrain and modify style of electra modeling (#4990) · f07cdf53
      jeff41404 committed
      * add electra pretrain and modify style of electra modeling
      
      * add electra pretrain, modify style of electra modeling and fix problems of review
      
      * delete predict_classifer
      
      * modify accu to acc
      
      * add paddlenlp.metrics.glue