Commits · 65c1f0b99bec27c27b04f64bc0b4f4f8b5a46492 · PaddlePaddle / models

15 Dec, 2020 · 12 commits

Committed by kinghuin on Dec 15, 2020

* add ernie_gen

* optimize ernie_gen

* optimize ernie_gen

* optimize ernie_gen code

* fix crf bug

* add ernie_gen __init__.py

* modify nlp version

* fix ernie_gen predict

* optimize doc

65c1f0b9

remove examples/bert, unify to benchmark/bert · e92d5d40
Committed by Zeyu Chen on Dec 15, 2020

e92d5d40

add the support token_embedding of express (#5058) · b84cace2
Committed by wawltor on Dec 15, 2020

b84cace2

Fix ernie_gen and CRF bug (#5057) · 16c6580d

Committed by kinghuin on Dec 15, 2020

* add ernie_gen

* optimize ernie_gen

* optimize ernie_gen

* optimize ernie_gen code

* fix crf bug

* add ernie_gen __init__.py

* modify nlp version

* fix ernie_gen predict

16c6580d

Rename 'decoder' to 'decode'. (#5056) · 885432bf
Committed by xiemoyuan on Dec 15, 2020

885432bf

add glue and electra readme (#5051) · 3698045c
Committed by jeff41404 on Dec 15, 2020

3698045c

Upgrade plato2 using paddle2.0 (#5002) · b76f1591

Committed by xiemoyuan on Dec 15, 2020

* The first version of plato2. Not finished the network.

* Update decode strategy.

* Update decode strategy.

* Completed the encoder and decoder. But it will oom.

* Completed the encoder and decoder.

* backend the code.

* Only completed the network of plato2 and nsp.

* Completed the development. But the effect has not been verified.

* Add readme and remove the code about deal PY2 and PY3.

* Modify comment.

* Modify readme and add images.

* Delete the data folder.

b76f1591

update docs for ernie-tiny · 3366cf65
Committed by Steffy-zxf on Dec 15, 2020
3366cf65

add ernie_gen for the dygraph mode · b725cbd0
Committed by kinghuin on Dec 15, 2020
b725cbd0

[Transformer] Simplify transformer reader and fix TranslationDataset (#5035) · 30ccfc67
Committed by liu zhengxi on Dec 15, 2020
```
* fix translation dataset and simplify transformer reader
```
30ccfc67

update codes for paddlenlp text cls example · 8c9d8f56
Committed by Steffy-zxf on Dec 15, 2020
8c9d8f56

Update seq2seq example (#5016) · 7fae3401

Committed by LiuChiachi on Dec 15, 2020

* update seq2seq, using paddlenlp

* Using new paddlenlp API

* update seq2seqREADME

* wrap dev ds

* delete useless comments

* update predict.py

* using paddlenlp.bleu

* remove shard

* update README, using bleu perl

* delete cand

* Remove tokens that make sentences longer than max_len

* remove pdb

* remove useless code.

* update url and dataset name of vae dataset(ptb and yahoo)

* update seq2seq and vae, data and README

7fae3401

14 Dec, 2020 · 7 commits

update the readme for the word_embedding (#5050) · 33d65b31
Committed by wawltor on Dec 14, 2020
33d65b31

Add more embedding and sample for the TokenEmbedding · ec17d938

Committed by Jack Zhou on Dec 14, 2020

* add all wiki embedding and part of baidu encyclopedia embedding.

* add embedding example

* add people_daily, weibo, sougou pretrained embedding

* add zhihu, financial, literature embedding

* Add embedding model readme; add embedding train example and readme

* fix README example

* fix embedding doc

ec17d938

update docs (#5049) · 954f02ca

Committed by Steffy-zxf on Dec 14, 2020

* add paddle.models api reference docs

* update docs

* update docs

* update docs

* tmp

* update docs

* update docs

* update docs

* update docs

* update docs

* update docs

954f02ca

clean hapi notes, clean eager_run and modify num_labels to num_classes (#5045) · 6788ab2b
Committed by jeff41404 on Dec 14, 2020

6788ab2b

fix bugs (#5046) · 16693577
Committed by LiuChiachi on Dec 14, 2020

16693577

Fix TranslationDataset bug (#5043) · 7e876736
Committed by LiuChiachi on Dec 14, 2020
```
* fix translation.py bugs

* delete useless comments

* set couplet download root None
```
7e876736

update paddlenlp text matching docs (#5041) · 0ba45dfe

Committed by Steffy-zxf on Dec 14, 2020

* add paddle.models api reference docs

* update docs

* update docs

* update docs

* tmp

* update docs

* update docs

* update docs

* update docs

0ba45dfe

13 Dec, 2020 · 2 commits

Update couplet readme (#5039) · 05d5dee6
Committed by LiuChiachi on Dec 13, 2020
```
* update couplet readme

* update generation example
```
05d5dee6

Add couplet examples (#5007) · 11ee20fb

Committed by LiuChiachi on Dec 13, 2020

* add couplet

* simplify model code

* simplify code

* update couplet README

* add pad_token to TranslationDataset, update CoupletDataset

* update couplet url, add couplet generation example

* update TranslationDataset

* update classname to self in __init__

* update README.md

11ee20fb

12 Dec, 2020 · 8 commits
- update README.md · 9becf7bb
  Committed by Zeyu Chen on Dec 12, 2020
  
  9becf7bb
- update examples README · becb08ec
  Committed by Zeyu Chen on Dec 12, 2020
  
  becb08ec
- Update text matching docs (#5033) · fa900a40
  Committed by Steffy-zxf on Dec 12, 2020
```
* add paddle.models api reference docs

* update docs

* update docs

* update docs
```
  fa900a40
- Add the model zoo readme for the paddlenlp (#5032) · 392ae994
  Committed by wawltor on Dec 12, 2020
```
Co-authored-by: wawltor <fanzeyang0904@hotmail.com>
```
  392ae994
- unify run_glue in bert, electra and so on (#5026) · f22a2b1b
  Committed by jeff41404 on Dec 12, 2020
  
  f22a2b1b
- add paddle.models api reference docs (#5030) · e6e41bc1
  Committed by Steffy-zxf on Dec 12, 2020
```
* add paddle.models api reference docs

* update docs

* update docs
```
  e6e41bc1
- fix lac typo and image url (#5028) · 344d03bf
  Committed by kinghuin on Dec 12, 2020
  
  344d03bf
- remove useless README · 03e3dd97
  Committed by Zeyu Chen on Dec 12, 2020
  
  03e3dd97

11 Dec, 2020 · 8 commits

remove run_ernie_crf.py · d3029c01
Committed by Zeyu Chen on Dec 11, 2020

d3029c01

Add express task (#5024) · 99d39e52

Committed by Noel on Dec 11, 2020

* Add Express Example

* Add Express Data

* Add Ernie for Express Example

* add the express for the paddlenlp

Co-authored-by: wanghuijuan03 <wanghuijuan03@baidu.com>

99d39e52

Add Sentence Transformer for text matching and Add readme (#5004) · f5f9dee4
Committed by Steffy-zxf on Dec 11, 2020
```
* update docs

* add sbert

* add readme

* update readme

* update codes
```
f5f9dee4

fixed DGU typos. (#5018) · 7d80374d
Committed by xiemoyuan on Dec 11, 2020

7d80374d

Fix dureader api bugs (#5021) · 33a279eb

Committed by smallv0221 on Dec 11, 2020

* update lrscheduler

* minor fix

* add pre-commit

* minor fix

* Add __len__ to squad dataset

* minor fix

* Add dureader robust prototype

* dataset implement

* minor fix

* fix var name

* add dureader-yesno train script and dataset

* add readme and fix md5sum

* integrate dureader datasets

* change var names: segment to mode, root to data_file

* minor fix

* update var name

* Fix api bugs

33a279eb

Optimize BigruCRF example (#5017) · 03d651b4
Committed by kinghuin on Dec 11, 2020
```
* optimize lac

* formatted

* optimize lac

* optimize lac
```
03d651b4

Update datasets naming style (#5014) · ad4720ec

Committed by smallv0221 on Dec 11, 2020

* update lrscheduler

* minor fix

* add pre-commit

* minor fix

* Add __len__ to squad dataset

* minor fix

* Add dureader robust prototype

* dataset implement

* minor fix

* fix var name

* add dureader-yesno train script and dataset

* add readme and fix md5sum

* integrate dureader datasets

* change var names: segment to mode, root to data_file

* minor fix

* update var name

ad4720ec

Add DuReader yesno and robust (#4992) · 26a0cd1e

Committed by smallv0221 on Dec 11, 2020

* update lrscheduler

* minor fix

* add pre-commit

* minor fix

* Add __len__ to squad dataset

* minor fix

* Add dureader robust prototype

* dataset implement

* minor fix

* fix var name

* add dureader-yesno train script and dataset

* add readme and fix md5sum

* integrate dureader datasets

26a0cd1e

10 Dec, 2020 · 3 commits

Unified the task name of DGU with paddle1.8 (#5011) · 2b2147b0
Committed by xiemoyuan on Dec 10, 2020
```
* Unified the task name with paddle1.8

* fixed bug.
```
2b2147b0

Add TokenEmbedding (#4983) · e59f15a1

Committed by Jack Zhou on Dec 10, 2020

* Add TokenEmbedding

* download corpus embedding data
* load embedding data by specifying corpus name
* extend the vocab of tokenizer from corpus embedding data

* add unk token setting

* modify tokenizer

* add extend vocab

* move jieba tokenizer and rename corpus_name->embedding_name

* use bos url instead of localhost

* add log when loading data

* add token dot computation; add __repr__ of TokenEmbedding

* add color logging

* use paddlenlp.utils.log

* adjust repr

* update pretrained embedding table

* fix padding idx

e59f15a1

add electra pretrain and modify style of electra modeling (#4990) · f07cdf53

Committed by jeff41404 on Dec 10, 2020

* add electra pretrain and modify style of electra modeling

* add electra pretrain, modify style of electra modeling and fix problems of review

* delete predict_classifer

* modify accu to acc

* add paddlenlp.metrics.glue

f07cdf53

PaddlePaddle / models synced successfully more than 1 year ago
