提交 · 30ccfc676bf257090dabfc6a21938932239422ad · PaddlePaddle / models

15 12月, 2020 2 次提交

L
[Transformer] Simplify transformer reader and fix TranslationDataset (#5035) · 30ccfc67
由 liu zhengxi 提交于 12月 15, 2020
```
* fix translation dataset and simplify transformer reader
```
30ccfc67

Update seq2seq example (#5016) · 7fae3401

由 LiuChiachi 提交于 12月 15, 2020

* update seq2seq, using paddlenlp

* Using new paddlenlp API

* update seq2seqREADME

* wrap dev ds

* delete useless comments

* update predict.py

* using paddlenlp.bleu

* remove shard

* update README, using bleu perl

* delete cand

* Remove tokens that make sentences longer than max_len

* remove pdb

* remove useless code.

* update url and dataset name of vae dataset(ptb and yahoo)

* update seq2seq and vae, data and README

7fae3401

12 12月, 2020 1 次提交
- Z
  
  remove uselss README · 03e3dd97
  由 Zeyu Chen 提交于 12月 12, 2020
  
  03e3dd97
10 12月, 2020 1 次提交

Add TokenEmbedding (#4983) · e59f15a1

由 Jack Zhou 提交于 12月 10, 2020

* Add TokenEmbedding

* download corpus embedding data
* load embedding data by specifying corpus name
* extend the vocab of tokenizer from corpus embedding data

* add unk token setting

* modify tokenizer

* add extend voacb

* move jieba tokenizer and rename corpus_name->embedding_name

* use bos url instead of localhost

* add log when loading data

* add token dot computation; add __repr__ of TokenEmbedding

* add color logging

* use paddlenlp.utils.log

* adjust repr

* update pretrained embedding table

* fix padding idx

e59f15a1

08 12月, 2020 2 次提交
- Z
  
  update PaddleNLP README · 8c076ce0
  由 Zeyu Chen 提交于 12月 08, 2020
  
  8c076ce0
- L
  
  update readme of transformer benchmark (#4997) · 840e9beb
  由 liu zhengxi 提交于 12月 08, 2020
  
  840e9beb
07 12月, 2020 2 次提交
- L
  
  fix predict (#4985) · 9c7c5a53
  由 liu zhengxi 提交于 12月 07, 2020
  
  9c7c5a53
- Z
  
  reorganize legacy files · bf483c0f
  由 Zeyu Chen 提交于 12月 07, 2020
  
  bf483c0f

PaddlePaddle / models 大约 1 年 前同步成功

PaddlePaddle / models
大约 1 年前同步成功