Commits · 0b294906f910d5bec4edb4e21a8d31c097ed786d · PaddlePaddle / Paddle

20 Oct, 2020 4 commits
- lookup_table_v2_op_xpu report errors;test=kunlun (#28064) (#28100) · 0b294906
  Committed by yinhaofeng on Oct 20, 2020
```
* lookup_table_v2_op_xpu report errors;test=kunlun

* lookup_table_v2_op_xpu report errors;test=kunlun
```
- [cherry-pick] xpu adam op (#28031) (#28097) · ea45fb90
  Committed by yinhaofeng on Oct 20, 2020
```
* xpu adam op (#28031)

* lookup_table_xpu op report errors;test=kunlun

* add adam xpu op;test=kunlun

* reset lookup

* change adam wrong;test=kunlun

* add adam xpu op;test=kunlun
```
- add rois_num params for roi_align_xpu op, test=kunlun (#28094) · 4f43d51f
  Committed by Double_V on Oct 20, 2020
- [cherry-pick] Add xpu transpose2 op.test=kunlun (#28096) · 11adb0f3
  Committed by TeslaZhao on Oct 20, 2020

19 Oct, 2020 13 commits

- Fix xpu error message (#28061) (#28092) · 91727ac8
  Committed by Chengmo on Oct 19, 2020
```
* fix error message,test=kunlun

* fix, test=kunlun
```

- Allclose op (#27891) (#28069) · 6bb6cb27
  Committed by huangxu96 on Oct 19, 2020

* Fixed an allclose_op bug where some fp64 inputs could not be handled.

* Improved CUDA kernel performance.

* Fixed a bug in the CUDA kernel, which could not handle inputs with large dimensions, and added a unit test for it.

* Add a test case for float32 input.
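
For context on what the fixes above concern, here is a minimal pure-Python sketch of the closeness criterion that allclose-style checks conventionally use (the one documented for NumPy's `allclose`): `|a - b| <= atol + rtol * |b|`. It assumes the same convention applies here. The name `allclose_sketch` and the default tolerances are illustrative assumptions, not the Paddle kernel from this commit.

```python
import math


def allclose_sketch(a, b, rtol=1e-05, atol=1e-08, equal_nan=False):
    """Illustrative only: check |x - y| <= atol + rtol * |y| for every pair.

    `a` and `b` are equal-length sequences of Python floats (double precision).
    """
    if len(a) != len(b):
        raise ValueError("inputs must have the same length")
    for x, y in zip(a, b):
        # NaN never compares close unless the caller opts in via equal_nan.
        if math.isnan(x) or math.isnan(y):
            if equal_nan and math.isnan(x) and math.isnan(y):
                continue
            return False
        # Infinities are close only when they are identical.
        if math.isinf(x) or math.isinf(y):
            if x != y:
                return False
            continue
        if abs(x - y) > atol + rtol * abs(y):
            return False
    return True
```

Because Python floats are 64-bit, a sketch like this can serve as a rough reference when comparing against a kernel's behaviour on fp64 inputs.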

- rm max_input in conv2d for kunlun, test=kunlun (#28063) · 905b0765
  Committed by xiaoting on Oct 19, 2020

- error message opt for XPU, test=kunlun (#27972) (#28078) · 8600f474
  Committed by Double_V on Oct 19, 2020

* add stack pool2d roi_align xpu op,test=kunlun

* error message opt, test=kunlun

* add xpu unittest,test=kunlun

* skip check grad,test=kunlun

* fix boostget , test=kunlun

* error message opt for XPU, test=kunlun

- cherry pick 27861 Add truncated_gaussian_random XPU kernel, test=kunlun (#28060) · 46a1f69b
  Committed by pangyoki on Oct 19, 2020

* Add truncated_gaussian_random_op XPU kernel

* Add truncated_gaussian_random_op XPU kernel, test=kunlun

* little change, test=kunlun

* change boost_get to BOOST_GET_CONST

* change boost_get to BOOST_GET_CONST, test=kunlun

* little change, test=kunlun

* use Generator to generate random number and optimize format, test=kunlun

* little change, test=kunlun

* add TODO, test=kunlun

- cherry pick 27853 Add gaussian_random XPU kernels, test=kunlun (#28059) · b21409e0
  Committed by pangyoki on Oct 19, 2020

* Add gaussian_random XPU kernels

* commit kunlun, test=kunlun

* new version, test=kunlun

* change boost_get to BOOST_GET_CONST, test=kunlun

* use Generator to generate random number and optimize format, test=kunlun

* add TODO, test=kunlun

- cherry pick 27846 Add uniform_random XPU kernel, test=kunlun (#28057) · 69ec13cd
  Committed by pangyoki on Oct 19, 2020

* support uniform_random op on Baidu Kunlun

* change dtype of attr shape from int to int64_t

* kunlun ci, test=kunlun

* new version, test=kunlun

* change boost_get to BOOST_GET_CONST

* change boost_get to BOOST_GET_CONST, test=kunlun

* use Generator to generate random number and optimize format

* run Kunlun CI, test=kunlun

* add TODO, test=kunlun

- cherry pick 27946 Fix error message of multinomial op (#28080) · 386429be
  Committed by pangyoki on Oct 19, 2020

* fix multinomial doc

* fix multinomial error message

* little doc change

* fix Categorical class doc

* optimize format of error message

* fix CPU Kernel error message format

* fix isinf and isnan error in WindowsOPENBLAS CI

* delete inf and nan

* add manual_seed in sample code

* little error message change

* change error message to InvalidArgument

* add full stop to error message and add manual_seed in CPU environment

- Fix diag OP bug on Windows Python3.8, cherry-pick from #28034 · e3a88eb4
  Committed by LutaoChu on Oct 19, 2020
```
Fix diag OP bug on Windows Python3.8, remove the std::min
```

- add cast/concat/assign xpu op (#27911) (#28050) · 77eddf91
  Committed by liuyuhui on Oct 19, 2020

* addd

* add cast_op_xpu, test=kunlun

* fix bug for cast_op_xpu,test=kunlun

* add concat_op_xpu, test=kunlun

* solve conflicts, test=kunlun

* fix bug,test=kunlun

* add assign_op_xpu, test=kunlun

* fix bug,test=kunlun

* test=kunlun;test=develop

* fix concat bug,test=kunlun

* fix check_dygraph set in test_concat_op_xpu.py,test=kunlun

* fix error message,test=kunlun
Co-authored-by: mapingshuo <mps2012@yeah.net>

- update yolo_box support h != w. test=develop (#28054) · f05b184f
  Committed by Kaipeng Deng on Oct 19, 2020

- [cherry-pick] polish kunlun error message for 2.0 rc (#28048) · 5c1babde
  Committed by xiaoting on Oct 19, 2020

* polish error message,test=kunlun

* polish error,test=kunlun

* polish error,test=kunlun

* polish error,test=kunlun

- [cherry-pick] Incorporate cudnn_lstm into LSTM api (#27217) (#28023) · 3f565903
  Committed by Guo Sheng on Oct 19, 2020

* Incorporate cudnn_lstm into LSTM api (#27217)

* Incorporate cudnn_lstm into LSTM api.
test=develop

* Make coalesce_tensor support alignment optionally.
test=develop

* Reorganize RNN apis. test=develop

* Fix cudnn rnn layout conversion.
test=develop

* Add sequence_length support for RNN cudnn implement.
Add optional init_h and init_c gradient for cudnn_lstm_op.
test=develop

* Use create_parameter for rnn cudnn impl.
test=develop

* Move `self._flat_weight = self.create_parameter()` in RNNBase to main_program.
test=develop

* Update RNN api unittest to use set_device.
test=develop

* Fix set_place for unit tests of RNN apis.
test=develop

* Fix use_align in coalesce_tensor_op.
test=develop

* Adjust RNN apis arguments according to comments.
test=develop

* Polish documents for SimpleRNN apis.
test=develop

* Refine random seed in cudnn_lstm_op.
Expose rnn params from sublayers to RNN.
test=develop

* Fix RNN saving for jit.save.
Refine cudnn_lstm dropout behavior.
test=develop

* Fix doc of GRU. test=develop

* Use ShareDataWith to avoid copying for cudnn_lstm_op test.
test=develop

* Remove updates on cudnn_lstm temporarily.
test=develop

* Use ShareDataWith to avoid copying for cudnn_lstm_op test.
test=develop

* Refine random seed in cudnn_lstm_op.
test=develop

* Fix test_lstm by adjusting ConcreteProgram buffer getter.
test=develop

* Use create_parameter instead of create_var for rnn._flat_weight for static graph usage.
test=develop

* Remove W input for cudnn_lstm to pass unused_var_check.
test=develop

* Add test_predict for RNN unit tests coverage.
test=develop

* Fix code style of rnn.
test=develop

* Fix F.rnn usage in rnn.py.
test=develop

* Fix test_lstm unittest failed and Add more unittest (#28029)

* fix test_lstm unittest failed

* add more unittest

* modify cmakelist

* fix judgement
Co-authored-by: Aurelius84 <zhangliujie@baidu.com>

17 Oct, 2020 2 commits

- add xpu error message description in some ops · a6c18075
  Committed by Jack Zhou on Oct 17, 2020
```
add xpu error message description
```

- [oneDNN] Conv dilation support (#27914) (#28028) · ea76fe31
  Committed by lidanqing on Oct 17, 2020

* conv dilated mkldnn support: forward and backward pass

* add mkldnn conv_transpose dilation UT
test=develop

* remove unnecessary PADDLE_ENFORCE

* add int8 and bf16 dilated conv UT

* update according to reviews

16 Oct, 2020 2 commits
- Feature/large scale kv save base/delta (#27470) (#27990) · c0550b54
  Committed by tangwei12 on Oct 16, 2020
```
* add size method for large scale

* add large scale UT

* add ut for checkpoint
```
- fix kunlun kernel of reshape op (#27989) · 50d24899
  Committed by mapingshuo on Oct 16, 2020

15 Oct, 2020 6 commits
- [cherry-pick2.0]Add tensor clone 2.0 (#27982) · b57254ed
  Committed by Zhou Wei on Oct 15, 2020
```
* add tensor clone (#27953)

* add tensor clone

* fix unittest test_var_base

* fix bug of tensor copy of CUDAPinnedPlace (#27966)
```
- [paddle.fleet] geo send sparse optimize (#27719) (#27979) · c0061ff5
  Committed by 123malin on Oct 15, 2020
```
* test=develop, fix geo sgd communicator and gloo http_init for ps
```
- error message optimization in mean_xpu,softmax_with_cross_entropy_op_xpu,test=kunlun (#27968) · 51dd268c
  Committed by Guanghua Yu on Oct 15, 2020
- support channel last in BatchNorm*d (#27961) · 429c0b62
  Committed by Feiyu Chan on Oct 15, 2020
```
1. support channel last in BatchNorm*d (#27875)
2. fix a bug in batch_norm_op cuda kernel by extracting ResizeToChannelFirst(Last), TransToChannelFirst(Last) to operators/layer_utils.h
```
- reshape support bool, test=develop (#27944) (#27971) · 39c31a20
  Committed by mapingshuo on Oct 15, 2020
- add reduce xpu op test=develop;test=kunlun (#27960) · 1f45c06e
  Committed by Qinghe JING on Oct 15, 2020

14 Oct, 2020 13 commits
- Support setting xpu place in dygraph mode (#27909) · 9a2a4b5f
  Committed by Leo Chen on Oct 14, 2020
```
* support setting xpu place

* add ut, test=kunlun
```
- Fix adam (#27778) · 263a9e97
  Committed by MRXLT on Oct 14, 2020
```
* fix adam

* fix gpu adam

* fix code style

* fix ut

* update ut add cuda code
```
- kunlun add op (#27890) · b0edda4d
  Committed by Double_V on Oct 14, 2020
```
* add stack pool2d roi_align xpu op,test=kunlun

* error message opt, test=kunlun

* add xpu unittest,test=kunlun

* skip check grad,test=kunlun

* fix boostget , test=kunlun
```
- Add elementwise XPU OP kernel for KUNLUN core, including (but still cannot process common broadcast · c791df09
  Committed by Jack Zhou on Oct 14, 2020
```
Add elementwise XPU OP kernel for KUNLUN core, including (but still cannot process common broadcast
```
- xpu support for fill_constant Op (#27675) · c5fcc96d
  Committed by wangchaochaohu on Oct 14, 2020
- [paddle.fleet] fix sparse load (#27680) · 328cb289
  Committed by Chengmo on Oct 14, 2020
```
* add sparse tensor load method
```
- fix paddle error information (#27889) · cf70d5b3
  Committed by tangwei12 on Oct 14, 2020
- update the code for the topk message optimize · 95aa5342
  Committed by wawltor on Oct 14, 2020
```
update the code for the topk message optimize 
```
- Polish some error message in operators (#27876) · 4ba977c7
  Committed by Chen Weihang on Oct 14, 2020
```
* polish some error message

* add white list

* revert shell script change
```
- [paddle.fleet] bug fix for parameter_recv (#27838) · a4f85074
  Committed by 123malin on Oct 14, 2020
```
* test=develop, bug fix for parameter_recv
* test=develop, for unittest, test_fleet_rolemaker_new
```
- support kunlun matmul_v2 (#27910) · 2712d076
  Committed by QingshuChen on Oct 14, 2020
```
*test=kunlun
```
- Multi task (#26002) · 5a83496c
  Committed by zhang wenhui on Oct 14, 2020
```
* add multitask

* add multitask, test=develop

* fix code style, test=develop

* add partial push dense, test=develop

* fix has_key in py3, test=develop

* fix, test=develop

* fix, test=develop

* fix, test=develop
```
- fix norm api doc, test=develop (#27652) · 7a58431c
  Committed by zhang wenhui on Oct 14, 2020
```
* fix norm api doc, test=develop

* fix error message, test=develop

* fix api norm, test=develop

* add adagrad, test=develop

* fix bug, test=develop

* fix bug, test=develop

* add spectral_norm, test=develop

* fix adagrad, test=develop

* merge , test=develop
```

PaddlePaddle / Paddle · synced successfully more than 1 year ago