Commits · 495e7f9c848bb6d36b2ba64bf84fdebf5da3f71b · BaiXuePrincess / Paddle

31 Mar, 2021 4 commits

Update eigen version to f612df27 (#31832) · 495e7f9c

Committed by wuhuanzhou on Mar 31, 2021

* update eigen version to f612df27, test=develop

* fix compilation error, test=develop

* remove patch command in eigen, test=develop

* fix compilation error caused by calling Eigen functions with float16 and bfloat16, test=develop

* fix unittest error, test=develop

* fix unittest error caused by precision, test=develop

* remove patch files used by old version eigen, test=develop

495e7f9c

fix some bug in transformer training in xpu (#31918) · 52b05bac
Committed by taixiurong on Mar 31, 2021

52b05bac

support minus-int idx to LayerList (#31750) · 5394194e
Committed by Wenyu on Mar 31, 2021
```
* support minus-int idx to LayerList
* update layerlist test
```
5394194e
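
The LayerList change above (#31750) lets a negative integer index count from the end of the list. A minimal sketch of that kind of index normalization, assuming plain Python containers; `normalize_index` is a hypothetical helper for illustration and is not taken from Paddle's source:

```python
def normalize_index(idx, length):
    """Map a possibly negative index onto the range [0, length)."""
    if not isinstance(idx, int):
        raise TypeError("index must be an int, got {}".format(type(idx).__name__))
    if not -length <= idx < length:
        raise IndexError("index {} out of range for length {}".format(idx, length))
    # Negative indices count from the end, as with built-in lists.
    return idx + length if idx < 0 else idx


layers = ["linear_0", "relu_0", "linear_1"]  # stand-ins for real sublayers
last = layers[normalize_index(-1, len(layers))]
```

Normalizing once at the boundary keeps any later lookup by position (for example, by a string key derived from the index) working with non-negative values only.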

[ROCM] Add ROCm support for warpctc op (#31817) · ef8323d4

Committed by furnace on Mar 31, 2021

* bugfix for warpctc

* fix warpctc commit id

* fix warpctc commit id

* fix warpctc commit id

* fix warpctc commit id

* fix warpctc commit id

* fix WARPCTC_WITH_HIP invalid

* Add logs to find out why can not dlopen libwarpctc.so

* fix warpctc commit id

* fix unit test test_warpctc_op

* Optimize failed log for dlopen

* Optimize failed log for dlopen

* Delete extra changes

* fix warpctc commit id

* fix warpctc commit id

* Add is_compiled_with_rocm for test_warpctc_op

* fix warpctc commit id

* Cancel the optimization of the dlopen failure reason and move it to the next PR, because it makes Windows CI fail

* Cancel the optimization of the dlopen failure reason and move it to the next PR, because it makes Windows CI fail

* Cancel the optimization of the dlopen failure reason and move it to the next PR, because it makes Windows CI fail

* fix code style problems

ef8323d4

30 Mar, 2021 8 commits

[dynamic setitem] Fix bug of dynamic setitem: Decrease axes to do right broadcast (#31960) · 57d4288a
Committed by liym27 on Mar 30, 2021

57d4288a

Added int8 kernel for oneDNN LSTM op (#31894) · 6dca7a1d
Committed by jakpiase on Mar 30, 2021

6dca7a1d

fix bug when dtype of to_tensor is core.VarType (#31931) · 245252b8
Committed by Zhou Wei on Mar 30, 2021

245252b8

add exclusive for test_conv2d_op, test=develop (#31936) · fe284868
Committed by wangguanzhong on Mar 30, 2021

fe284868

add deprecated for softmax_with_cross_entropy (#31722) · 73a6fa3e

Committed by chajchaj on Mar 30, 2021

* add deprecated for softmax_with_cross_entropy, test=develop

* test for deprecated in english doc, test=develop

* test deprecated for softmax_with_cross_entropy in english doc, test=develop

* fix readme and English doc for cross_entropy, test=develop

* rm test for softmax_with_cross_entropy deprecated, test=develop

* update readme for CrossEntropyLoss, test=develop

* fix readme format, test=develop

* fix readme format, test=develop

* fix readme format for cross_entropy, test=develop

* add softmax_switch and fix softlabel for cross_entropy, test=develop

* 1) recover softmax_with_cross_entropy in fluid 2) change softmax_switch to use_softmax 3) add example for softlabel for cross_entropy, test=develop

* fix Example number for cross_entropy, test=develop

* fix code format, test=develop

* fix for CI-Coverage, test=develop

* fix for CI-Coverage, test=develop

* fix ci-coverage for Non-ASCII character '\xe2' in file, test=develop

* fix ci-coverage for Non-ASCII character '\xe2' in nn.layer.loss.py, test=develop

* update description for doc when use_softmax=False, test=develop

* fix some docs and code example for cross_entropy, test=develop

* delete redundant description for soft_label parameter of cross_entropy, test=develop

* fix some comment for test_cross_entropy_loss.py, test=develop

73a6fa3e
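
The commit messages above mention a `use_softmax` switch and soft-label support for cross_entropy. As a rough illustration of what those two options mean mathematically, here is a pure-Python sketch for a single sample. The function name `cross_entropy_1d` and its signature are assumptions made for this example; it is not Paddle's implementation and omits batching, weights, and reductions.

```python
import math


def cross_entropy_1d(x, label, soft_label=False, use_softmax=True):
    """Cross entropy for one sample.

    x: list of logits (use_softmax=True) or probabilities (use_softmax=False).
    label: class index (soft_label=False) or list of class probabilities.
    """
    if use_softmax:
        # Numerically stable log-softmax: subtract the max before exponentiating.
        m = max(x)
        log_z = m + math.log(sum(math.exp(v - m) for v in x))
        log_p = [v - log_z for v in x]
    else:
        # Input is assumed to already be a strictly positive distribution.
        log_p = [math.log(v) for v in x]
    if soft_label:
        # Expected negative log-likelihood under the target distribution.
        return -sum(t * lp for t, lp in zip(label, log_p))
    return -log_p[label]
```

With a one-hot soft label the result matches the hard-label case, which is a quick sanity check when switching between the two modes.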

fix batchnorm when input dims < 3 (#31933) · 8084b759
Committed by Shang Zhizhou on Mar 30, 2021
```
* fix batchnorm when input dims < 3

* add unittest for batchnorm dims = 2
```
8084b759

[Paddle-TRT] yolobox (#31755) · 64ee255f

Committed by zlsh80826 on Mar 30, 2021

* yolobox converter and plugin

* yolobox unittest

* add dynamic shape restriction

* fix git merge log

64ee255f

Fix segment Fault from set_value (#31891) · c4b60efa
Committed by Aurelius84 on Mar 30, 2021
```
* Avoid raising warning while import paddle

* fix segment fault of set_value

* fix code style
```
c4b60efa

29 Mar, 2021 4 commits

Fix bug of set_value op: Decrease axes to do right broadcast (#31875) · 525c32e3
Committed by liym27 on Mar 29, 2021

525c32e3

[ROCM] added a cudnn switch of conv2d for rocm platform (#31836) · 123949eb
Committed by ronnywang on Mar 29, 2021

123949eb

[Paddle-TRT] roi_align_plugin (#31732) · e3a38d79

Committed by zlsh80826 on Mar 29, 2021

* add roi_align_plugin

* add roi align unit_test

* add roi align serialization

* remove roi align static plugin because of batch dim issue

* refine roi align unittest and add fp16/serialization

* add trt roi align condition to op_teller

* refine error message

* remove unnecessary reshape layer

e3a38d79

[Paddle-TRT] trt affine channel converter (#31628) · bfb5cf55

Committed by zlsh80826 on Mar 29, 2021

* trt affine channel converter

* add trt affine channel base test

* add trt affine channel NHWC

* remove asterisk for python2 compatibility

* trt affine channel converter

* add trt affine channel base test

* add trt affine channel NHWC

* remove asterisk for python2 compatibility

* fix rebase

* move LodTensor to Tensor

* add dbg info

* affine channel converter only support NCHW

* scale,bias are parameters, use create_parameters api

* reduce test input size to not exceed the timelimit of ci

* refine affine channel unittest and add serialization/dynamic test

* change super to InferencePassTest for python2 compatibility

* change super to InferencePassTest for python2 compatibility

* fix affine channel fp16 serialize setting

bfb5cf55

26 Mar, 2021 3 commits

[dygraph qat] Use layer to calculate output scale (#31861) · b47478ef

Committed by cc on Mar 26, 2021

* Use layer to calculate output scale
* add backward for moving_average_abs_max_scale and save output scales to op's attr

b47478ef
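
The commit above refers to `moving_average_abs_max_scale`. A small sketch of one common formulation of a moving-average abs-max scale, assuming the update `state = rate * state + 1`, `accum = rate * accum + max|x|`, `scale = accum / state`. The class name and the default `rate` are hypothetical, and the exact update used by the Paddle op should be checked against its source:

```python
class MovingAverageAbsMax:
    """Tracks a bias-corrected moving average of max(|x|) over batches."""

    def __init__(self, rate=0.9):
        self.rate = rate    # decay factor (assumed default, not from the source)
        self.accum = 0.0    # decayed sum of per-batch abs-max values
        self.state = 0.0    # decayed count of batches seen

    def update(self, values):
        abs_max = max(abs(v) for v in values)
        self.state = self.rate * self.state + 1.0
        self.accum = self.rate * self.accum + abs_max
        # Dividing by the decayed count corrects the bias of early steps.
        return self.accum / self.state


tracker = MovingAverageAbsMax(rate=0.5)
first = tracker.update([1.0, -2.0])
second = tracker.update([4.0])
```

Because the accumulator is divided by a decayed count rather than a fixed constant, the first update returns the batch abs-max itself instead of a value shrunk toward zero.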

[3D-parallel] Reformat pipeline parallel (#31786) · c3974d0e
Committed by lilong12 on Mar 26, 2021
```
* update, test=develop
```
c3974d0e

[Paddle-TRT] multiclass nms (#31742) · 01aa2526

Committed by zlsh80826 on Mar 26, 2021

* add multiclass_nms

* add multiclass_nms unittest

* add default enable_tensorrt_oss option

* refine multiclass nms unittest and add serialization/dynamic test

* change super to InferencePassTest for python2 compatibility

* refine multiclass nms unittest

* move out dynamic shape test due to ci timelimit

01aa2526

25 Mar, 2021 2 commits
- 【Paddle.Fleet】fix dataset zip py3 bug (#31441) · f58cb018
  Committed by Chengmo on Mar 25, 2021
```
* fix zip py3 bug
```
  f58cb018
- LRScheduler.get_lr should not update lr in LinearWarmup (#31843) · 511e204e
  Committed by Zhou Wei on Mar 25, 2021
  
  511e204e
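
The title of #31843 says that `get_lr` should not update the learning rate. One way to read that requirement, shown as a sketch: `get_lr` is a pure function of the current step, and only `step()` mutates state. `LinearWarmupSketch` is a hypothetical standalone class written for this illustration; it is not `paddle.optimizer.lr.LinearWarmup` and does not wrap an inner scheduler.

```python
class LinearWarmupSketch:
    """Linear warmup from start_lr to end_lr, then a constant base_lr."""

    def __init__(self, base_lr, warmup_steps, start_lr, end_lr):
        self.base_lr = base_lr
        self.warmup_steps = warmup_steps
        self.start_lr = start_lr
        self.end_lr = end_lr
        self.last_step = 0

    def get_lr(self):
        # Read-only: calling this any number of times must not change state.
        if self.last_step < self.warmup_steps:
            frac = self.last_step / self.warmup_steps
            return self.start_lr + (self.end_lr - self.start_lr) * frac
        return self.base_lr

    def step(self):
        # The only place where the schedule advances.
        self.last_step += 1
```

Keeping the query side-effect free means logging or inspecting the learning rate between optimizer steps cannot silently advance the schedule.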
24 Mar, 2021 2 commits
- [Dy2stat] Fix the bug that loop_body_func may return single element (#31806) · 649868ff
  Committed by Huihuang Zheng on Mar 24, 2021
```
The old `loop_body` function may return a single element when `loop_vars` contains only one element, which can cause bugs. The key point of this PR is to force `loop_body` functions to always return a tuple.
```
  649868ff
- [ROCM] fix test_matmul_v2_op (#31802) · 270699e6
  Committed by ronnywang on Mar 24, 2021
  
  270699e6
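
The Dy2stat fix in #31806 above forces `loop_body` to always return a tuple. A minimal sketch of that idea in plain Python, assuming the loop variables themselves are not tuples; `force_tuple_return` and `run_while` are hypothetical names for illustration, not Paddle APIs:

```python
def force_tuple_return(loop_body):
    """Wrap loop_body so that it always returns a tuple of loop variables."""
    def wrapped(*loop_vars):
        result = loop_body(*loop_vars)
        # A body with one loop variable often returns the bare value;
        # wrap it so callers can always unpack the result.
        return result if isinstance(result, tuple) else (result,)
    return wrapped


def run_while(cond, body, loop_vars):
    """Tiny while-loop driver that unpacks loop_vars on every iteration."""
    body = force_tuple_return(body)
    while cond(*loop_vars):
        loop_vars = body(*loop_vars)
    return loop_vars


single = run_while(lambda i: i < 3, lambda i: i + 1, (0,))
pair = run_while(lambda i, s: i < 3, lambda i, s: (i + 1, s + i), (0, 0))
```

Without the wrapper, the single-variable case would hand an `int` back to a driver that expects to unpack a sequence, which fails on the second iteration.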
23 Mar, 2021 2 commits
- add coalesce_tensor into white list when checking re-creation of parameters (#31800) · 4046f130
  Committed by Feiyu Chan on Mar 23, 2021
  
  4046f130
- fix launch ps ut test=develop (#31771) · f72d197e
  Committed by gongweibao on Mar 23, 2021
```
fix launch ps ut test=develop
```
  f72d197e
22 Mar, 2021 3 commits

[Paddle-TRT] nearest_interp op (#31626) · bfced39e

Committed by zlsh80826 on Mar 22, 2021

* nearest_interp op converter w/ dynamic/static

* fix data_layout include

* add trt nearest unit_test

* add nearest_interp NHWC test

* update trt nearest interp nhwc testcase

* remove asterisk for python2 compatibility

* add empty line to prevent conflict

* nearest_interp op converter w/ dynamic/static

* fix data_layout include

* add trt nearest unit_test

* add nearest_interp NHWC test

* update trt nearest interp nhwc testcase

* remove asterisk for python2 compatibility

* add empty line to prevent conflict

* change the priority of out_h, out_w

bfced39e

[oneDNN] Initial bf16 amp integration (#31093) · 7ccf6b60
Committed by arlesniak on Mar 22, 2021

7ccf6b60

[3D-parallel] add 1f1b scheduler for pipeline (#31566) · a501a7b0
Committed by lilong12 on Mar 22, 2021
```
* add 1f1b scheduler for pp, test=develop
```
a501a7b0
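
The commit above adds a 1F1B ("one forward, one backward") scheduler for pipeline parallelism but does not describe its implementation. As background, here is a sketch of the widely used 1F1B ordering for a single pipeline stage: a warmup of forward passes, a steady phase that alternates one forward with one backward, then a cooldown of the remaining backwards. The function name and argument names are assumptions for this example, and the code only produces the order of operations, not Paddle's actual scheduler.

```python
def one_f_one_b_schedule(stage, num_stages, num_microbatches):
    """Return the list of ("F" | "B", microbatch_id) steps for one stage."""
    # Earlier stages run more forwards before their first backward arrives.
    warmup = min(num_stages - stage - 1, num_microbatches)
    steady = num_microbatches - warmup

    schedule = [("F", i) for i in range(warmup)]
    for i in range(steady):
        # Steady state: one forward, then one backward.
        schedule.append(("F", warmup + i))
        schedule.append(("B", i))
    # Cooldown: drain the backwards of the microbatches still in flight.
    schedule.extend(("B", steady + i) for i in range(warmup))
    return schedule


last_stage = one_f_one_b_schedule(stage=3, num_stages=4, num_microbatches=4)
first_stage = one_f_one_b_schedule(stage=0, num_stages=4, num_microbatches=4)
```

The practical benefit of this ordering is that each stage holds activations for at most `warmup + 1` microbatches at a time, instead of all of them as in a schedule that runs every forward first.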

21 Mar, 2021 1 commit
- [ROCM] fix test_conv2d_transpose_op (#31749) · 8c19d7aa
  Committed by ronnywang on Mar 21, 2021
  
  8c19d7aa
19 Mar, 2021 4 commits
- [oneDNN] Added Elementwise Mul grad fp32/bf16 (#31647) · 25fc2a1f
  Committed by Jacek Czaja on Mar 19, 2021
  
  25fc2a1f
- [ROCM] fix test_rnn_op (#31735) · c9e1d9dc
  Committed by ronnywang on Mar 19, 2021
  
  c9e1d9dc
- [oneDNN] lookup_table op with support for BF16 data type. (#31558) · a4a2b77d
  Committed by Adam Osewski on Mar 19, 2021
  
  a4a2b77d
- [ROCM] fix layer_norm, norm, p_norm, test_sequence_softmax_op, test_math_op_patch_var_base (#31709) · 420527f0
  Committed by ronnywang on Mar 19, 2021
  
  420527f0
18 Mar, 2021 2 commits
- [Paddle-TRT] gather converter (#31640) · fe241fd0
  Committed by zlsh80826 on Mar 18, 2021
```
* trt gather converter

* add trt gather unit_test
```
  fe241fd0
- 【Paddle.Fleet】Fix one ps gradient clip (#31664) · 09482dde
  Committed by Chengmo on Mar 18, 2021
```
* fix one ps gradient clip
```
  09482dde
17 Mar, 2021 1 commit
- support NHWC for temporal_shift op (#31642) · 7f50bb7e
  Committed by Zhang Ting on Mar 17, 2021
  
  7f50bb7e
16 Mar, 2021 2 commits
- Extend unittest time of (#31570) · 9c624b16
  Committed by gongweibao on Mar 16, 2021
  
  9c624b16
- [ROCM] fix softmax_with_cross_entropy_op, test=develop (#31629) · da10c5cf
  Committed by ronnywang on Mar 16, 2021
  
  da10c5cf
15 Mar, 2021 1 commit
- DataLoader support dict str (#31481) · a32e8bf1
  Committed by Kaipeng Deng on Mar 15, 2021
```
* add dict/str/list support for DataLoader. test=develop
```
  a32e8bf1
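
The DataLoader commit above adds support for dict, str, and list fields in samples. A rough sketch of how a batching (collate) step can preserve such nested structure, written in plain Python. The `collate` function here is a hypothetical illustration and is not Paddle's default collate function; a real loader would stack numeric leaves into tensors rather than lists.

```python
def collate(samples):
    """Merge a list of samples into one batch, preserving nested structure."""
    first = samples[0]
    if isinstance(first, dict):
        # Batch each key independently.
        return {key: collate([s[key] for s in samples]) for key in first}
    if isinstance(first, (list, tuple)):
        # Transpose: build one batch per field position.
        return [collate(list(field)) for field in zip(*samples)]
    # Leaves (numbers, strings, ...) are gathered into a plain list.
    return list(samples)


batch = collate([
    {"img": [1, 2], "name": "a"},
    {"img": [3, 4], "name": "b"},
])
```

Strings are deliberately treated as leaves: they are iterable in Python, so checking for list and tuple explicitly avoids splitting them into characters.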
12 Mar, 2021 1 commit
- Trt elementwise plugin serialize (#31587) · 50ac7dbf
  Committed by Shang Zhizhou on Mar 12, 2021
```
* add serialize unittest

* fix element_op trt plugin serialize bug
```
  50ac7dbf
