提交 · db5fd2a1925cbfa710c998364818f1d191f088a4 · PaddlePaddle / Paddle

08 9月, 2021 16 次提交

W
multiply supports bool · db5fd2a1
由 will-jl944 提交于 9月 08, 2021
```
multiply supports bool  
```
db5fd2a1
W

[hybrid] check pipeline persist var which changed in forward and used in backward (#35453) · a2dbb0c2
由 WangXi 提交于 9月 08, 2021

a2dbb0c2
L
add clip_by_norm fp16 kernel (#35446) · 7aa4d879
由 Leo Chen 提交于 9月 08, 2021
```
* add clip_by_norm fp16 kernel

* add ut
```
7aa4d879

由 Shang Zhizhou 提交于 9月 08, 2021

* update slice plugin

* add test

* fix code style

* fix trt6

* update test

* fix test

* add timeout

* update trt version

* update cmake

28abd5d8

Intergrate GLOOParallelContext to support Multi-CPU Core for Dygraph DataParallel (#35154) · 51cc73f0

由 xiongkun 提交于 9月 08, 2021

* can pass the fake test

* add files

* modify cmake to pass windows-ci

* for ci pass

* WITH_GLOO=ON

* for pass coverage test

* add cpuonly testcase

* add

* disable nccl when compile with cuda

* change python version in cpuonly

* add backend argument

* add required gpu

* add required:gpu

51cc73f0

G

fix bug (#35482) · e133d8ef
由 Guoxia Wang 提交于 9月 08, 2021

e133d8ef
Z
Fix scatter_nd_add and gather bug (#35544) · 3c457a38
由 Zeng Jinle 提交于 9月 08, 2021
```
* fix scatter_add_nd and gather bug

* fix gather compile error
```
3c457a38

Enable program passes on Fleet APIs (#34955) · 5f369881

由 Zeng Jinle 提交于 9月 08, 2021

* add fleet api for program pass

* turn on apply pass for CI test

* fix disable fuse_all_optimizer bug

* try to test ci

* fix CI

* fill unspecified op role

* fix fuse_allreduce

* add ut to improve coverage

* remove useless change

* improve c++ coverage

* follow some comments

* test ir pass pipeline

* update doc

* reduce ut time again

5f369881

fix the bug of layer_norm when batch_size=1 (#35480) · ad5f7494

由 zhangkaihuo 提交于 9月 08, 2021

The bug is that access to mean and var is incorrect, and the array will be out of bounds: the shape of mean and var is [batch_size], and the range of thread idx is 0~feature_size, so mean[idx] and var[idx] is incorrect.

When batch_size=1, the correct access is mean[0] and var[0], and a unit test with batch_size=1 is added.

ad5f7494

C

Add FP16 PRelu (#35532) · 4e62af80
由 cc 提交于 9月 08, 2021

4e62af80
L
add checkers for auto parallel apis (#35486) · 39540b0e
由 lilong12 提交于 9月 08, 2021
```
* update, test=develop
```
39540b0e

merge CMakeList.txt manual (#35378) · c4a3e8b4

由 feng_shuai 提交于 9月 08, 2021

* merge CMakeList.txt manual

* add platform for changethreadnum

* repair some bugs according to make error

* do nothing just flush CI

* forget change thread num

* add inplace_atol param for check_output_with_place

* Windows

* std:min and std::max should be change because of windows

c4a3e8b4

L
[NPU] release gil before op run (#35370) · db6242e9
由 Leo Chen 提交于 9月 08, 2021
```
* release gil before op run

* support npu grad test

* fix op_test
```
db6242e9
Z

Add op define extra for norm and frobenius norm op. (#35329) · 3dab2e20
由 Zhong Hui 提交于 9月 08, 2021

3dab2e20

add the matmul v2 grad kernel · b3787d1b

由 wawltor 提交于 9月 08, 2021

* add the matmul v2 grad kernel

* relief the test case time

* update the test case for the matmul double grad

* remove the unsed code for the matmul double grad

* update the test case for the double grad matmul

* remove the unused code in dot

b3787d1b

W

[NPU] add get_float_status op and refine NPU check_nan_inf (#35274) · c727ec4a
由 WangXi 提交于 9月 08, 2021

c727ec4a

07 9月, 2021 9 次提交

W
add conv op check for illegal input or attributes (#35337) · 8307b0cb
由 wangxinxin08 提交于 9月 07, 2021
```
* add conv op check for illegal input or attributes
```
8307b0cb
Q
[NPU] update batch norm op, test=develop (#35223) · cc6d2b07
由 Qi Li 提交于 9月 07, 2021
```
* [NPU] update batch norm op, test=develop

* add NHWC support for bn, test=develop
```
cc6d2b07

[NPU] Add norm_grad kernel (#35237) · cf408949

由 furnace 提交于 9月 07, 2021

* [NPU] fix for test_norm_op_npu

* [NPU] add norm_grad

* [NPU] add CheckAxis for axis

* [NPU] delete debug codes

* norm can not use L2Normalize, norm_grad can use L2NormalizeGrad

* [NPU] delete useless codes

* [NPU] optimize norm_grad OpMaker

* Update python import path

cf408949

[NPU] log_softmax_grad, test=develop (#35484) · e928274c

由 Qi Li 提交于 9月 07, 2021

* [NPU] log_softmax_grad, test=develop

* remove debug files, test=develop

* update lookup_table_v2 for CANN 5.0.x, test=develop

e928274c

J
Fix for reshape2 oneDNN op (#35455) · 36cdb6e2
由 jakpiase 提交于 9月 07, 2021
```
* fix for reshape2

* added reviewers sugestions
```
36cdb6e2
A
Fix DryRun unittest failed from test_standalon_executor.py (#35433) · 071e8156
由 Aurelius84 提交于 9月 07, 2021
```
* fix commit

* Open unittest

* fix unittest on Windows

* fix constructor
```
071e8156

[Dy2Stat]Open test_resnet_amp on Windows (#35323) · 3c8eeb5d

由 Aurelius84 提交于 9月 07, 2021

* open test_resnet_amp on Windows

* disable on Windows CPU CI for timeout

* disable on Windows CPU CI for timeout

* fix code style

3c8eeb5d

W
transfer the static.accurcay to v2 op (#35494) · 2b1efc35
由 wawltor 提交于 9月 07, 2021
```
* transfer the static.accurcay to v2 api

* remove the unused code
```
2b1efc35

[HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is in… (#35394) · 28b64075

由 xiayanming 提交于 9月 07, 2021

* [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid

* [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid

* [HIP] fix op not support AMD GPU bug

28b64075

06 9月, 2021 6 次提交
- W
  support double in deformable conv (#35330) · 266fcbe0
  由 wangguanzhong 提交于 9月 06, 2021
```
* support double in deformable conv

* add double for dcn v2
```
  266fcbe0
- W
  Add grad grad for AvgPool2D (#35388) · 97798f9a
  由 Wei Shengyu 提交于 9月 06, 2021
```
* add pool2d grad grad

* dbg

* add unittest

* update format

* add more unittests

* dbg
```
  97798f9a
- D
  add kernel, stride check (#35106) · 13bbb6b6
  由 Double_V 提交于 9月 06, 2021
```
* add kernel, stride check

* add unitest for param out of range

* delete max limit check
```
  13bbb6b6
- H
  [NPU]add depthwise_conv_npu_grad op (#35374) · 4bea0ff1
  由 heliqi 提交于 9月 06, 2021
```
* add depthwise_conv_npu_grad op

* add depthwise_conv_npu_grad op

* add depthwise_conv_npu_grad op

* add NHWC test case
```
  4bea0ff1
- W
  support numpy dtype and polish code of list index. (#35404) · 60c5adaa
  由 WeiXin 提交于 9月 06, 2021
```
* support numpy dtype and polish code of list index.

* polish code.
```
  60c5adaa
- W
  
  update trt ut. (#35458) · 18934c53
  由 Wilber 提交于 9月 06, 2021
  
  18934c53
05 9月, 2021 1 次提交
- F
  [WIP] paddle.where api add broadcast, when x_shape == y_shape, and x_shape != cond_shape (#35092) · ffc3d364
  由 furnace 提交于 9月 05, 2021
```
* where op add broadcast, when x_shape == y_shape, and x_shape != cond_shape

* add static api tests, and delete debug codes
```
  ffc3d364
04 9月, 2021 1 次提交
- W
  
  update inference trt ut framework (#35418) · e8772486
  由 Wilber 提交于 9月 04, 2021
  
  e8772486
03 9月, 2021 7 次提交
- A
  
  disable test_standalone_executor temporarily (#35436) · e8a88164
  由 Aurelius84 提交于 9月 03, 2021
  
  e8a88164
- L
  support lodtensorarray for send/recv (#35279) · b6adfd97
  由 lilong12 提交于 9月 03, 2021
```
* support lodtensorarray
```
  b6adfd97
- Z
  [NPU] Add huber_loss op (#34826) · 4e67cd17
  由 zhulei 提交于 9月 03, 2021
```
* [NPU] Add huber_loss op

* [NPU] Add huber_loss op

* [NPU] Add huber_loss p[

* [NPU] Add huber_loss
```
  4e67cd17
- Q
  [NPU] add int64_t kernels for YoloV3, test=develop (#35045) · f014e301
  由 Qi Li 提交于 9月 03, 2021
```
* [NPU] add int64 kernels, test=develop

* update ci scripts to be able to trun WITH_ASCEND_INT64 on, test=develop
```
  f014e301
- J
  Add AsExtra for transpose, lstm, gru (#35317) · f13dcfb1
  由 Jack Zhou 提交于 9月 03, 2021
```
* Add AsExtra for transpose

* add AsExtra for lstm op

* add AsExtra for gru
```
  f13dcfb1
- F
  [iscan] bugfix: DLTP-33615 / DLTP-33953 / DLTP-33968 / DLTP-34166 (#35383) · b333dac0
  由 Fan Zhang 提交于 9月 03, 2021
```
* [iscan] bugfix

* test_standalone_executor modify
```
  b333dac0
- H
  [NPU]add conv2d_transpose npu op (#35232) · a9dfebb9
  由 heliqi 提交于 9月 03, 2021
```
* add conv2d_transpose npu op

* CopyRight 2020 to 2021

* add fp32

* delete repeat test case

* delete repeat test case

* fix paddle.NPUPlace
```
  a9dfebb9

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功