提交 · ad5f749448f1caf9232f89b96b62a0f2b7a4c2ef · PaddlePaddle / Paddle

08 9月, 2021 12 次提交

fix the bug of layer_norm when batch_size=1 (#35480) · ad5f7494

由 zhangkaihuo 提交于 9月 08, 2021

The bug is that access to mean and var is incorrect, and the array will be out of bounds: the shape of mean and var is [batch_size], and the range of thread idx is 0~feature_size, so mean[idx] and var[idx] is incorrect.

When batch_size=1, the correct access is mean[0] and var[0], and a unit test with batch_size=1 is added.

ad5f7494

C

Add FP16 PRelu (#35532) · 4e62af80
由 cc 提交于 9月 08, 2021

4e62af80
L
hidden the auto parallel apis (#35385) · afd1b372
由 lilong12 提交于 9月 08, 2021
```
* update, test=develop
```
afd1b372
L
add checkers for auto parallel apis (#35486) · 39540b0e
由 lilong12 提交于 9月 08, 2021
```
* update, test=develop
```
39540b0e

merge CMakeList.txt manual (#35378) · c4a3e8b4

由 feng_shuai 提交于 9月 08, 2021

* merge CMakeList.txt manual

* add platform for changethreadnum

* repair some bugs according to make error

* do nothing just flush CI

* forget change thread num

* add inplace_atol param for check_output_with_place

* Windows

* std:min and std::max should be change because of windows

c4a3e8b4

L
support weight sharing for pipeline (#35351) · 5199c744
由 lilong12 提交于 9月 08, 2021
```
* support weight sharing
```
5199c744

add some file for jetson-op-test-CE (#35431) · 18a963a5

由 feng_shuai 提交于 9月 08, 2021

* add some file for jetson-op-test-CE

* change diff for some case

* rename op_test -> jetson_infer_op

* add some case because too large

18a963a5

L
[NPU] release gil before op run (#35370) · db6242e9
由 Leo Chen 提交于 9月 08, 2021
```
* release gil before op run

* support npu grad test

* fix op_test
```
db6242e9
Z

Add op define extra for norm and frobenius norm op. (#35329) · 3dab2e20
由 Zhong Hui 提交于 9月 08, 2021

3dab2e20

Work queue group (#35470) · a53460aa

由 liutiexing 提交于 9月 08, 2021

* Split Tracker and WorkQueue

* add WorkQueueGroup

* add unittest

* fix

* update

* update

* fix compile

a53460aa

add the matmul v2 grad kernel · b3787d1b

由 wawltor 提交于 9月 08, 2021

* add the matmul v2 grad kernel

* relief the test case time

* update the test case for the matmul double grad

* remove the unsed code for the matmul double grad

* update the test case for the double grad matmul

* remove the unused code in dot

b3787d1b

W

[NPU] add get_float_status op and refine NPU check_nan_inf (#35274) · c727ec4a
由 WangXi 提交于 9月 08, 2021

c727ec4a

07 9月, 2021 23 次提交
- Z
  Fix scatter_nd_add doc (#35542) · 1635c02b
  由 Zeng Jinle 提交于 9月 07, 2021
```
* fix scatter_nd_add doc, test=document_fix

* update
test=document_fix
```
  1635c02b
- X
  Add depth while cloning benchmark code (#35548) · 2b105211
  由 xiegegege 提交于 9月 07, 2021
```
* Add depth while cloning benchmark code,test=document_fix

* Add depth while cloning benchmark code, test=document_fix
```
  2b105211
- Y
  
  support multi-node (#35396) · c6e0cedc
  由 yaoxuefeng 提交于 9月 07, 2021
  
  c6e0cedc
- W
  add conv op check for illegal input or attributes (#35337) · 8307b0cb
  由 wangxinxin08 提交于 9月 07, 2021
```
* add conv op check for illegal input or attributes
```
  8307b0cb
- N
  
  Modify the elementwise op according to the kernel primitive API (#34456) · eae4bf5b
  由 niuliling123 提交于 9月 07, 2021
  
  eae4bf5b
- P
  
  add as-extra for softplus/leaky_relu/softmax (#35493) · b211f02b
  由 Pei Yang 提交于 9月 07, 2021
  
  b211f02b
- Q
  [NPU] update batch norm op, test=develop (#35223) · cc6d2b07
  由 Qi Li 提交于 9月 07, 2021
```
* [NPU] update batch norm op, test=develop

* add NHWC support for bn, test=develop
```
  cc6d2b07
- X
  fix trace op stack overflow (#35419) · d47a97db
  由 XiangGao 提交于 9月 07, 2021
```
Co-authored-by: Nroot <root@bjyz-sys-gpu-kongming9.bjyz.baidu.com>
```
  d47a97db
- A
  Add DPADDLE_WITH_CUDA for GCC (#35448) · cec36ea6
  由 Aurelius84 提交于 9月 07, 2021
```
* Add DPADDLE_WITH_CUDA for GCC

* polish code
```
  cec36ea6
- Y
  
  disable added ut check,test=document_fix (#35535) · f57a2404
  由 YUNSHEN XIE 提交于 9月 07, 2021
  
  f57a2404
- F
  [NPU] Add norm_grad kernel (#35237) · cf408949
  由 furnace 提交于 9月 07, 2021
```
* [NPU] fix for test_norm_op_npu

* [NPU] add norm_grad

* [NPU] add CheckAxis for axis

* [NPU] delete debug codes

* norm can not use L2Normalize, norm_grad can use L2NormalizeGrad

* [NPU] delete useless codes

* [NPU] optimize norm_grad OpMaker

* Update python import path
```
  cf408949
- Q
  [NPU] log_softmax_grad, test=develop (#35484) · e928274c
  由 Qi Li 提交于 9月 07, 2021
```
* [NPU] log_softmax_grad, test=develop

* remove debug files, test=develop

* update lookup_table_v2 for CANN 5.0.x, test=develop
```
  e928274c
- J
  [oneDNN] Disable cache matmul v1 & refactoring (#35331) · e9ae8dd0
  由 Jacek Czaja 提交于 9月 07, 2021
```
* - refactoring progressing

- Fix

- compilation fix

- another compilation fix

- refactoring

* - fix

* - compilation fix

* - compilation fix

* - missing set_format

* - compilation fix

* - reverted setting memeory format

* - Brought back format

* - Fix

* - fixes after review

* CI rerun

* CI rerun
```
  e9ae8dd0
- J
  Fix for reshape2 oneDNN op (#35455) · 36cdb6e2
  由 jakpiase 提交于 9月 07, 2021
```
* fix for reshape2

* added reviewers sugestions
```
  36cdb6e2
- C
  
  fix int8 (#35504) · ed97be09
  由 ceci3 提交于 9月 07, 2021
  
  ed97be09
- D
  operators/flatten_op.cc add AsExtra (#35471) · 0c71edc3
  由 dyning 提交于 9月 07, 2021
```
* operators/flatten_op.cc add AsExtra

* operators/flatten_op.cc add AsExtra

* fix format
```
  0c71edc3
- X
  add AsExtra in data_norm op (#35420) · 7907e241
  由 XiangGao 提交于 9月 07, 2021
```
* add AsExtra in data_norm op

* pass data_layout from python to data_norm op

* fix data_layout in data_norm op
Co-authored-by: Nroot <root@bjyz-sys-gpu-kongming9.bjyz.baidu.com>
```
  7907e241
- A
  Fix DryRun unittest failed from test_standalon_executor.py (#35433) · 071e8156
  由 Aurelius84 提交于 9月 07, 2021
```
* fix commit

* Open unittest

* fix unittest on Windows

* fix constructor
```
  071e8156
- S
  
  merge from latest develop branch, test=document_fix (#34995) · 1445103b
  由 Sing_chan 提交于 9月 07, 2021
  
  1445103b
- P
  support test different infer_ut suite type (#35435) · 5bb12853
  由 Peihan 提交于 9月 07, 2021
```
* notest,test=inference;support test different suite type

* notest,test=inference;fix script bugs

* notest,test=inference;fix count time issue

* test=document_fix; fix readme grammar
```
  5bb12853
- A
  [Dy2Stat]Open test_resnet_amp on Windows (#35323) · 3c8eeb5d
  由 Aurelius84 提交于 9月 07, 2021
```
* open test_resnet_amp on Windows

* disable on Windows CPU CI for timeout

* disable on Windows CPU CI for timeout

* fix code style
```
  3c8eeb5d
- W
  transfer the static.accurcay to v2 op (#35494) · 2b1efc35
  由 wawltor 提交于 9月 07, 2021
```
* transfer the static.accurcay to v2 api

* remove the unused code
```
  2b1efc35
- X
  [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is in… (#35394) · 28b64075
  由 xiayanming 提交于 9月 07, 2021
```
* [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid

* [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid

* [HIP] fix op not support AMD GPU bug
```
  28b64075
06 9月, 2021 5 次提交

W
support double in deformable conv (#35330) · 266fcbe0
由 wangguanzhong 提交于 9月 06, 2021
```
* support double in deformable conv

* add double for dcn v2
```
266fcbe0
W
Add the extra flag for the some ops (#35442) · 49797d85
由 wawltor 提交于 9月 06, 2021
```
* Add the extra flag for the some ops

* fix the compile problem in matmul extra
```
49797d85

Add fusion_lstm INT8 PTQ (#35334) · 7ef04da6

由 joanna.wozna.intel 提交于 9月 06, 2021

* Add fusion_lstm INT8 PTQ

* Correct mkldnn_cache_capacity and enable fc_lstm_fuse_pass only for this test

* Change mkldnn_cache_capacity

7ef04da6

W
Add grad grad for AvgPool2D (#35388) · 97798f9a
由 Wei Shengyu 提交于 9月 06, 2021
```
* add pool2d grad grad

* dbg

* add unittest

* update format

* add more unittests

* dbg
```
97798f9a

transpose/slice/stride_slice/squeeze/unsqueeze op_def_enhance-1.0 (#35391) · 70a9b652

由 feng_shuai 提交于 9月 06, 2021

* transpose/slice/stride_slice/squeeze/unsqueeze op_def_enhance-1.0

* delete infer_flags and decrease-axis

* delet infer_flags and decrea_axis

70a9b652

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功