Commits · eae4bf5bf12dae2794d15a46ff97541630475cf8 · 机器未来 / Paddle

Sep 07, 2021 · 19 commits
- Modify the elementwise op according to the kernel primitive API (#34456) · eae4bf5b
  Committed by niuliling123 on Sep 07, 2021
  
  eae4bf5b
- add as-extra for softplus/leaky_relu/softmax (#35493) · b211f02b
  Committed by Pei Yang on Sep 07, 2021
  
  b211f02b
- [NPU] update batch norm op, test=develop (#35223) · cc6d2b07
  Committed by Qi Li on Sep 07, 2021
```
* [NPU] update batch norm op, test=develop

* add NHWC support for bn, test=develop
```
  cc6d2b07
- fix trace op stack overflow (#35419) · d47a97db
  Committed by XiangGao on Sep 07, 2021
```
Co-authored-by: Nroot <root@bjyz-sys-gpu-kongming9.bjyz.baidu.com>
```
  d47a97db
- Add DPADDLE_WITH_CUDA for GCC (#35448) · cec36ea6
  Committed by Aurelius84 on Sep 07, 2021
```
* Add DPADDLE_WITH_CUDA for GCC

* polish code
```
  cec36ea6
- disable added ut check,test=document_fix (#35535) · f57a2404
  Committed by YUNSHEN XIE on Sep 07, 2021
  
  f57a2404
- [NPU] Add norm_grad kernel (#35237) · cf408949
  Committed by furnace on Sep 07, 2021
```
* [NPU] fix for test_norm_op_npu

* [NPU] add norm_grad

* [NPU] add CheckAxis for axis

* [NPU] delete debug codes

* norm can not use L2Normalize, norm_grad can use L2NormalizeGrad

* [NPU] delete useless codes

* [NPU] optimize norm_grad OpMaker

* Update python import path
```
  cf408949
- [NPU] log_softmax_grad, test=develop (#35484) · e928274c
  Committed by Qi Li on Sep 07, 2021
```
* [NPU] log_softmax_grad, test=develop

* remove debug files, test=develop

* update lookup_table_v2 for CANN 5.0.x, test=develop
```
  e928274c
- [oneDNN] Disable cache matmul v1 & refactoring (#35331) · e9ae8dd0
  Committed by Jacek Czaja on Sep 07, 2021
```
* - refactoring progressing

- Fix

- compilation fix

- another compilation fix

- refactoring

* - fix

* - compilation fix

* - compilation fix

* - missing set_format

* - compilation fix

* - reverted setting memory format

* - Brought back format

* - Fix

* - fixes after review

* CI rerun

* CI rerun
```
  e9ae8dd0
- Fix for reshape2 oneDNN op (#35455) · 36cdb6e2
  Committed by jakpiase on Sep 07, 2021
```
* fix for reshape2

* added reviewers' suggestions
```
  36cdb6e2
- fix int8 (#35504) · ed97be09
  Committed by ceci3 on Sep 07, 2021
  
  ed97be09
- operators/flatten_op.cc add AsExtra (#35471) · 0c71edc3
  Committed by dyning on Sep 07, 2021
```
* operators/flatten_op.cc add AsExtra

* operators/flatten_op.cc add AsExtra

* fix format
```
  0c71edc3
- add AsExtra in data_norm op (#35420) · 7907e241
  Committed by XiangGao on Sep 07, 2021
```
* add AsExtra in data_norm op

* pass data_layout from python to data_norm op

* fix data_layout in data_norm op
Co-authored-by: Nroot <root@bjyz-sys-gpu-kongming9.bjyz.baidu.com>
```
  7907e241
- Fix DryRun unittest failed from test_standalon_executor.py (#35433) · 071e8156
  Committed by Aurelius84 on Sep 07, 2021
```
* fix commit

* Open unittest

* fix unittest on Windows

* fix constructor
```
  071e8156
- merge from latest develop branch, test=document_fix (#34995) · 1445103b
  Committed by Sing_chan on Sep 07, 2021
  
  1445103b
- support test different infer_ut suite type (#35435) · 5bb12853
  Committed by Peihan on Sep 07, 2021
```
* notest,test=inference;support test different suite type

* notest,test=inference;fix script bugs

* notest,test=inference;fix count time issue

* test=document_fix; fix readme grammar
```
  5bb12853
- [Dy2Stat]Open test_resnet_amp on Windows (#35323) · 3c8eeb5d
  Committed by Aurelius84 on Sep 07, 2021
```
* open test_resnet_amp on Windows

* disable on Windows CPU CI for timeout

* disable on Windows CPU CI for timeout

* fix code style
```
  3c8eeb5d
- transfer the static.accurcay to v2 op (#35494) · 2b1efc35
  Committed by wawltor on Sep 07, 2021
```
* transfer the static.accurcay to v2 api

* remove the unused code
```
  2b1efc35
- [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is in… (#35394) · 28b64075
  Committed by xiayanming on Sep 07, 2021
```
* [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid

* [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid

* [HIP] fix op not support AMD GPU bug
```
  28b64075
Sep 06, 2021 · 14 commits

support double in deformable conv (#35330) · 266fcbe0
Committed by wangguanzhong on Sep 06, 2021
```
* support double in deformable conv

* add double for dcn v2
```
266fcbe0
Add the extra flag for the some ops (#35442) · 49797d85
Committed by wawltor on Sep 06, 2021
```
* Add the extra flag for the some ops

* fix the compile problem in matmul extra
```
49797d85

Add fusion_lstm INT8 PTQ (#35334) · 7ef04da6

Committed by joanna.wozna.intel on Sep 06, 2021

* Add fusion_lstm INT8 PTQ

* Correct mkldnn_cache_capacity and enable fc_lstm_fuse_pass only for this test

* Change mkldnn_cache_capacity

7ef04da6

Add grad grad for AvgPool2D (#35388) · 97798f9a
Committed by Wei Shengyu on Sep 06, 2021
```
* add pool2d grad grad

* dbg

* add unittest

* update format

* add more unittests

* dbg
```
97798f9a

transpose/slice/stride_slice/squeeze/unsqueeze op_def_enhance-1.0 (#35391) · 70a9b652

Committed by feng_shuai on Sep 06, 2021

* transpose/slice/stride_slice/squeeze/unsqueeze op_def_enhance-1.0

* delete infer_flags and decrease-axis

* delet infer_flags and decrea_axis

70a9b652

add pool2d pool3d extra() (#35393) · 295253a6

Committed by Double_V on Sep 06, 2021

* add pool2d pool3d extra()

* delete ceil_mode extra()

* delete ceil_mode extra()

* delete ceil_mode extra()

* add extra to use_mkldnn

295253a6

add kernel, stride check (#35106) · 13bbb6b6

Committed by Double_V on Sep 06, 2021

* add kernel, stride check

* add unit test for param out of range

* delete max limit check

13bbb6b6

Support Reset for DeviceEvent (#35443) · 8c73c1b5
Committed by Aurelius84 on Sep 06, 2021
```
* Support Reset for DeviceEvent

* fix code

* add more unittest
```
8c73c1b5
add AsExtra tag for conv transpose op (#35354) · c2f76b0a
Committed by wangxinxin08 on Sep 06, 2021
```
* add AsExtra tag for conv transpose op

* check the existence of use_cudnn before get this attribute
```
c2f76b0a

[NPU]add depthwise_conv_npu_grad op (#35374) · 4bea0ff1

Committed by heliqi on Sep 06, 2021

* add depthwise_conv_npu_grad op

* add depthwise_conv_npu_grad op

* add depthwise_conv_npu_grad op

* add NHWC test case

4bea0ff1

support numpy dtype and polish code of list index. (#35404) · 60c5adaa
Committed by WeiXin on Sep 06, 2021
```
* support numpy dtype and polish code of list index.

* polish code.
```
60c5adaa

replase pass with error exception (#35367) · 5675042d

Committed by Feng Xing on Sep 06, 2021

This PR raises an exception in the fused transformer Python interface.
The function bodies are not implemented yet (they will be implemented later).
Following zhiqiu's comment on the previous PR-35206 (already merged), it is better to raise an exception than to use "pass".

5675042d

Revert hccl check nan (#35438) · c3ad7775
Committed by Yuang Liu on Sep 06, 2021

c3ad7775
update trt ut. (#35458) · 18934c53
Committed by Wilber on Sep 06, 2021

18934c53

Sep 05, 2021 · 1 commit
- [WIP] paddle.where api add broadcast, when x_shape == y_shape, and x_shape != cond_shape (#35092) · ffc3d364
  Committed by furnace on Sep 05, 2021
```
* where op add broadcast, when x_shape == y_shape, and x_shape != cond_shape

* add static api tests, and delete debug codes
```
  ffc3d364
Sep 04, 2021 · 1 commit
- update inference trt ut framework (#35418) · e8772486
  Committed by Wilber on Sep 04, 2021
  
  e8772486
Sep 03, 2021 · 5 commits
- disable test_standalone_executor temporarily (#35436) · e8a88164
  Committed by Aurelius84 on Sep 03, 2021
  
  e8a88164
- modify gc logic, use new device_event (#35208) · 80c0cc97
  Committed by wanghuancoder on Sep 03, 2021
```
* modify gc logic, use new device_event, test=develop

* use GenerateDeviceEventFlag, test=develop

* refine, test=develop

* fix test_standalone_executor.py, test=develop

* refine, test=develop
```
  80c0cc97
- support lodtensorarray for send/recv (#35279) · b6adfd97
  Committed by lilong12 on Sep 03, 2021
```
* support lodtensorarray
```
  b6adfd97
- [NPU] Add huber_loss op (#34826) · 4e67cd17
  Committed by zhulei on Sep 03, 2021
```
* [NPU] Add huber_loss op

* [NPU] Add huber_loss op

* [NPU] Add huber_loss p[

* [NPU] Add huber_loss
```
  4e67cd17
- [NPU] add int64_t kernels for YoloV3, test=develop (#35045) · f014e301
  Committed by Qi Li on Sep 03, 2021
```
* [NPU] add int64 kernels, test=develop

* update ci scripts to be able to turn WITH_ASCEND_INT64 on, test=develop
```
  f014e301

机器未来 / Paddle · in sync with the fork's source project