提交 · 6fc15986f70f69fd0af581b01ffa63c26e14c95a · PaddlePaddle / Paddle

30 8月, 2022 9 次提交

W
[OpAttr]Adapt tensor axis for argmin/max (#45453) · 6fc15986
由 WangZhen 提交于 8月 30, 2022
```
* Adapt tensor axis for argmin/max

* Add UT

* Polish UT
```
6fc15986
P
[PHI] move layer_norm/layer_norm_grad xpu kernel to phi (#45524) · 871e3329
由 pangyoki 提交于 8月 30, 2022
```
* move layer_norm xpu kernel to phi, test=kunlun

* fix, test=kunlun
```
871e3329
W
[OpAttr]Adapt tensor axis for reduce_min/max/mean/sum/prod (#45078) · 32f42e94
由 WangZhen 提交于 8月 30, 2022
```
* [OpAttr]Adapt tensor axis for reduce_min/max/mean/sum/prod
```
32f42e94

Remove extra attribute in OpMaker (#44310) · fe321f9a

由 zyfncg 提交于 8月 30, 2022

* add runtime config in phi

* add runtime attr for op desc and op

* fix no proto error

* adjust opdesc set_attr impl

* try to remove conv_op extra attrs

* add init runtime attr map

* change extra header path

* fix runtime_attr

* fix trace_op

* fix bug of pass

* fix merge conflict

* fix dygraph attrs

* fix bug of pass

* fix dygraph bug

* fix unittest module

* delete extra attr default

* fix dropout kernel

* polish code

* fix extra output of instance_norm

* fix merge confilct

* fix op_desc bug

* add extra attr in yaml for conv3d_transpose

* don't remove extra input and output

* fix save_inference_model

* fix bug of batch_norm

* revert some change

* polish log

* polish code

* add code comment
Co-authored-by: NChen Weihang <chenweihang@baidu.com>

fe321f9a

H
fix reduce mean grad bug *test=kunlun (#45511) · a7c4facb
由 haosicheng 提交于 8月 30, 2022
```
fix missing keep_dim variable

    fix missing grad check in unittest

    add new test case
```
a7c4facb
A
[OpAttr]padding_value of Pad support Tensor type (#45514) · db235bf0
由 Aurelius84 提交于 8月 30, 2022
```
* [OpAttr]padding_value of Pad support Tensor type

* fix unittest

* fix unittest

* fix coverage
```
db235bf0
R

move cast XPU kernel to PHI,test=kunlun (#45534) · 9dad4f79
由 risemeup1 提交于 8月 30, 2022

9dad4f79

move gelu/gelu_grad/generate_proposals_v2 kernel to phi (#45471) · 8b24c795

由 Leo Chen 提交于 8月 30, 2022

* move xpu kernel to phi

* delete fluid file

* fix compile

* add guard, test=kunlun

* xpu set constant

* fix xpu error, test=kunlun

8b24c795

W

Adapt tensor num_samples for multinomial (#45522) · c857841e
由 WangZhen 提交于 8月 30, 2022

c857841e

29 8月, 2022 13 次提交

[new_exe] Dy2Static support new_executor (#44450) · aba1295b

由 zhangbo9674 提交于 8月 29, 2022

* add interpretercore

* refine backward program id

* add code

* refine program

* refine code

* create forward/backward_program by prog2graph2prog method

* test, do not care

* refine code

* refine code

* refine code

* test, do not care

* add interpretorcore

* add scope

* refine scope create method

* add jit for new_exe

* solve conflict

* delete unused code

* polish code

* polish code

* refine scope in inplace

* refine for datatransfer

* refine _rebuild_from_desc

* refine control eager deletion attr

* refine used_for_jit

* refine jit for infer

* op size0 use ori program

* polish code

* refine jit

* refine run_program_op ut

* refine inplace

* refine control

* refine graph helper

* refine control

* refine inplace

* refine buffer_share_inplace_pass

* polish code

* polish code

* refine usage for compilerProgram

* refine control

* test

* test core cache

* refine code

* refine io.py

* increase test_seq2seq timeout

* refine convert program

* refine interpretercore_cache release

* delete buildinplace

* refine partial_program && io

* refine code for io

* test

* test

* test

aba1295b

Q
[MLU] fix compile error, test=develop (#45499) · e10e26e7
由 Qi Li 提交于 8月 29, 2022
```
* [MLU] fix compile error, test=develop

* fix more compile error, test=develop
```
e10e26e7
Y
[PHI]Mv xpu elementwise add kernel to phi (#45473) · bb3e4e0c
由 YuanRisheng 提交于 8月 29, 2022
```
* mv elementwise add to xpu , test=kunlun

* fix ci bugs, test=kunlun

* fix ci bugs , test=kunlun
```
bb3e4e0c

[PHI] Migrate relu6 and abs kernels (#45397) · 632bc1f2

由 Sławomir Siwek 提交于 8月 29, 2022

* abs relu6 fwd

* abs bwd

* gaussian_random_kernel and mkldnn-onednn renaming

* scale kernel

* whitespace

* whitespace

* revert scale migration

* whitespaces

* revert changes to gaussian kernel

* whitespaces

632bc1f2

W
[XPU] migrate mul to phi (#45502) · 923594de
由 Weilong Wu 提交于 8月 29, 2022
```
* [XPU] migrate mul to phi;test=kunlun

* rm fluid mul xpu op;test=kunlun
```
923594de
C
Migrate assign xpu kernel into phi (#45467) · 0710f058
由 Chen Weihang 提交于 8月 29, 2022
```
* migrate assign xpu kernel, test=kunlun

* remove assign_value xpu, test=kunlun
```
0710f058
W
[Phi] gather gather_grad gather_nd gaussian_random xpu to Phi (#45465) · 60e1eccb
由 wanghuancoder 提交于 8月 29, 2022
```
* gather gather_grad gather_nd gaussian_random xpu to phi
```
60e1eccb
C

[MLU] optimize matmul_grad_v2 dy (B,M,K)*(K,N) for better performance (#45336) · 212b51ef
由 cambriconhsq 提交于 8月 29, 2022

212b51ef
A
[OpAttr]num_rows/num_colums of eye support Tensor type (#45427) · b93b710a
由 Aurelius84 提交于 8月 29, 2022
```
* [OpAttr]num_rows/num_colums of eye support Tensor type

* fix attr cast with long type
```
b93b710a
Z

Move expand_as_v2 XPU kernel to PHI, test=kunlun (#45474) · 4b749513
由 zhangbo9674 提交于 8月 29, 2022

4b749513
Z

move expand_v2 to phi, test=kunlun (#45469) · 23a79923
由 zhangbo9674 提交于 8月 29, 2022

23a79923

Move matmul_v2 kernel of xpu from fluid to phi (#45446) · de436f07

由 zyfncg 提交于 8月 29, 2022

* move matmul_v2 kernel of xpu from fluid to phi, test=kunlun

* fix complie bug, test=kunlun

* fix complie bug, test=kunlun

* fix complie bug, test=kunlun

de436f07

W

[XPU] migrate bce_loss to phi;test=kunlun (#45459) · d3ec3fe3
由 Weilong Wu 提交于 8月 29, 2022

d3ec3fe3

26 8月, 2022 8 次提交
- R
  
  Move conv2d_transpose_grad XPU kernel to PHI, test=kunlun (#45466) · a635a8a5
  由 Ruibiao Chen 提交于 8月 26, 2022
  
  a635a8a5
- R
  
  Move grid_sample XPU kernel to PHI, test=kunlun (#45425) · 2c89bccb
  由 Ruibiao Chen 提交于 8月 26, 2022
  
  2c89bccb
- R
  
  Move conv2d_transpose XPU kernel to PHI, test=kunlun (#45419) · 1f1a7835
  由 Ruibiao Chen 提交于 8月 26, 2022
  
  1f1a7835
- Z
  [Phi] Delete xpu kernel of fill_any_like and fill_constant in fluid (#45420) · 6ab80b64
  由 zyfncg 提交于 8月 26, 2022
```
* delete fill xpu op in fluid

* delete fill_constant header, test=kunlun

* fix npu header, test=kunlun
```
  6ab80b64
- H
  
  [XPU] add load_combine_op_xpu. test=kunlun (#45436) · 3055d71a
  由 houj04 提交于 8月 26, 2022
  
  3055d71a
- K
  Transfer transfer_layout from fluid to phi (#45261) · 985f2a4a
  由 kangguangli 提交于 8月 26, 2022
```
* remove fluid kernel and activate phi kernel

* fix parameter error

* transfer mkldnn part

* modify header file path

* fix compile error

* transfer special case

* fix lod setting and special case for layout setting

* add testcase and refine code
```
  985f2a4a
- H
  fix reduce mean grad bug *test=kunlun (#45401) · 2a992178
  由 haosicheng 提交于 8月 26, 2022
```
* add temporal shift and grad *test=kunlun

* fix reduce mean grad bug *test=kunlun
```
  2a992178
- X
  [ Dy2static ] select input fix and while_op memory bug fixed. (#45380) · 91298884
  由 xiongkun 提交于 8月 26, 2022
```
* while support for python container.
It is convenient to convert more dynamic graph codes into static graphs.

* cond support python container

* 1. make select_input output shape = input[1]
2. add warning in while_loop risky assign

* fix 2 problem in GPT export:
1. a bug in while_op no_need_copy_var, which causes gpu memory leakage
2. a bug in undefined_var where the stop_gradient should be False.

* change name by code review

* format
```
  91298884
25 8月, 2022 7 次提交

A
[OpAttr]axis of Reverse Support Tensor type (#45391) · 91110661
由 Aurelius84 提交于 8月 25, 2022
```
* [OpAttr]axis of Reverse Support Tensor type

* fix coverage

* fix unittest
```
91110661
A
[OpAttr]min/max of uniform_random support Tensor type (#45417) · c8955d0d
由 Aurelius84 提交于 8月 25, 2022
```
* [OpAttr]min/max of Uniform_rand support Tensor type

* fix typo
```
c8955d0d

Transfer memcpy d2h from fluid to phi (#45150) · 0d14e74a

由 kangguangli 提交于 8月 25, 2022

* transfer memcpy_d2h from fluid to phi

* refine arg check and add comment

* fix cannot fallback to phi kernel

* fix gpu_context host alloc when tensor size = 0

* add kernel for std::vector<DenseTensor> args

* fix bugs in MemcpyD2HMultiIOKernel

* remove useless header file

* polish format

* fix typo

* add testcase for cudapinned place

* refine check condition in test

* polish error message

* polish error message

* remove header in fluid  directory

* merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation

* fix code style check

0d14e74a

R
[NPU] add run_program_op_npu (#45349) · 64afa638
由 ronnywang 提交于 8月 25, 2022
```
* [NPU] add run_program_op_npu

* add run_program_op_npu ut
```
64afa638

optimize conv algo cache (#41891) · 1cd7e68b

由 hong 提交于 8月 25, 2022

* optimizer conv alog speed

* code polish

* remove useless code

* fix compile error

* fix cpu compile error

* not use cudnn alog t

* add search cache max number

* polish code

* fix cache test bug

* add groups data format to conv args

* fix cache test bug

* fix cudnn_deterministic bug

* fix test switch auto tune bug

* fix test swith autotune bug;

* fix conv cache bug

* fix cache test error

* fix cache test bug

* fix windows mac compile error

* fix workspace search error

* update cudnn cache

* fix cache test bug; test=develop

* fix autotune swith test error

* polish code

* oplish code

1cd7e68b

R

[triu_indices] add triu_indices_op (#45168) · a410c397
由 Rayman 提交于 8月 25, 2022

a410c397
U

fix roi_align_op_npu to pass the unittest (#45310) · 256bf6ff
由 USTCKAY 提交于 8月 25, 2022

256bf6ff

24 8月, 2022 3 次提交

make tensor_util contains no cuda code (#45256) · 78916a7a

由 Leo Chen 提交于 8月 24, 2022

* make tensor_util contains no cuda code

* refine isfinite

* revert ut

* move isfinite function to its op

* fix test

* fix compile

* std::isnan is not defined for int type on windows

* fix windows compile

* fix fp16

* fix rocm compile

* revert gradient node

78916a7a

W

Adapt tensor axis for cumsum (#45372) · 7f49b9ba
由 WangZhen 提交于 8月 24, 2022

7f49b9ba

Support fp16 of adam operator in xpu environment (#45292) · a012d426

由 mengqingchun02 提交于 8月 24, 2022

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* support fp16 of adam operator in xpu environment. test=kunlun

* support fp16 of adam operator in xpu environment. test=kunlun

* support fp16 of adam operator in xpu environment. test=kunlun

a012d426

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功