Commits · da51baf22262e52b1a8822ee646ffba5bad1b6a3 · PaddlePaddle / Paddle

23 Aug, 2022 1 commit

[CustomDevice] add profiler apis (#45130) · da51baf2

Committed by ronnywang on Aug 23, 2022

* [CustomDevice] add profiler apis

* migrate CalculateEstOccupancy into cuda_tracer

* update

* add ut

da51baf2

22 Aug, 2022 4 commits
- [Eager] some python c api use final state (#45221) · d2ef888b
  Committed by wanghuancoder on Aug 22, 2022
```
some python c api use final state
```
  d2ef888b
- rename the member function of SparseTensor (#45291) · 016b94c2
  Committed by zhangkaihuo on Aug 22, 2022
  
  016b94c2
- fix infershape in compile time (#45156) · ed57237e
  Committed by shangliang Xu on Aug 22, 2022
  
  ed57237e
- [CustomDevice] fix custom ccl (#45276) · 307ad60d
  Committed by ronnywang on Aug 22, 2022
  
  307ad60d
19 Aug, 2022 4 commits
- call final_state method in inplace APIs (#42968) · 7c1e7e46
  Committed by pangyoki on Aug 19, 2022
```
* add forward inplace final state api

* fix bug

* fix reshape

* fix coverage

* add inplace info for erfinv, lerp, put_along_axis

* fix put_along_axis infer_meta

* fix format

* update yaml

* fix
```
  7c1e7e46
- polish REGISTER_OPERATOR parameter of fill_any (#45263) · 1c4134f6
  Committed by HongyuJia on Aug 19, 2022
  
  1c4134f6
- Trt groupnorm dynamic plugin (#44911) · 1aa6adb1
  Committed by Wang Bojun on Aug 19, 2022
```
* add group_norm dynamic plugin
```
  1aa6adb1
- [CustomDevice] support scalar (#45244) · dc331231
  Committed by Aganlengzi on Aug 19, 2022
  
  dc331231
18 Aug, 2022 6 commits

[phi] Transfer fluid trilinear_interp_v2 to phi trilinear_interp (add yaml) (#45145) · 6150fade

Committed by HongyuJia on Aug 18, 2022

* transfer trilinear op to phi, change name from trilinear_interp_v2 to trilinear_interp

* reserve linear_interp param

* change testcase scale if-branch

* testcase test_imperative_case

* fix trilinear testcase

* import paddle in test_trilinear_interp_v2

6150fade

[OpAttr]Squeeze axes support Tensor (#45189) · c93451f4
Committed by Aurelius84 on Aug 18, 2022
```
* [OpAttr]Squeeze axes support Tensor

* add support_tensor

* fix unittest

* fix coverage
```
c93451f4

change to async mode for xpu multi-card training in static graph mode, test=kunlun (#45024) · 41bdf41d

Committed by zhangxiaoci on Aug 18, 2022

* change to async mode for xpu multi-card training in static graph mode

* minor bugfix

* irrelevant. move to another pr

* move change to other pr

* fix stream issue

* fix 'stream not meet with current context' error

* fix branch diverge, test=kunlun

41bdf41d

sync_batch_norm_backword_yaml (#45218) · 133f608f
Committed by wanghuancoder on Aug 18, 2022

133f608f

[phi] Transfer fluid bilinear_interp_v2 to phi bilinear_interp (add yaml) (#45140) · 2c2137bb

Committed by HongyuJia on Aug 18, 2022

* transfer bilinear op to phi, change name from bilinear_interp_v2 to bilinear_interp

* reserve linear_interp param

* fix cross device import

2c2137bb

support selected_rows kernel for multiply in dygraph (#45217) · bcbb7a97
Committed by zyfncg on Aug 18, 2022

bcbb7a97

17 Aug, 2022 4 commits
- Reuse addKernel to replace TensorAdd (#45161) · 0e3b49d4
  Committed by Leo Chen on Aug 17, 2022
```
* use addKernel

* fix compile

* remove elementwiseAddto

* add return

* fix custom place
```
  0e3b49d4
- add instance norm op for xpu (#45097) · 216d25ac
  Committed by ykkk2333 on Aug 17, 2022
```
* xpu unittest grad compute supports more types, *test=kunlun

* add instance norm xpu, *test=kunlun
```
  216d25ac
- [phi] Transfer fluid bicubic_interp_v2 to phi bicubic_interp (add yaml) (#45151) · f4da2d4d
  Committed by HongyuJia on Aug 17, 2022
```
* transfer bicubic_interp op to phi, change name from bicubic_interp_v2 to bicubic_interp

* test final_state_bicubic_interp api

* testcase match imperative case
```
  f4da2d4d
- Fix squared_l2_norm wrong stream bug (#45174) · 951010a2
  Committed by sneaxiy on Aug 17, 2022
```
* fix squared_l2_norm bug

* update buffer.h
```
  951010a2
16 Aug, 2022 7 commits

[Phi] Move amp ops into phi (#45079) · b4f67757

Committed by Chen Weihang on Aug 16, 2022

* move check finite and unscale kernel into phi

* move infershape into phi

* move update_loss_scaling kernel into phi

* remove original kernels

* move update loss scaling infershape into phi

* add header for xpu and npu

* solve coverage failed

* fix npu test failed

* remove mutable data in cu file

* fix new executor failed

* add valid check for meta tensor output

b4f67757

[geometric]Add paddle.geometric.send_uv API (#44848) · 88724a53

Committed by Siming Dai on Aug 16, 2022

* initial commit

* fix op maker bug

* fix mul grad bug

* add unittest

* fix add grad bug, add cpu kernel

* add paddle.geometric.message_passing

* add paddle.geometric.send_uv api, add unittest

* add fp16 judgement

* fix file typo, move compute_type to message_op

* add impl file

* fix unittest timeout time

* add review revise

88724a53

[Eager] Forword only add dygraph func (#45153) · 933db9d4

Committed by Weilong Wu on Aug 16, 2022

* [Eager draft] forward_only interface migrate to autograd_api

* strings api add dygraph forward function

* rm useless comments

* draft version for check CI

* fix ci

* forward-only no need compute_require_grad and pass stop_gradient, rm useless comments

* polish yaml and using CPUPlace = phi::CPUPlace

* rm useless comments

* polish yaml and update some test case

* rm useless funcs

* polish eager_gen code

* polish code

933db9d4

transfer nearest_interp op to phi, change name from nearest_interp_v2 to nearest_interp (#45148) · 6452ab3b
Committed by HongyuJia on Aug 16, 2022

6452ab3b
Use base visit in cpu kernel (#45062) · ab583173
Committed by zhangkaihuo on Aug 16, 2022

ab583173
[Ops] Support more dtype for gather kernel (#45142) · 0b4268a6
Committed by Siming Dai on Aug 16, 2022

0b4268a6
support momentum op auto generation (#45163) · 642f6df9
Committed by Charles-hit on Aug 16, 2022

642f6df9

15 Aug, 2022 4 commits
- support adamw generation (#45149) · 1353761a
  Committed by Charles-hit on Aug 15, 2022
  
  1353761a
- [phi] change op name linear_interp_v2 to linear_interp (#45128) · 6de3bdb3
  Committed by HongyuJia on Aug 15, 2022
```
* change name linear_interp_v2 to linear_interp

* fix deprecated_op_names

* deprecated_op_names add linear_interp_grad
```
  6de3bdb3
- [Eager] fix sync batch norm to inplace (#45028) · c75b091b
  Committed by wanghuancoder on Aug 15, 2022
```
* fix sync batch norm to inplace
```
  c75b091b
- Fix compile error of windows platform(atomicAdd in grid_sample_grad_kernel) (#45131) · 05f7d0c5
  Committed by duanyanhui on Aug 15, 2022
```
* fix compile error
```
  05f7d0c5
12 Aug, 2022 9 commits

fix nccl comm in sync_bn (#45100) · 1e965756
Committed by LiYuRio on Aug 12, 2022

1e965756

Offload calculations from matmul op to fuse pass (#44941) · acb78ea2

Committed by Sławomir Siwek on Aug 12, 2022

* remove v2_transpose_reshape

* matmul_transpose_reshape

* reshape_transpose_matmul

* Add int8 support for matmulV2

* restore ut

* adjust old ut

* restore parallel UT rules

* remove mkldnn code from base ops

* move enforces to pass

* remove duplicated functions

* delete duplicated enforces

* feedback from review

* add comments to variables

* enable eltwise support

* dynamic attribute

* remove fusepass tests from op test

* remove fuse pass cases from op test

* revert introduction of dynamic attributes

* style
Co-authored-by: Nwozna <joanna.wozna@intel.com>

acb78ea2

[phi] Transfer linear_interp_v2 yaml to phi (#45072) · c737232f

Committed by HongyuJia on Aug 12, 2022

* support optional<vector<Tensor>> in yaml and eager

* delete useless comments in eager_gen.py

* fix api_base.py support optional<vector<TTensor>>

* python_c_gen.py support optional<vector<tensor>>

* transfer linear_interp_v2 yaml from fluid to phi

* fix op_test typo error

* change linear_interp_v2 testcase

* fix args in final_state_linear_interp_v2

* fix zeropad2d typo. test=document_fix

c737232f

transfer memcpy_h2d from fluid to phi (#44932) · 7bc57d35

Committed by kangguangli on Aug 12, 2022

* transfer memcpy_h2d from fluid to phi

* use UnchangedInferMeta instead

* restore test_standalone_executor

* add newline to fix codestyle check

* rename pt -> phi

* simplify logic and add check

* make the comment more clear

* remove useless comment

* refine code

7bc57d35

Remove some custom_impl api (#45066) · adb61b7b

Committed by zyfncg on Aug 12, 2022

* remove some custom_impl api and make them generated by yaml completely

* delete useless code

* fix adamw bug

* fix infermeta

* revert adamw

* polish code

* fix bug

adb61b7b

refix index resize in multiclassnms3 (#45095) · 49e2a4d8
Committed by zhiboniu on Aug 12, 2022

49e2a4d8
enhance grid_sampler to support 3d input (#45015) · 1773fbba
Committed by duanyanhui on Aug 12, 2022
```
* enhance grid_sampler to support 3d input
```
1773fbba
fix extra output of kernels for inference (#45048) · 1cb883da
Committed by zyfncg on Aug 12, 2022

1cb883da

[geometric]Add paddle.geometric.send_ue_recv API (#43174) · 615b15a3

Committed by Siming Dai on Aug 12, 2022

* add init file

* add op definition and infermeta

* add kernel definition funcs

* add broadcast infer shape

* add gpu forward kernel

* delete SUB and DIV

* add x_grad

* add template

* add e_grad for min and max

* fix small bug

* temp commit

* temp commit

* add e_grad for sum and mean

* fix some compile bug

* fix compile bugs

* fix compile problem

* add sum forward unittest

* fix broadcast error, add kernel sig, register e_grad, change unit test

* fix grad

* add temp grad fix

* temp commit

* add min max unittest

* add max, min unittest, fix mul bug

* add cpu forward sum and mean

* add forward min max, fix mean unittest

* add cpu backward min max

* fix code-style

* add backward sum mean

* fix rocm ci

* set unittest timeout

* fix bug of x broadcast to e, gpu grad

* fix bug of x broadcast to e, cpu grad

* rename BOOST_GET_CONST macro

* fix rocm ci

* mv graph_send_e_recv to graph_send_ue_recv

* move out_size to IntArray

* add eager op test

* fix max pool type bug, add unittest for api

* revise api doc

* add fp16 for atomic min and max, add unittest

* add unittest

* add fp16 support for graph_send_recv

* fix unittest fp16 bug

* change OutSizeTensor to Out_size

* move E to Y

* add copyright, fix comment

* review code

* fix thread block size

* fix thread block size

* change api attribute name: pool_type to reduce_op, compute_type to message_op

* change api attribute name, move pool_type to reduce_op, move compute_type to message_op

615b15a3

11 Aug, 2022 1 commit
- make affine_grid_op support 5d input_dim on cpu and gpu (#45012) · 7812522c
  Committed by carryyu on Aug 11, 2022
```
* make affine_grid_op support 5d_input on cpu and gpu
```
  7812522c

PaddlePaddle / Paddle synced successfully about 1 year ago