Commits · 1927aff98b8a4feb3e408750b883e3a250d2d7dd · 机器未来 / Paddle

15 Apr, 2022 · 7 commits

[Phi]Reduce kernels into multiply files (#41747) · 1927aff9

Committed by chentianyu03 on Apr 15, 2022

* split reduce_kernel

* rm reduce_kernel in cmake

* split reduce_grad kernels

* fix cmake build error

* format code

* fix standalone_executor_test error

[DoubleGrad] Enabled test_imperative_star_gan_with_gradient_penalty.py under eager mode (#41730) · 27f28e82

Committed by Zhanlue Yang on Apr 15, 2022

* [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad

* Fixed elementwise issue

* Addressed CI failures

* [DoubleGrad] Enabled test_imperative_triple_grad test cases under eager_mode

* [DoubleGrad] Enabled test_autograd_functional_dynamic.py under eager mode

* Enabled more test cases

* [DoubleGrad] Enabled test_imperative_star_gan_with_gradient_penalty.py under eager mode

* Adjusted test_imperative_star_gan_with_gradient_penalty.py

Add eager string tensor (#41039) · a22b68b8

Committed by Jack Zhou on Apr 15, 2022

* Add core.eager.StringTensor __init__ which pyarray args can be passed

* Add the numpy method of core.eager.StringTensor

* revert tensor.to_string modification

* Add ToPyObject for core.eager.StringTensor

* Add debug string for core.eager.StringTensor

* Remove place args of core.eager.StringTensor temporarily

* Fix check string_tensor error

* remove dtype of core.eager.StringTensor

* add core.eager.StringTensor unittest

* remove pstring from VarDesc

* Add InitStringTensorWithStringTensor

* Remove to_string modification

* Remove zero_copy arg from StringTensor creator

polish tensor depreacted method warning (#41807) · e83e44c7
Committed by Chen Weihang on Apr 15, 2022

Add API: Sparse Convolution3D (#41434) · 1665594d
Committed by zhangkaihuo on Apr 15, 2022

Change cuDNN Conv kernel for auto tune feature (#41313) · 35acfeda

Committed by limingshu on Apr 15, 2022

* change cudnn helper for auto-tune

* Add FLAGS_use_autotune to set the global status of autotune and change the order of choosing algorithm.

* Fix the bug in calculating and printing current step cache hit rate.

* Improve the autotune cache and fix unittest.

* Change the key from AlgorithmType to int64_t.

* Fix unittest for cpu-only env.

* change ChooseAlgoByWorkspace for heuristic mode
Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>

fix batch norm memory issue (#41717) · 42abcc08

Committed by hong on Apr 15, 2022

* try to fix batch norm memory issue

* fix batch norm memory alloc bug

* polish some code

14 Apr, 2022 · 9 commits

[KP] Add registry for elementwise_add/max/min/sub/div/mul/floordiv on XPU2 with KP lib (#41494) · fbe2c311
Committed by Lijunhui on Apr 14, 2022
```
* regist elementwise_xxx
```

remove all is initialized using (#41766) · 4733fe60
Committed by Chen Weihang on Apr 14, 2022

[Phi] Support construct Scalar by using Non-CPU Tensor (#41765) · 54ccc308

Committed by YuanRisheng on Apr 14, 2022

* support construct scalar using non-cpu tensor

* fix bugs when run unittest

* fix compile bugs

* fix bugs when run ci

* fix compile bugs

* fix bugs when move copy

* perfect unit test

* perfect unittest

* update according to comment

* add target dependency

* deal with conflict

* fix bugs when run unit test

* fix unit test bugs

Fix to #38693 (minimal UT) (#41026) · d0f3296b

Committed by Jacek Czaja on Apr 14, 2022

* Add UT

- Added missed data_layout

- Added missing conversions

- NDHWC added

- NDHWC support in data_transform

- another fix

- condddate change

- fix

- fix

- fix

- fix

- fix

- fix

- fix to hack

- compilation fix

- fix to automatic merge

* - reduced UT

* - fix

* - lint

* - fix to lint

[PHI] Support some c++ api in paddle namespace (#41778) · b075dee8
Committed by zyfncg on Apr 14, 2022
```
* support some c++ api in paddle namespace

* change c++ api namespace in custom op
```

[DoubleGrad] Enabled test_autograd_functional_dynamic.py under eager mode (#41668) · ad9585b6

Committed by Zhanlue Yang on Apr 14, 2022

* [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad

* Fixed elementwise issue

* Addressed CI failures

* [DoubleGrad] Enabled test_imperative_triple_grad test cases under eager_mode

* [DoubleGrad] Enabled test_autograd_functional_dynamic.py under eager mode

* Enabled more test cases

* Fixed performance issues

* Fixed minor issue

[Op]Fix adam/adamw beta1_pow/beta2_pow place while copying (#41732) · 4ae76d21
Committed by Aurelius84 on Apr 14, 2022

remove inner_place using (#41768) · de2a3942
Committed by Chen Weihang on Apr 14, 2022

[Phi] Unify dispatch macros to visit (#41653) · 2ab986ae
Committed by Chen Weihang on Apr 14, 2022
```
* change dispatch to visit

* resolve conflict
```

13 Apr, 2022 · 10 commits
- Add yaml and unittest for SGD (#41485) · 6d1e03a2
  Committed by zyfncg on Apr 13, 2022
```
* add sgd yaml

* change python api

* open eager mode in sgd

* fix bug
```
- Revert "[Phi] Support construct Scalar by using Non-CPU Tensosr (#41528)" (#41740) · 404c4a6b
  Committed by tianshuo78520a on Apr 13, 2022
```
This reverts commit fe214af2.
```
- Add expand equal all yaml (#41540) · e53d1837
  Committed by hong on Apr 13, 2022
```
* add expand, poisson

* add poisson grad

* add expand equal_all poisson triangular solve yaml
```
- [Phi] Support construct Scalar by using Non-CPU Tensosr (#41528) · fe214af2
  Committed by YuanRisheng on Apr 13, 2022
```
* support construct scalar using non-cpu tensor

* fix bugs when run unittest

* fix compile bugs

* fix bugs when run ci

* fix compile bugs

* fix bugs when move copy

* perfect unit test

* perfect unittest

* update according to comment

* add target dependency
```
- Fix problem of infermeta with vector output (#41646) · b2390438
  Committed by zyfncg on Apr 13, 2022
```
* remove stack_grad infershape

* fix bug of output with null

* fix bug
```
- Add yaml for deformable_conv and deformable_conv_v1 OPs (#41644) · b8968390
  Committed by Ruibiao Chen on Apr 13, 2022
```
* Add yaml for deformable_conv and deformable_conv_v1 OPs

* Add UT

* Add to skipped_phi_api list for infrt
```
- [Yaml]Add adam yaml (#41561) · 8cbf79a3
  Committed by chentianyu03 on Apr 13, 2022
```
* add adam yaml

* add adam final_state api

* add adam_impl
```
- Add kernel sparse_mask_helper; sparse_coo_tensor_grad (#41586) · acd08a9b
  Committed by zhangkaihuo on Apr 13, 2022
- [CustomDevice] move member variable to dense_tensor.h (#41702) · d84934da
  Committed by Aganlengzi on Apr 13, 2022
- [Phi&CustomOp] Remove deprecated enum PlaceType for custom op & add warning (#41647) · 78ef1071
  Committed by Chen Weihang on Apr 13, 2022
```
* remove old custom op placetype

* replace dist  placetype using

* add with gpu macro

* fix mutable_data error

* fix set value error

* add comment
```
12 Apr, 2022 · 13 commits

Add layer norm yaml (#41589) · 43d5cca6

Committed by hong on Apr 12, 2022

* add layer norm infermeta

* add layer norm yaml

* polish layer norm infer meta

* add layer norm to black list

exchange assign and assign_raw kernel name (#41625) · de49a4b7
Committed by chentianyu03 on Apr 12, 2022
```
* exchange assign and assign_raw kernel name

* fix register error
```

fix depthwise dnn bug (#41666) · 7b627dd8
Committed by hong on Apr 12, 2022

[KP] Add Logical/compare/bitwise registry & UT (#40802) · 3749198e

Committed by Lijunhui on Apr 12, 2022

* init commit no push

* collect compile errors

* bitwise UT

* fix compile problem

* cancel comments

* restore miss deletion

* fix compilation

* fix UT

* NO stash in multiple branch at the same times

* fix error

* combine .cu from gpu and kps

* replace gpu by kps

* fix by Chen-weihang

* Revert "Fix kps compile error in Junhui logic compare bitwise"

* fix backend test

* rm comments
Co-authored-by: Chen Weihang <chenweihang@baidu.com>

add fp16 kernel to clip_grad (#41661) · 137dc3e3
Committed by wuyefeilin on Apr 12, 2022

[DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad (#41451) · 0b4c3c20
Committed by Zhanlue Yang on Apr 12, 2022
```
* [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad

* Fixed elementwise issue

* Addressed CI failures
```

[CustomOp] Add context pool unittests (#41085) · 59ec9599

Committed by Chen Weihang on Apr 12, 2022

* add context pool unittests

* fix timeout

* polish details

* change option pos

* add dll decl for windows

* fix pre-commit error

* move dll_decl and export DeviceContext

* replace lost dll_decl.h

[Phi]Fix beta1_pow/beta2_pow/skip_update data transform problem in adam/adamw (#41641) · fdeec8c3
Committed by Aurelius84 on Apr 12, 2022
```
* [Phi]Fix beta1_pow/beta2_pow/skip_update data transform problem in adam/adamw

* fix xpu unittest failed
```
fix_paddle_numel_check (#41607) · 51cae7f7
Committed by JingZhuangzhuang on Apr 12, 2022
```
* fix_paddle_numel_check

* fix_paddle_numel_check
```

add a inner loop for index_select_grad_init() in index_select op when dealing... · bc01242b

Committed by FlyingQianMM on Apr 12, 2022

add a inner loop for index_select_grad_init() in index_select op when dealing with large-shape data (#41563)

* replace for with CUDA_KERNEL_LOOP for index_select_grad_init() in index_select op

* use CUDA_KERNEL_LOOP_TYPE

* fix code style

* replace index_select_grad_init with SetConstant

[CustomOp]Add new method for custom double grad (#41538) · 362c7c80

Committed by Chen Weihang on Apr 12, 2022

* add new method for custom double grad

* add tanh double grad unittest

* change year

* revert tensor init method

[Phi] Support setting size of vector<Tensor> for out in yaml (#41576) · dead24dd
Committed by zyfncg on Apr 12, 2022
```
* support setting vector out size in yaml

* support setting size of vector<tensor> for out in yaml
```

fix data transform problem for cudnn backend (#41622) · c055b50c
Committed by zyfncg on Apr 12, 2022

11 Apr, 2022 · 1 commit
- [Phi]Add multi_dot/maxout/multiplex op yaml (#41550) · 36d76840
  Committed by YuanRisheng on Apr 11, 2022
```
* add multi_dot,maxout,multiplex yaml

* add code converage
```
