Commits · c040bbd7b3c4df056ef2107982d0fbd8489dfa2f · 机器未来 / Paddle

16 Mar, 2022 · 8 commits

[Phi] Migrate multiplex, qr, tril_triu op kernel to phi (#40007) · dce87e3d

Committed by caozhou on Mar 16, 2022

* migrate multiplex op kernel

* migrate qr cpu kernel

* migrate tril_triu op kernel

* fix multiplex kernel

* add kernel sig

* fix dependence and bug

* fix multiplex error

* fix npu include error

* fix conflict

* fix conflict and delete tril_triu

* fix date and multiplex input

* adapt header file order

* fix header file include

* fix conflict

* delete cholesky_solve_op.h

* delete triangular_solve_op.h

dce87e3d

transfer cumprod and kldiv_loss infershape to phi (#40575) · 6d205516
Committed by xiongkun on Mar 16, 2022

6d205516

move isclose infershape (#40595) · c7637700
Committed by Chen Weihang on Mar 16, 2022

c7637700

[Phi] Move grid sample op kernel into phi (#40585) · 8fd20b5b

Committed by Chen Weihang on Mar 16, 2022

* add grid sample phi kernel

* add grid sample phi kernel and remove original kernel

* replace mutable_data by alloc

8fd20b5b

[MLU] support amp O1 of mlu (#40461) · ad81f22c
Committed by qipengh on Mar 16, 2022

ad81f22c

Fixed issue with default-valued attributes (#40368) · f748b433
Committed by Zhanlue Yang on Mar 16, 2022

f748b433

move gather infershape (#40594) · 59e5c49f
Committed by Chen Weihang on Mar 16, 2022

59e5c49f

[Auto Parallel] Add the support for the auto completion of while_op (#39939) · ec6b8fbd

Committed by Yulong Ao on Mar 16, 2022

* [Auto Parallel] Support the auto completion of while_op

* [Auto Parallel] Improve the completion algorithms

* [Auto Parallel] Fix bugs for ernie inference

* [Auto Parallel] Remove attrs which cannot be pickled

* [Auto Parallel] make the dims_mappings of LodTensorArray vars empty

* [Auto Parallel] Fix bugs for the ernie inference in the pipeline parallel

* [Auto Parallel] Remove unnecessary comments

* [Auto Parallel] Fix a bug of the CMakeLists

* [Auto Parallel] Use the newest APIs to write the unit test

* [Auto Parallel] Remove unnecessary statements

ec6b8fbd

15 Mar, 2022 · 21 commits

[Phi] Move determinant op kernel into phi (#40539) · a04a6bd5

Committed by Chen Weihang on Mar 15, 2022

* add determinant phi kernel

* remove original determinant op kernel

* add determinant grad phi kernel

* fix determinant test failed

* remove original determinant grad op kernel

a04a6bd5

[phi] modify the shape OP and move inferMeta of shape,matrix_pow,multi_dot (#40506) · 31729a62

Committed by Liu-xiandong on Mar 15, 2022

* [phi] move matrix_power op

* MatrixInverse fluid -> phi

* modify the CMake to fix compile bug

* delete useless comment

* mutable memory -> phi Alloc

* modify the include file

* modify the include file

* fix bug in CI compiler

* [phi]modify the shape OP and move inferMeta of shape,matrix_pow,multi_dot

* delete useless comment

* fix bug in CI

* modify after review

31729a62

add number count op (#39224) · 9bdee437

Committed by Roc on Mar 15, 2022

* add expert count op

add ut for expert_count

* update UT only for cuda

* fix for rocm

* update ut

* add moe module

* add expert count op

add ut for expert_count

* update UT only for cuda

* update ut

* add moe module

* make expert count private

* rename expert count op
Co-authored-by: hlygit66666 <2570058140@qq.com>

9bdee437

run python api in eager model and filter the out in argument list (#40523) · 4d886f75
Committed by xiongkun on Mar 15, 2022
```
* run python api in eager model and filter the out in argument list

* fix code
```
4d886f75

Fixed issues with generated scale operator (#40482) · 30417999
Committed by Zhanlue Yang on Mar 15, 2022
```
* Fixed issues with generated scale operator

* Fixed minor issues
```
30417999

[NPU] add AMP O1 support (#40362) · 69dd43d1
Committed by furnace on Mar 15, 2022
```
* [NPU] add AMP O1 support

* [NPU] fix NOTE and warnings
```
69dd43d1

[Phi] Move gather op kernel into phi (#40500) · 0c703fe7

Committed by Chen Weihang on Mar 15, 2022

* add phi gather kernel

* update year

* remove original gather opkernel

* add gather grad phi kernels

* remove origin gather grad kernel

* fix failed npu and xpu

* fix xpu compile failed

0c703fe7

oneDNN NHWC fixes (#40049) · dde9cec0

Committed by Jacek Czaja on Mar 15, 2022

* - Prototype of third solution

- fix

- compilation fixes

- fix

- fix

- fix

- fix

- compilation fix

- comment fix

- lint

update mkldnn conv_elementwise_add_fuse_pass ut

- NHWC changes to prelu

- alpha dims

- UT fix

- fix to UT

- lint

- Some fixes

- added to BWD of prelu NHWC support

- reverted removal of resetting cu_layout in clearing of caching

* - Small changes

* - compilation fix

* - fix

* - fix

* lint

* - fixes after internal review

* - compilation fix

* - lint

dde9cec0

add shard_id (#40261) · 6b7d4845
Committed by Thunderbrook on Mar 15, 2022
```
* shard_id

* format
```
6b7d4845

[phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass... · 64223620

Committed by xiongkun on Mar 15, 2022

[phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass the tests of these four kernels (#39770)

* transfer and pass the lgamma unittest

* merge and pass the test

* transfer kldiv_loss and kldiv_loss_grad; pass the unittest

* transfer the isclose and cumprod kernel

* change PT_REGISTER -> PD_REGISTER

* fix by code review

* fix by code review

* fix

* remove enforce include dependence from scalar

* fix

* fix by code review

* fix by code review

64223620

[Phi]move reduce_min/any/all kernel (#40374) · c46e661d

Committed by chentianyu03 on Mar 15, 2022

* add reduce_min kernel

* remove raw reduce_min kernel

* add reduce min

* add reduce any all impl

* add bool reduce Kernel

* remove raw any/all kernel

* add any all kernel

* rm comment

c46e661d

Added more profile signposts to dygraph (#40201) · 36db75b4

Committed by Zhanlue Yang on Mar 15, 2022

* Added more signposts to dygraph profiling

* Fixed minor issues

* Refactored signpost names

* Fixed typo

* Removed debug codes

* Fixed typo

* Adjusted signpost names

* Fixed issues from branch merge

36db75b4

Move one hot to phi (#39876) · 7701db37

Committed by hong on Mar 15, 2022

* move one hot to phi; test=develop

* fix bugs; test=develop

* fix bugs; test=develop

* add infer meta; test=develop

* fix bugs; test=develop

* resolve conflict

* resolve conflict

* fix bug;

* fix error; test=develop

* update; test=develop

* polish code; test=develop

* add one api in eager mode; test=develop

* add one hot test; test=develop

* remove useless code; test=develop

* fix bug; test=develop

* polish code; test=develop

* polish code; test=develop

7701db37

Fix truncated norm operator (#40287) · 0c333543
Committed by Chang Xu on Mar 15, 2022

0c333543

[Phi]Move Tanh/BRelu/LeakyRelu/ThresholdedRelu Kernels to Phi (#40385) · d7112180

Committed by YuanRisheng on Mar 15, 2022

* move activation op

* adjust code format

* fix compile bugs

* fix ci bugs

* code format adjust

* code format adjust2

* activate ci status

* modify according to comment

* move activation kernel

* revert relu6

* reduce add code

* perfect use_phi_functor

* completing func name

* fix bugs when run ci

* fix bugs when run infr

* modify infrt get kernel signature

d7112180

[MLU] add check_finite_and_unscale op for amp (#40458) · 42c7bb47
Committed by qipengh on Mar 15, 2022

42c7bb47

[Phi]Move searchsorted kernel to phi (#40520) · 85f8fd9b
Committed by Zhang Zheng on Mar 15, 2022

85f8fd9b

[Dygraph] Refactoring of reducer in DataParallel (#40389) · 1a32391c

Committed by Haohongxiang on Mar 15, 2022

* refactor reducer

* modify cmakelists

* solve conflicts

* rename group and update process_group

* fix bugs of ProcessGroupNCCL

* modify for CIs

* refactoring reducer

1a32391c

Remove pybind index error (#40538) · 47d764a3

Committed by zyfncg on Mar 15, 2022

* change the exception of getitem from pybind type to PADDLE_ENFORCE

* fix bug

* remove pybind::index_error exception

47d764a3

[Phi]Move kron kernel to phi (#40427) · f181d47f

Committed by Zhang Zheng on Mar 15, 2022

* first commit

* fix

* fix

* fix compile error

* fix

* fix complex

* fix

* fix

* fix npu

* fix

* modify according to comments

* fix

f181d47f

move allclose infershape (#40508) · 5d08a447
Committed by Chen Weihang on Mar 15, 2022

5d08a447

14 Mar, 2022 · 11 commits

[Phi]Add diag_v2 grad kernel (#40447) · e157f2af

Committed by Siming Dai on Mar 14, 2022

* Add diag grad kernel

* fix unittest case

* add float16, remove const &

* delete diag_grad in op_utils.h

e157f2af

[PHI] Move set_value_grad kernel from fluid to phi (#40478) · 3149e399
Committed by zyfncg on Mar 14, 2022
```
* move set_value_grad kernel from fluid to phi

* add unittest for passing coverage ci
```
3149e399

Add an elementwise + activation fusion pass. (#36541) · 3f219160

Committed by Tomasz Socha on Mar 14, 2022

* Add elementwise add and activation fuse pass

* Fix copy elision

* More flexible pattern detector

* More flexible fusion pass

* Update lists for pass

* Add support for Pow operator

* Add support for more activation types

* Style

* Rename fusion pass

* First version of tests

* Dirty version of pass

* Polished version

* Update pbtxt

* Style

* Update names

* Style

* Use PADDLE_ENFORCE_EQ

* Save error message to variable

* WO for error checks

* CR

* Static style check

* Add missing 'activation_scale' attribute

* Add relu6 and sigmoid activations

* Style

* Fix fuse list formatting

* Sync filenames for fuse pass files

* Fix cmake after move

* Fix registration

* Fix pass name in tests

* Add missing activations to checker

* WIPS

* Working mul op

* Working sub

* Working Add

* Remove pten includes

* Remove some forward declarations

* Remove Includes

* Fixes

* Remove default kernels

* Add check if post_ops attributes are available

* Style

* Code adjustment

* Register default kernels

* We have year 2022 not 2021...
Co-authored-by: jakpiase <jakpia21@gmail.com>
Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>

* Fast review fixes
Co-authored-by: jakpiase <jakpia21@gmail.com>
Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>

* Review Fix

* Rename one_dnn -> onednn

* Style after review

* Fast and dirty fix for quantization

* Update tests

* Style

* Fix mkldnn_quantizer config

* Add Joanna's suggestion.

* Check if operator is explicitly disabled on OneDNN

* Try to use unregistered attributes

* Style

* Test new framework

* FXI

* FXII

* Update test

* Style
Co-authored-by: jakpiase <jakpia21@gmail.com>
Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>

3f219160

[MLU] add merged_momentum mlu kernel (#40406) · 1f7b2516
Committed by fwenguang on Mar 14, 2022

1f7b2516

optimize group_norm op backward (#39944) · 5720537e

Committed by crystal on Mar 14, 2022

* optimize backward

* optimize group_norm backward

* Add vectorized code

* move assignment code

* merge function

* move code

* optimize code

* Modify function name

5720537e

Optimize bilinear_interp backward (#39423) · 9e1f762c

Committed by Lijunhui on Mar 14, 2022

* bilinear_bw init

* optimize code

* optimize

* optimize 2

* optimize functions

* modify func name

9e1f762c

[phi]migrate fmax,fmin kernel to phi (#40140) · bb801960
Committed by Xiaoxu Chen on Mar 14, 2022

bb801960

Support custom op and paddle.autograd.backward in eager (#40423) · 227fa408

Committed by Jiabin Yang on Mar 14, 2022

* eager, test=develop

* fix bug, test=develop

* eager, test=develop

* merge legacy to fluid

* eager, test=develop

* eager, test=develop

* Refactor TensorAdd func by template and remove gradient_accumulation in eager

* Remove needless target name

* eager, test=develop

* eager, test=develop

* Use overload instead of template

* Remove legacy code

* Remove legacy code

* selectedrows, test=develop

* Remove DataType test

* eager, test=develop

* eager, test=develop

* support gan, test=develop

* Using Tensor directly instead of using EagerTensor

* support gradient_accumulation

* make test_imperative_lod_tensor_to_selected_rows longer

* make test_imperative_lod_tensor_to_selected_rows longer

* refine code

* ptb, test=develop

* Rename all EagerTensor to Tensor

* Rename some EagerTensor to Tensor

* rename EagerTensor to EagerVariable

* eager, test=develop

* eager, test=develop

* eager, test=develop

* eager, test=develop

* add more test

* eager, test=develop

* Support copiable selected rows and merge develop

* save load, eager, test=develop

* save load, eager, test=develop

* refine, test=develop

* remove useless _set_value method

* refine, test=develop

* refine, test=develop

* revert static_runner, test=develop

* EagerTensor to Tensor, test=develop

* refine, test=develop

* refine, test=develop

* clear grad, test=develop

* merge, develop

* merge, develop

* merge, test=develop

* merge, test=develop

* Support quant and part of slice

* support legacy static save

* extend slim tests time

* remove imperative on inference

* remove imperative on inference

* merge develop

* fix typo

* fix typo

* split slice related code into 2 part for imperative and eager

* split slice from inference

* split slice from inference

* fix test_tensor_register_hook

* support custom op in eager mode

* fix inference deps error

* split eager utils from custom operator

* fix type match

* fix typo
Co-authored-by: Wang Huan <wanghuan29@baidu.com>
Co-authored-by: Weilong Wu <veyron_wu@163.com>
Co-authored-by: wanghuancoder <wanghuancoder@163.com>

227fa408

Optimize performance of log_softmax (#38992) · 250e254f

Committed by Zhang Zheng on Mar 14, 2022

* Optimize performance of log_softmax

* delete unity build

* modify to phi

* fix

* fixfixfixfix

* fix

* fix

* fix

* fix

* simplify

* fix

* fix enforce

250e254f

adjust params order for eager.Tensor._copy_to (#40449) · c6ec8b9f
Committed by 0x45f on Mar 14, 2022

c6ec8b9f

[KP] Add unittests for... · f269ca3f

Committed by Lijunhui on Mar 14, 2022

[KP] Add unittests for brelu,ceil,celu,elu,floor,hard_shrink,hard_sigmoid,log1p,logsigmoid,relu6,silu,soft_relu,softsign,swish (#40448)

* solve unexecuted UT

* add 24 activation op UT

* append swish&thresholded_relu to kpfirst_list

* rm thresholded_relu

f269ca3f
