提交 · f275bf6d5a4aa8d70babddbec0b48826b8bcb41e · 机器未来 / Paddle

11 4月, 2022 1 次提交
- L
  [cherry-pick 2.3]fix bug when TruncatedNormal cannot fall back in cpu (#41595) · f275bf6d
  由 Liu-xiandong 提交于 4月 11, 2022
```
fix bug when TruncatedNormal cannot fall back in cpu
```
  f275bf6d
05 4月, 2022 1 次提交

Implement AutoTuneStatus class for Kernel Auto Tune (#41218) · b0f8000e

由 Zhang Ting 提交于 4月 05, 2022

* switch autotune

* implement AutoTuneCache

* implement AutoTuneCache class

* add pybind api

* add dygraph test

* support static mode and eager mode and improve unittests

* rename the SwitchAutoTune Class and improve tests

* improve AutoTuneStatus and reduce the cost of tests

b0f8000e

02 4月, 2022 1 次提交
- L
  [KP] fix bug in phi static graph mode (#41269) · d0f46aac
  由 Liu-xiandong 提交于 4月 02, 2022
```
* [KP] fix bug in phi static graph mode

* modify the useless code
```
  d0f46aac
01 4月, 2022 3 次提交

[Phi] Move softmax with cross entropy kernel into phi (#40832) · e6ec98fe

由 Chen Weihang 提交于 4月 01, 2022

* add cross_entropy_with_softmax phi kernel

* remove softmax_with_cross_entropy kernel

* add softmax_with_cross_entropy grad kernel

* remove original op kernel

* refine cross entropy impl

* fix pointer error

* revert kernel cu change

* fix xpu failed

* fix cinn failed

* fix npu failed

* add forward sig

* add check_nan_inf for pt kernel

* remove repeat cmake item

* fix unittest error

e6ec98fe

[Phi]Interploatd kernels into phi (#40855) · d65a7a46

由 chentianyu03 提交于 4月 01, 2022

* add interploate cpu kernel

* fix nullptr bug

* add interpolate gpu kernel

* fix unit test error

* remove raw kernels

* add cuda kernel impl

* add infermeta

* recover accidentally deleted kernels in interpolate op

* fix grad x_grad name error

* remove interpolate_v2_op.h

* rm unused codes

* fix xpu build error

* fix build error

* fix namespace error

* add register header for nup

* fix infermeta error

* modify by review

* add the missing args in test_trt_convert_nearest_interp_v2

d65a7a46

L
[KP] fix bug in activation xpu kp kernel (#41219) · 705776ca
由 Liu-xiandong 提交于 4月 01, 2022
```
* fix bug in activation xpu kp kernel

* delete useless comment
```
705776ca

31 3月, 2022 2 次提交

Z
[Phi] Rename ScalarArray to IntArray (#40975) · e559fe41
由 zyfncg 提交于 3月 31, 2022
```
* rename scalar_array to int_array

* update cmake

* fix conflict

* remove useless log
```
e559fe41

[KP] fix bug in phi kp (#41069) · ac5548a2

由 Liu-xiandong 提交于 3月 31, 2022

* [KP] fix bug in phi kp

* delete useless comment

* update

* update

* choose the xpu kp kernel in phi

ac5548a2

28 3月, 2022 1 次提交
- C
  [Phi]Remove in_dtype, out_dtype in redcue grad (#40906) · 0c024cb9
  由 chentianyu03 提交于 3月 28, 2022
```
* remove in_dtype, out_dtype in redcue grad

* set the dtype and layout in noneedbufferInputs func
```
  0c024cb9
24 3月, 2022 1 次提交

[Phi] Move mul op kernel into phi (#40833) · 1b491818

由 Chen Weihang 提交于 3月 24, 2022

* add mul phi kernel

* remove mul op kernel

* remove original mul grad op

* fix cinn test

* fix dygraph test failed

1b491818

23 3月, 2022 2 次提交

[Eager] Slice (#40587) · b07d239c

由 wanghuancoder 提交于 3月 23, 2022

* fix some slice bug, test=develop

* eager slice, test=develop

* eager slice, test=develop

* refine, test=develop

* refine, test=develop

* fix bug, test=develop

* refine, test=develop

* rename function name, test=develop

b07d239c

Z
Removed redundant use of declarations.h (#40703) · 2a1b4c07
由 Zhanlue Yang 提交于 3月 23, 2022
```
* Removed redundant use of declarations.h

* Fixed minor bug
```
2a1b4c07

21 3月, 2022 1 次提交
- Z
  
  [MLU]add compiler options and remove redundant code (#40705) · a6f77fdf
  由 zn 提交于 3月 21, 2022
  
  a6f77fdf
18 3月, 2022 1 次提交

[Phi]Move hierarchical_sigmoid kernel to phi (#40553) · 64a7cbd3

由 Zhang Zheng 提交于 3月 18, 2022

* first commit

* fix compile error

* support std::vector<std::srting>

* fix

* fix op support on GPU by chenweihang

* pass test

* infershape

* add set_dtype

* fix order

* fix

* unify the impl of dt and sr

* fix

64a7cbd3

17 3月, 2022 2 次提交

[Phi] Move assign kernel into phi (#40022) · 1904572a

由 Chen Weihang 提交于 3月 17, 2022

* move assign kernel init commit

* change vec<tensor> to vec<tensor*>

* support tensor array

* support api declare

* fix test_list failed

* fix npu and xpu failed

* fix infrt failed

* remove assign array size in operator

* move assign sr header into sr dir

* add infermeta for assign

* test op success

* fix test_list failed

* fix kunlun failed

* add set host allocator in tests

* support tensor array in arg ctx

* open set layout in share_meta

* fix meta tensor layout error

* fix test failed

1904572a

Q

[ROCm] fix bfloat16 support, test=develop (#40401) · da558f0e
由 Qi Li 提交于 3月 17, 2022

da558f0e

16 3月, 2022 3 次提交
- Z
  [Phi] Move roi_align grad kernel and infershape from fuild to phi (#40556) · 3898080e
  由 zyfncg 提交于 3月 16, 2022
```
* move roi_align_grad kernel

* move roi_align grad kernel and infershape to phi

* remove roi_align infershape
```
  3898080e
- L
  [KP]fix bug that cannot fallback to CPU normally in XPU KP (#40576) · 603f8425
  由 Liu-xiandong 提交于 3月 16, 2022
```
* [kp]fix bug that cannot fallback to CPU normally in XPU KP

* fix bug in static graph
```
  603f8425
- Q
  
  [MLU] support amp O1 of mlu (#40461) · ad81f22c
  由 qipengh 提交于 3月 16, 2022
  
  ad81f22c
15 3月, 2022 4 次提交

X
run python api in eager model and filter the out in argument list (#40523) · 4d886f75
由 xiongkun 提交于 3月 15, 2022
```
* run python api in eager model and filter the out in argument list

* fix code
```
4d886f75
F
[NPU] add AMP O1 support (#40362) · 69dd43d1
由 furnace 提交于 3月 15, 2022
```
* [NPU] add AMP O1 support

* [NPU] fix NOTE and warnings
```
69dd43d1

Added more profile signposts to dygraph (#40201) · 36db75b4

由 Zhanlue Yang 提交于 3月 15, 2022

* Added more signposts to dygraph profiling

* Fixed minor issues

* Refactored signpost names

* Fixed typo

* Removed debug codes

* Fixed typo

* Adjusted signpost names

* Fixed issues from branch merge

36db75b4

Move one hot to phi (#39876) · 7701db37

由 hong 提交于 3月 15, 2022

* move one hot to phi; test=develop

* fix bugs; test=develop

* fix bugs; test=develop

* add infer meta; test=develop

* fix bugs; test=develop

* resolve confilct

* resolve confilct

* fix bug;

* fix error; test=develop

* update; test=develop

* polish code; test=develop

* add one api in eager mode; test=develop

* add one hot test; test=develop

* remove use less code; test=develop

* fix bug; test=develop

* polish code; test=develop

* polish code; test=develop

7701db37

14 3月, 2022 1 次提交
- F
  Move Pool OPs to phi (#40208) · 88ec08a7
  由 From00 提交于 3月 14, 2022
```
* Move Pool OPs to phi

* Fix CI error

* Fix conflicts
```
  88ec08a7
12 3月, 2022 1 次提交
- C
  Fix eager benchmark test failed (#40468) · 70f83f1d
  由 Chen Weihang 提交于 3月 12, 2022
```
* fix eager benchmark test failed

* fix test_tracer failed
```
  70f83f1d
11 3月, 2022 2 次提交

[Phi] Remove needless deps in unittests (#40256) · 89ed57e2

由 Chen Weihang 提交于 3月 11, 2022

* remove needless deps in unittests

* add gpu marco

* fix other unittests

* fix kernel name error

* fix test_prepare_op

* fix failed dygraph unittests

* fix gpu failed tests

* fix cinn test failed

* fix cinn test failed

* fix dropout tests

89ed57e2

[Phi] Reduce grad (#40263) · f452ad5c

由 chentianyu03 提交于 3月 11, 2022

* add reduce_sum grad kernel

* add reduce_grad

* modify reduce grad

* update reduce grad functions

* fix build error

* add argument mapping

* move cast input after grad

* add dims.size=1 cpu reduce_sum grad compute method

* update reduce grad GPU

* remove raw reduce_sum_grad kernel

* modify header files

* add namespace funcs for reduce_grad_funcstions

f452ad5c

10 3月, 2022 1 次提交
- L
  
  solve unexecuted UT (#40397) · bd4dc3be
  由 Lijunhui 提交于 3月 10, 2022
  
  bd4dc3be
09 3月, 2022 1 次提交
- Z
  [PHI] Move set_value kernel to phi (#40195) · cd28cddb
  由 zyfncg 提交于 3月 09, 2022
```
* save code

* fix bug of set_value

* add coverage test
```
  cd28cddb
08 3月, 2022 1 次提交

[Phi]Move Relu/Cos/Sin/Tan/Acos/Asin/Atan/Sinh/Cosh/Asinh/Acosh/Atanh kernels... · 975f99ab

由 YuanRisheng 提交于 3月 08, 2022

[Phi]Move Relu/Cos/Sin/Tan/Acos/Asin/Atan/Sinh/Cosh/Asinh/Acosh/Atanh kernels in Activation to Phi (#40175)

* move activation op

* adjust code format

* fix compile bugs

* fix ci bugs

* code format adjust

* code format adjust2

* activate ci status

* modify according to comment

975f99ab

07 3月, 2022 2 次提交
- X
  [OpTest] Support to test paddle API end-to-end for check_eager (#40169) · 79a32715
  由 xiongkun 提交于 3月 07, 2022
```
* add python api test in TestOp

* test_python_api if self.python_api is set

* fix code by CR
```
  79a32715
- Z
  [MLU]support reduce tensors on mlu (#40000) · b4eb413e
  由 zn 提交于 3月 07, 2022
```
* [MLU]support reduce tensors on mlu

* [MLU]fix compiler options
```
  b4eb413e
03 3月, 2022 2 次提交
- R
  
  [CustomRuntime] migrate CustomRuntime into phi (#39908) · b4665d23
  由 ronnywang 提交于 3月 03, 2022
  
  b4665d23
- C
  
  fix output var may be nullptr and cause segment fault bug (#40079) · 2ffa6436
  由 chentianyu03 提交于 3月 03, 2022
  
  2ffa6436
02 3月, 2022 3 次提交
- L
  add check for backward hook (#40041) · 1980e33a
  由 Leo Chen 提交于 3月 02, 2022
```
* add check for backward hook

* refine ut
```
  1980e33a
- Q
  [MLU] adapt matmul op (#39727) · b4d931e8
  由 qipengh 提交于 3月 02, 2022
```
* [MLU] adapt matmul op

* [MLU] fix phi namespace
```
  b4d931e8
- F
  
  Fix bug for prepare phi OP (#40033) · fb0cadfd
  由 From00 提交于 3月 02, 2022
  
  fb0cadfd
01 3月, 2022 1 次提交

[PHI] Remove reseting dtype, layout and allocation by arg_def for outputs in executor (#39781) · 4fbcf6f4

由 zyfncg 提交于 3月 01, 2022

* remove SetAllocationForOutputTenosr

* add place param for copy kernel

* recover SetAllocationForOutputTenosr

* polish code

* fix empty_dev api bug

* remove reseting dtype and layout for output in executor

* fix merge bug

* [Phi] Add ClearHolder when re-alloc on new place in DeviceContext

* fix hostAlloc

* remove setting output allocation

* remove full_kernel_impl.h

* fix bug of xpu full_like
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>

4fbcf6f4

28 2月, 2022 2 次提交

Trace level env (#39926) · f335d9e1

由 liutiexing 提交于 2月 28, 2022

* add align for WorkQueue

* add spinlock

* merge develop

* merge

* Add EventsWaiter

* Revert "Add EventsWaiter"

This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.

* Add host_trace_level env variable

* Revert "Optimize perf of softmax_with_cross_entropy (#39553)"

This reverts commit bbe5228c.
Co-authored-by: Nliutiexing <liutiexing@google.com>
Co-authored-by: NZzSean <18818272991@163.com>

f335d9e1

[Pten->Phi PR4] Rename pten in funcs to phi (#39961) · eb42dd52

由 Chen Weihang 提交于 2月 28, 2022

* rename pten_utils to phi_utils

* rename pten_utils target

* rename Pten to Phi

* replace pten with phi

* resolve conflict

eb42dd52

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致