提交 · 581d9b1b6230d08e2fecb62295b3c012924b57c3 · PaddlePaddle / Paddle

25 3月, 2023 1 次提交
- Z
  
  fix operator need_prepare_phi_data_ · 581d9b1b
  由 zhangjun 提交于 3月 25, 2023
  
  581d9b1b
10 3月, 2023 1 次提交

[New features]Add function node in phi_kernel for MKLDNN (#51073) · a0a6dc6a

由 HappyHeavyRain 提交于 3月 10, 2023

* Add function node in phi_kernel for MKLDNN

* fix the bug in 'BuildInferVarKernelContext'

* add infer_varkernel_utils.cc

* fix the bug:the first two parametes of 'BuildInferVarKernelContext' can't be template variable

* change the code according to first review

* change the code according to first review

* change the mode of paddle_build.sh

* change 'infer_var_kernel_fn_' to 'get_kerneltype_forvar_fn_'

* add the error information

* fix NotFound infomation warning

* fix NotFound infomation warning

* fix NotFound infomation warning

a0a6dc6a

09 3月, 2023 1 次提交

[PHI] Register custom kernel for all type of custom device (#51262) · 782454bd

由 zyfncg 提交于 3月 09, 2023

* register custom kernel for all type of custom device

* fix bug

* fix GetKernelInputArgDef

* fix amp bug

* fix TransToPhiPlace

* adapt interpreter_util

782454bd

06 3月, 2023 1 次提交
- R
  Remove InterpretercoreInferShapeContext (#51209) · 5c1eda19
  由 Ruibiao Chen 提交于 3月 06, 2023
```
* Remove InterpretercoreInferShapeContext

* Fix lod errors
```
  5c1eda19
27 2月, 2023 1 次提交
- C
  
  revert operator.cc (#50895) · ec814cf5
  由 csy0225 提交于 2月 27, 2023
  
  ec814cf5
24 2月, 2023 1 次提交
- N
  
  Fix KP operator Kernel selection error (#50178) · 6ef3f2ce
  由 niuliling123 提交于 2月 24, 2023
  
  6ef3f2ce
22 2月, 2023 1 次提交

Fix some typos. (#50429) · 93b2bf4b

由 Shuangchi He 提交于 2月 22, 2023

* Fix some typos.
Signed-off-by: Yulv-git <yulvchi@qq.com>

* pre-commit
Signed-off-by: Yulv-git <yulvchi@qq.com>

---------
Signed-off-by: Yulv-git <yulvchi@qq.com>

93b2bf4b

21 2月, 2023 2 次提交

D
[Custom Device] Add static custom back_list (#50666) · d79d5933
由 duanyanhui 提交于 2月 21, 2023
```
* add static custom back_list

* rm comments

* fix log

* fix comment
```
d79d5933

Optimize the ernie inference performance on xpu backend. (#50357) · b39afb13

由 csy0225 提交于 2月 21, 2023

* Optimize the ernie inference performance on xpu

* fix enable runtime cache logic

* when op's input shape has changed, should create a new runtime context

* fix

* set flag when input shape has changed

b39afb13

16 2月, 2023 2 次提交

Z

[XPU] fix dropout pass; add multi_encoder_xpu_fuse_pass & multi_encoder_xpu kernel (#50499) · c8aa6405
由 zhupengyang 提交于 2月 16, 2023

c8aa6405

[phi decoupling] remove variable.h in phi (#50407) · 905cefd4

由 Huang Jiyi 提交于 2月 16, 2023

* move variable_utils from phi_api_utils to fluid

* fix coment

* update include

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* update

* update

* fix CI-Windows-OpenBLAS

* fix bugs

* fix bugs

* fix bugs

* update include

* move variable_utils to phi_utils

* fix namespace

905cefd4

08 2月, 2023 1 次提交

[PHI]Unify Fluid and PHI kernel (#49328) · e92e3aab

由 YuanRisheng 提交于 2月 08, 2023

* unify_kernel

* fix compile bugs

* modify macro name

* perfect code according comment

* fix compile bugs

* fix compile bugs

* fix ci bugs

* fix ci bug

* fix ci bugs

* fix ci bugs

* modify code according comment

* rm conv_fusion_op

e92e3aab

17 1月, 2023 1 次提交

[PHI]Change feed_op to phi kernel (#49116) · f7f1dc03

由 YuanRisheng 提交于 1月 17, 2023

* change feed_op to phi kernel

* fix ci bugs

* fix build bugs

* fix ci bugs

* fix compile bugs

* fix ci bugs

* perfect code

* perfect comment code

* fix install bugs

* modify code according comment

* remove visitor in feed_op

* modify according comment

* perfect code according comment

* add infershape

* fix py3 bugs

* fix getexpected kernel type

* fix getexpected kernel type

* fix ci bugs

* add registry for custom device

* fix py3 bugs

* fix floating point error

* fix py3 test bugs

f7f1dc03

04 1月, 2023 1 次提交

[Unify KernelKey] change OpKernelType->KernelKey (#49138) · 4383494f

由 HongyuJia 提交于 1月 04, 2023

* execute use kernel_key first

* change OpKernelType->KernelKey

* fix py3 compile error, remove redundant header files

* fix build_strategy_test

* fix DataType::RAW

* fix custom_type test: operator_test.cc

* fix transform place

* fix backends_are_same_class

* try fix place TransDataDevice

* support all KernelKey

* fix TransformData

* fix place_are_same_class

* fix merge

* fix test_params_no_grad

* fix specific place of GetExpectedKernelType

* fix specific place of GetExpectedKernelType

* fix GetKernelTypeForVar

* fix dtype error

* fix fetch_v2

* change GetKernelTypeForVar

* fix interpreter

* fix typo error

* polish codes

* polish codes

* polish codes

* fix conflict

4383494f

03 1月, 2023 1 次提交
- A
  [OpAttr]Fix Ignore AttriteTensor in IndicateDataType bug in grad_op (#49472) · 5ac96468
  由 Aurelius84 提交于 1月 03, 2023
```
* [OpAttr]Fix Ignore AttriteTensor in IndicateDataType bug in grad_op

* add GetExpectedKernelType
```
  5ac96468
30 12月, 2022 3 次提交

H

fix possible bug (#49367) · 18f0ab86
由 HongyuJia 提交于 12月 30, 2022

18f0ab86

在文档中统一静态图模式与动态图模式的英文翻译 (#49170) · a186e60d

由 Sanbu 提交于 12月 30, 2022

* 1219

* temporarily change the num_diff_files limit, test=document_fix

* Revert "temporarily change the num_diff_files limit, test=document_fix"

This reverts commit 8e70f00ef468d2dad0e38b3da06295ed62990d20.

* for codestyle

* remove duplicate license

* `static mode` -> `static graph mode`

* Update hybrid_parallel_inference.py

* Update layer_function_generator.py

* Update manipulation.py

* reset
Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>

a186e60d

W
Fix default GetExpectedKernelType for ops supported tensor attrs (#49414) · 8a859554
由 WangZhen 提交于 12月 30, 2022
```
* Fix default GetExpectedKernelType for ops supported tensor attrs
```
8a859554

28 12月, 2022 1 次提交

[new-exec] Ahead-Of-Time choosing kernel (#48789) · 63d2d722

由 Leo Chen 提交于 12月 28, 2022

* add skip run

* alloc minimum memory

* skip check_size in Alloc

* skip check_size in Alloc

* skip check_size in Alloc

* fix cases when tensor is initialized or empty

* alloc empty output for place info

* add test

* increase timeout

* format code

* skip cpu

* add cudnn_deterministic

* fit for hostAlloc

* follow comments

* change check_size to fake_alloc

63d2d722

19 12月, 2022 1 次提交
- H
  
  simplify FallbackToCpu (#49124) · 7ffde4bc
  由 HongyuJia 提交于 12月 19, 2022
  
  7ffde4bc
12 12月, 2022 1 次提交

[PHI]Add new Tensor type and migrate save_combine kernel (#47856) · ecf892f0

由 YuanRisheng 提交于 12月 12, 2022

* add new tensor

* fix windows compile bugs

* fix ci bugs

* fix ci bugs

* fix ci bugs

* perfect according comment

* fix ci compile bugs

* add raw tensor

* fix ci bugs

* modify code by comment

* delete String

ecf892f0

09 12月, 2022 1 次提交
- L
  move share_buffer kernel to phi (#48858) · c2e77ba3
  由 Leo Chen 提交于 12月 09, 2022
```
* move share_buffer kernel to phi

* fix ut

* add source file

* fix window links
```
  c2e77ba3
08 12月, 2022 1 次提交
- W
  
  [Inference] Enable infer shape cache. (#48312) · f88713e1
  由 Wilber 提交于 12月 08, 2022
  
  f88713e1
06 12月, 2022 1 次提交
- Q
  add xpu_support op function (#48606) · 06b32b38
  由 QingshuChen 提交于 12月 06, 2022
```
*test=kunlun
```
  06b32b38
05 12月, 2022 1 次提交
- Y
  
  fix onednn bugs (#48714) · 35ebf2b4
  由 YuanRisheng 提交于 12月 05, 2022
  
  35ebf2b4
01 12月, 2022 1 次提交
- H
  [Fix Type] Fix typo error (#48391) · 47e7b7a5
  由 HongyuJia 提交于 12月 01, 2022
```
* fix typo error

* pass CI-coverage
```
  47e7b7a5
29 11月, 2022 1 次提交
- S
  
  [PHI decoupling] Move MKLDNN code (#48352) · fa051eec
  由 Sławomir Siwek 提交于 11月 29, 2022
  
  fa051eec
28 11月, 2022 1 次提交
- Y
  [BugFix]Fix OneDNN Kernels Bug when use pass (#48364) · df82fd35
  由 YuanRisheng 提交于 11月 28, 2022
```
* Fix onednn kernel bugs

* fix gpu bugs
```
  df82fd35
26 11月, 2022 1 次提交
- L
  fix jit input var not ready error (#48351) · ab6a3dad
  由 Leo Chen 提交于 11月 26, 2022
```
* hot fix

* fix compile

* merge develop

* follow comments
```
  ab6a3dad
25 11月, 2022 1 次提交

[PROFILER] add flops for Profiler (#47766) · 3d1981ad

由 Chitsing KUI 提交于 11月 25, 2022

* attr ready

* op ip ready

* start dynamic

* end2end ok

* input shape to map, stat by op

* layer wip

* first version ready

* fix proto depds

* fix profiler deps

* fix flops typo, rm tuple shape

3d1981ad

17 11月, 2022 1 次提交
- Z
  Clip intermediate output of op when save inference model (#48026) · fafc7be2
  由 zyfncg 提交于 11月 17, 2022
```
* clip extra and intermediate output of op

* fix bug

* fix bug

* polich code

* polich log
```
  fafc7be2
15 11月, 2022 1 次提交

mkldnn directory cleanup (#47779) · 8a339d24

由 Sławomir Siwek 提交于 11月 15, 2022

* cleanup unused code

* unify is_int8 is_bfloat16

* Simplify matmul_v2 FWD kernel

* remove RunKernel methods

* remove import namespace

* remove headers

* clean fluid/phi cross imports

* remove fluid axpy_handler

* delete fluid methods

* activations

* OneDNNMemDesc

* MKLDNNFormatForSize

* MatchShapeToLayout

* MKLDNNMemoryFormat

* MKLDNNFormat

* ReorderMKLDNNHandler

* to_void_cast

* review suggestions

* interpolate

* remove fluid depedency

8a339d24

11 11月, 2022 1 次提交

Refine shape op lanch method for standalone executor (#47843) · 981d1a10

由 zhangbo9674 提交于 11月 11, 2022

* refine shape op in new_exe

* Revert "refine shape op in new_exe"

This reverts commit 0e0336ddc5eede3da019b348a0bcc0ef0f3be64e.

* refine shape op in new_exe

* refine shape expected_kernel_type

* add SelectedRows check for shape op

* refine code

981d1a10

07 11月, 2022 1 次提交

[Restore PR] Remove hard code of PADDLE_WITH_CUDA (#47630) · 908a381d

由 HongyuJia 提交于 11月 07, 2022

* move cudnn hardcode outside GetExpectedKernelType

* add header file

* debug

* update interpreter_util with hardcode

* update interpreter_util headerfile

* solve activation hardcode

* debug with CI

* add mkldnn_op_list header file

* temporarily uncomment mkldnn

* temporarily uncomment mkldnn

* delete sequence_softmax cudnn hardcode

* add hardcode to data_transfer.cc

* update data_transfer headerfile

* try fix segment fault

* update cudnn&miopen_helper

* reset HasAttr of DygraphExctnCtx

* debug, this commit should pass all CI

* debug should pass CI, temporarily disable activation

* debug should pass CI

* fix default_attr=nullptr bug

* clean debug code

* Call SetDnnFallback function in the base class

* activation fallback to plain kernel

* fix default GetExpectedKernelType find wrong kernel

* search cudnn kernel instead of fallback

* fix cudnn_handle bug

* remove tanh use_cudnn

* restore tanh use_cudnn

* debug tanh

* fix tanh bug

* delete activation cudnn kernel

* polish code

908a381d

03 11月, 2022 1 次提交

[Opt Kernel Selection] Opt CanMKLDNNBeUsed performance (#47563) · 9adad42d

由 HongyuJia 提交于 11月 03, 2022

* opt CanMKLDNNBeUsed performance

* fix nullptr bug

* fix OpBase default_attrs=nullptr bug

* fix OpBase default_attrs=nullptr bug

* fix OpBase default_attrs=nullptr bug

9adad42d

02 11月, 2022 1 次提交
- H
  Revert "[Kernel Selection] Remove hard code of PADDLE_WITH_CUDA (#47325)" (#47582) · a57a19ea
  由 HongyuJia 提交于 11月 02, 2022
```
This reverts commit f9134045.
```
  a57a19ea
01 11月, 2022 2 次提交

[Kernel Selection] Remove hard code of PADDLE_WITH_CUDA (#47325) · f9134045

由 HongyuJia 提交于 11月 01, 2022

* move cudnn hardcode outside GetExpectedKernelType

* add header file

* debug

* update interpreter_util with hardcode

* update interpreter_util headerfile

* solve activation hardcode

* debug with CI

* add mkldnn_op_list header file

* temporarily uncomment mkldnn

* temporarily uncomment mkldnn

* delete sequence_softmax cudnn hardcode

* add hardcode to data_transfer.cc

* update data_transfer headerfile

* try fix segment fault

* update cudnn&miopen_helper

* reset HasAttr of DygraphExctnCtx

* debug, this commit should pass all CI

* debug should pass CI, temporarily disable activation

* debug should pass CI

* fix default_attr=nullptr bug

* clean debug code

f9134045

Adapting device-specific Extra Attributes for the PHI kernel (#46342) · c923e6c9

由 Chen Weihang 提交于 10月 31, 2022

* add extra attr property set

* add type_info for all context

* add onednn context to all context

* fix context compile error

* simplify conv kernel args

* pass runtime attr into dev_ctx

* fix marco error

* clear conv_grad_kernel extra args

* merge conv_grad_grad into conv_grad

* clear conv2d_grad_grad extra attrs

* clear yaml and eager extra attr

* fix conv1d error

* change to thread local

* fix npu compile failed

* try to fix windows compile failed

* add conv2d onednn phi kernel

* fix ci bugs (#36)

* fix compile bugs (#38)

* fix extra input transform bug (#39)

* support dynamic created attr (#40)

* reset extra info gen code

* rm conv_grad_grad kernel

* reimpl pass attr adapting

* add int attr support

* remove vector inputnames creating

* fix map at error

* Update paddle/phi/kernels/onednn/conv_grad_kernel.cc
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>

* remove useless extra attrs

* replace mkldnn_engine by onednn_engine
Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>

c923e6c9

26 10月, 2022 1 次提交
- C
  Remove the declaration of using LoDTensor in framework/lod_tensor.h (Part2) (#46953) · 1cb12ff5
  由 Chen Weihang 提交于 10月 25, 2022
```
* remove using lodtensor part2

* resolve code format error

* resolve conflict

* resolve conflict

* replace added frameworrk tensor
```
  1cb12ff5
25 10月, 2022 1 次提交

[Kernel Selection] Remove hard code of PADDLE_WITH_MKLDNN (Part2 add dnn_fallback flag) (#47200) · 6f5e7826

由 HongyuJia 提交于 10月 25, 2022

* use dnn_fallback flag to delete mkldnn hardcode

* polish code style

* fix protected error

* fix const error

* fix reduce_op fallback

* fix pool_op fallback

* add Set function of dnn_fallback_

6f5e7826

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功