Commits · 4a4f3f800c271892acb81afbd6b4dca10f3f6af9 · PaddlePaddle / Paddle

04 Nov, 2022 1 commit

[PHI] Migrate pool2d and pool2d_grad kernels (#47423) · ca4bed7b

Committed by Piotr Paturej on Nov 04, 2022

* add extra attr property set

* add type_info for all context

* add onednn context to all context

* fix context compile error

* simplify conv kernel args

* pass runtime attr into dev_ctx

* fix marco error

* clear conv_grad_kernel extra args

* merge conv_grad_grad into conv_grad

* clear conv2d_grad_grad extra attrs

* clear yaml and eager extra attr

* fix conv1d error

* change to thread local

* fix npu compile failed

* try to fix windows compile failed

* add conv2d onednn phi kernel

* fix ci bugs (#36)

* fix compile bugs (#38)

* fix extra input transform bug (#39)

* support dynamic created attr (#40)

* reset extra info gen code

* rm conv_grad_grad kernel

* reimpl pass attr adapting

* add int attr support

* remove vector inputnames creating

* fix map at error

* Update paddle/phi/kernels/onednn/conv_grad_kernel.cc
Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>

* remove useless extra attrs

* replace mkldnn_engine by onednn_engine

* Migrate pool+grad to PHI

* Update paddle/fluid/operators/mkldnn/test_mkldnn_op_nhwc.cc
Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>

* Update paddle/phi/kernels/onednn/pool_grad_kernel.cc
Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>

* Update paddle/phi/kernels/onednn/pool_kernel.cc
Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>
Co-authored-by: Chen Weihang <chenweihang@baidu.com>
Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
Co-authored-by: Chen Weihang <chenwhpro@163.com>
Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>

ca4bed7b

03 Nov, 2022 2 commits

sparse attention kernel is used from 11.8 (#47594) · 7648f429

Committed by zhouweiwei2014 on Nov 03, 2022

7648f429

[PHI] Migrate softmax kernel (#47339) · b8ae3858

Committed by Sławomir Siwek on Nov 03, 2022

* add extra attr property set

* add type_info for all context

* add onednn context to all context

* fix context compile error

* simplify conv kernel args

* pass runtime attr into dev_ctx

* fix marco error

* clear conv_grad_kernel extra args

* merge conv_grad_grad into conv_grad

* clear conv2d_grad_grad extra attrs

* remove redundant imports

* migrate softmax

* clear yaml and eager extra attr

* fix conv1d error

* change to thread local

* fix npu compile failed

* try to fix windows compile failed

* add conv2d onednn phi kernel

* fix ci bugs (#36)

* fix compile bugs (#38)

* fix extra input transform bug (#39)

* support dynamic created attr (#40)

* reset extra info gen code

* rm conv_grad_grad kernel

* reimpl pass attr adapting

* add int attr support

* remove vector inputnames creating

* merge dev

* fix map at error

* adjust attribute

* adapt funcs to PHI
Co-authored-by: Chen Weihang <chenweihang@baidu.com>
Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>

b8ae3858

02 Nov, 2022 1 commit

Add build option for CUDNN Frontend API (#47524) · eb100c7b

Committed by Tian Zheng on Nov 02, 2022

* Add build option for CUDNN Frontend API

* Fix review comments

* Change namespace for cudnn_frontend.h

eb100c7b

01 Nov, 2022 1 commit

Adapting device-specific Extra Attributes for the PHI kernel (#46342) · c923e6c9

Committed by Chen Weihang on Oct 31, 2022

* add extra attr property set

* add type_info for all context

* add onednn context to all context

* fix context compile error

* simplify conv kernel args

* pass runtime attr into dev_ctx

* fix marco error

* clear conv_grad_kernel extra args

* merge conv_grad_grad into conv_grad

* clear conv2d_grad_grad extra attrs

* clear yaml and eager extra attr

* fix conv1d error

* change to thread local

* fix npu compile failed

* try to fix windows compile failed

* add conv2d onednn phi kernel

* fix ci bugs (#36)

* fix compile bugs (#38)

* fix extra input transform bug (#39)

* support dynamic created attr (#40)

* reset extra info gen code

* rm conv_grad_grad kernel

* reimpl pass attr adapting

* add int attr support

* remove vector inputnames creating

* fix map at error

* Update paddle/phi/kernels/onednn/conv_grad_kernel.cc
Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>

* remove useless extra attrs

* replace mkldnn_engine by onednn_engine
Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>

c923e6c9

31 Oct, 2022 1 commit

[CustomDevice] GetCCLComm add custom device support (#47168) · 34d13d6a

Committed by ronnywang on Oct 31, 2022

* [CustomDevice] GetCCLComm add custom device support

* update

* update

* update

34d13d6a

27 Oct, 2022 1 commit

Update of PHI transpose_grad (#47311) · 493fbfd7

Committed by Jacek Czaja on Oct 27, 2022

* - halfway transforming transpose grad

- Fixes

- buildable

* - lint

* rerunning the process

493fbfd7

25 Oct, 2022 1 commit

Added workaround for elementwise oneDNN kernel (#47080) · 0abf7560

Committed by jakpiase on Oct 25, 2022

* return proper state

* fix for dims

* fix

0abf7560

19 Oct, 2022 1 commit

add nvtxRangePush/Pop for naive_executor and refine some code (#47139) · de6e7431

Committed by Yuanle Liu on Oct 19, 2022

de6e7431

17 Oct, 2022 2 commits

Fix warning message format error (#47045) · 13284437

Committed by RedContritio on Oct 17, 2022

13284437

[Custom Device] Add singleton to custom device (#46963) · 73196e5a

Committed by duanyanhui on Oct 17, 2022

* add singleton to custom device

* Update custom_device.cc

Init device_init_flag_ in default

73196e5a

11 Oct, 2022 1 commit

Remove LoDTensor using in fluid (Part 1) (#46663) · 940d8f25

Committed by Chen Weihang on Oct 11, 2022

* remove using lodtensor part1

* polish history code format

940d8f25

10 Oct, 2022 1 commit

[PHI] transpose2_grad op migration (#46139) · e3407a80

Committed by Paulina Gacek on Oct 10, 2022

* op migrated, Copy(OneDNNContext, ...) added

* mutable_data & op registration in fluid removed

* refactoring

* OneDNNGetDataType to uppercase

* missing cpu check added, handler moved to .h file

* name changed to transpose_grad

* Copy changed back to TensorCopy

* Resizing corrected, Copy(OneDNNContext) removed

e3407a80

28 Sep, 2022 1 commit

Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e

Committed by Chen Weihang on Sep 28, 2022

* remove needless using tensor

* remove needless using tensor

* resolve conflict

* replace tensor using

* fix format error

* revert needless changing

* fix rocm and npu compile error

* fix cinn compile error

* fix format error

* fix mkldnn format error

* fix mkldnn format error

* fix cinn compile error

* fix cinn compile error

* fix cinn compile error

* resolve conflict

e12a905e

22 Sep, 2022 2 commits

[PHI] Migrate sgd and stack oneDNN kernels (#46374) · 4ae37aee

Committed by Piotr Paturej on Sep 22, 2022

* Convert slice+grad oneDNN fluid kernels to PHI

* Change mutable_data to Alloc

* Refactor licences

4ae37aee

[mkldnn] Fix elementwise_sub sign reverse for mkldnn (#46049) · ab97b760

Committed by Hui Zhang on Sep 22, 2022

* fix sub sign reverse for mkldnn

* refactor code as comment

* remove useless

* format code

ab97b760

21 Sep, 2022 1 commit

Revert pool+grad oneDNN kernel conversion (#45989) · dc31d2aa

Committed by Piotr Paturej on Sep 21, 2022

dc31d2aa

20 Sep, 2022 2 commits

[PHI] migrate softmax_grad kernel (#46257) · 4dad95cc

Committed by Sławomir Siwek on Sep 20, 2022

* init

* remove softmaxop

* merge dev

* correct dir

* style

4dad95cc

[PHI] Migrate slice, slice_grad, split, pad and pad3d oneDNN kernels (#46101) · b232b5e9

Committed by Piotr Paturej on Sep 20, 2022

* Convert split, pad and pad3d kernels

* Convert slice+grad oneDNN fluid kernels to PHI

* change out->mutable_data to dev_ctx.Alloc

b232b5e9

18 Sep, 2022 1 commit

Add INT8 support for fused_multi_transformer_op (#45284) · 3d7e2118

Committed by RichardWooSJTU on Sep 18, 2022

3d7e2118

16 Sep, 2022 2 commits

Support broadcast elementwise operators with int64 index type (#45741) · 20b5bf84

Committed by sneaxiy on Sep 16, 2022

* support int64 non-broadcast

* support broadcast case for int64 index

* fix bug

* support more Arity

* remove some codes

* upgrade patchelf to v0.15.0 to pass CI build

* fix bug

* fix patchelf installation

* add debug flags

* remove useless codes

* fix viterbi_decode and set_value op uts

* remove always enable int64

20b5bf84

[CustomDevice] add new executor support (#46038) · 268f097e

Committed by ronnywang on Sep 16, 2022

* [CustomDevice] add custom_device_resource_pool & device_event_custom_device

* update

* update

* update

* update

268f097e

14 Sep, 2022 1 commit

delay tensorrt registry (#45824) · d7d35ff8

Committed by JingZhuangzhuang on Sep 14, 2022

* Delay TensorRT registry
* Add unused define
* Fix TensorRT test
* fix function to reference
* Update trt_plugin.h

d7d35ff8

08 Sep, 2022 1 commit

[PHI] Migrate cast, clip+grad and pool+grad oneDNN kernels (#45775) · 1a929c31

Committed by piotrekobi on Sep 08, 2022

* gaussian random

* mkldnn to onednn renaming

* fix merge conflicts

* remove fluid code

* onednn renaming

* Move classes from mkldnn_reuse.h to onednn_reuse.h

* Migrate pool+grad, clip+grad and cast oneDNN kernels to PHI

* Refactor grad kernels into separate files

* Fix CI failures

* Fix Codestyle

* Implement reviewer suggestions

* Add new lines after includes for readability
Co-authored-by: Silv3S <slawomir.siwek@intel.com>

1a929c31

06 Sep, 2022 1 commit

enable memory optimize when fp16. (#45792) · 1967c6a6

Committed by Wilber on Sep 06, 2022

1967c6a6

05 Sep, 2022 2 commits

[PHI] Move oneDNN helper classes to new location (#45626) · 269bd1fe

Committed by piotrekobi on Sep 05, 2022

* gaussian random

* mkldnn to onednn renaming

* fix merge conflicts

* remove fluid code

* onednn renaming

* Move classes from mkldnn_reuse.h to onednn_reuse.h

* Move more functions from mkldnn_helper.h to onednn_helpper.h

* Change MKLDNN to OneDNN in VLOG message
Co-authored-by: Silv3S <slawomir.siwek@intel.com>

269bd1fe

fix some op int32 exceed range (#45711) · a1dbee23

Committed by sneaxiy on Sep 05, 2022

a1dbee23

04 Sep, 2022 1 commit

[PHI] Migrate gaussian_random kernel (#45481) · 4e3d222d

Committed by Sławomir Siwek on Sep 04, 2022

* gaussian random

* mkldnn to onednn renaming

* fix merge conflicts

* remove fluid code

* onednn renaming

* change header path

* change fluid import to phi

4e3d222d

02 Sep, 2022 1 commit

move onednn file from phi/kernels/funcs/onednn to phi/backends/onednn (#45659) · 6813f41e

Committed by kangguangli on Sep 02, 2022

6813f41e

01 Sep, 2022 1 commit

remove circular dependency of device_context and allocator (#45455) · 934171ae

Committed by Leo Chen on Sep 01, 2022

* refine cmake of framework

* add deps for dense tensor

* fix deps

* remove alloc(ctx)

* add depends on mkldnn

934171ae

24 Aug, 2022 1 commit

[Hackathon No.34] Optimize poisson op (#45160) · 3c14b094

Committed by Rayman on Aug 24, 2022

* [Hackathon No.34] Optimize poisson op

* [poisson] code style fix

* modify code style

* prevent from big number

* modify code style

* modify code style

* modify import

* modify import

* modify code style

3c14b094

23 Aug, 2022 1 commit

[CustomDevice] add profiler apis (#45130) · da51baf2

Committed by ronnywang on Aug 23, 2022

* [CustomDevice] add profiler apis

* migrate CalculateEstOccupancy into cuda_tracer

* update

* add ut

da51baf2

22 Aug, 2022 1 commit

[CustomDevice] fix custom ccl (#45276) · 307ad60d

Committed by ronnywang on Aug 22, 2022

307ad60d

18 Aug, 2022 1 commit

change to async mode for xpu multi-card training in static graph mode, test=kunlun (#45024) · 41bdf41d

Committed by zhangxiaoci on Aug 18, 2022

* change to async mode for xpu multi-card training in static graph mode

* minor bugfix

* irrelevant. move to another pr

* move change to other pr

* fix stream issue

* fix 'stream not meet with current context' error

* fix branch diverge, test=kunlun

41bdf41d

10 Aug, 2022 2 commits

add macro control in enforce_xpu.h, test=kunlun (#45022) · 9e74211f

Committed by zhangxiaoci on Aug 10, 2022

* add macro control in enforce_xpu.h, test=kunlun

* minor bugfix

* minor bugfix

9e74211f

[new-exec] set cuda device before run (#44985) · 68b06ba6

Committed by Leo Chen on Aug 10, 2022

* set cuda device before run

* add header file

* fix compile

68b06ba6

05 Aug, 2022 1 commit

[DCU] fix hipDeviceAttributeManagedMemory not support on DTK, test=develop (#44816) · 075d7219

Committed by Qi Li on Aug 05, 2022

075d7219

01 Aug, 2022 2 commits

[Sparse] optimize sparse attention (#44743) · 1149a378

Committed by zhouweiwei2014 on Aug 01, 2022

1149a378

infer context fix place error. (#44726) · 74e46a93

Committed by Wilber on Aug 01, 2022

* infer context fix place error.

* update

* update

74e46a93

29 Jul, 2022 1 commit

move CUDAStream to phi (#44529) · da3743fd

Committed by Leo Chen on Jul 29, 2022

* init

* move CUDAStream to phi

* fix compilation

* merge develop

* add stream_owned_ member

* split cuda_stream.h

* fix cpu compile

* fix constructor

* fix bug

* fix windows compile

* fix inference test_levit

* fix windows tests

da3743fd

PaddlePaddle / Paddle synced successfully about 1 year ago