提交 · 0280c0b9089e2ea2f1f26fd4582e96e29e93a4ac · BaiXuePrincess / Paddle

13 10月, 2022 1 次提交

[cherry-pick] [PHI] transpose2_grad op migration (#46139) (#46873) · 0280c0b9

由 Sławomir Siwek 提交于 10月 13, 2022

* Revert pool+grad oneDNN kernel conversion (#45989)

* [PHI] transpose2_grad op migration (#46139)

* op migrated, Copy(OneDNNContext, ...) added

* mutable_data & op registration in fluid removed

* refactoring

* OneDNNGetDataType to uppercase

* missing cpu check added, handler moved to .h file

* name changed to transpose_grad

* Copy changed back to TensorCopy

* Resizing corrected, Copy(OneDNNContext) removed
Co-authored-by: NPiotr Paturej <48731682+piotrekobi@users.noreply.github.com>
Co-authored-by: NPaulina Gacek <paulina.gacek@intel.com>

0280c0b9

11 10月, 2022 1 次提交
- S
  Revert pool+grad oneDNN kernel conversion (#45989) (#46860) · 7b3837e6
  由 Sławomir Siwek 提交于 10月 11, 2022
```
Co-authored-by: NPiotr Paturej <48731682+piotrekobi@users.noreply.github.com>
```
  7b3837e6
10 10月, 2022 3 次提交

[cherry-pick] [PHI] Migrate sgd and stack oneDNN kernels (#46374) (#46729) · 25d61cd1

由 Sławomir Siwek 提交于 10月 10, 2022

* [PHI] Migrate sgd and stack oneDNN kernels (#46374)

* Convert slice+grad oneDNN fluid kernels to PHI

* Change mutable_data to Alloc

* Refactor licences

* update dependencies
Co-authored-by: NPiotr Paturej <48731682+piotrekobi@users.noreply.github.com>

25d61cd1

[PHI] Migrate slice, slice_grad, split, pad and pad3d oneDNN kernels (#46101) (#46726) · 51a91fee

由 Sławomir Siwek 提交于 10月 10, 2022

* Convert split, pad and pad3d kernels

* Convert slice+grad oneDNN fluid kernels to PHI

* change out->mutable_data to dev_ctx.Alloc
Co-authored-by: NPiotr Paturej <48731682+piotrekobi@users.noreply.github.com>

51a91fee

S
[PHI] migrate softmax_grad kernel (#46257) (#46725) · 44ecae6c
由 Sławomir Siwek 提交于 10月 10, 2022
```
* init

* remove softmaxop

* merge dev

* correct dir

* style
```
44ecae6c

26 9月, 2022 1 次提交
- H
  [cherrypick] Fix elementwise_sub sign reverse for mkldnn (#46107) · 6990edfe
  由 Hui Zhang 提交于 9月 26, 2022
```
* fix sub sign reverse for mkldnn

* refactor code as comment

* remove useless
```
  6990edfe
19 9月, 2022 2 次提交
- M
  Add INT8 support for fused_multi_transformer_op (#45284) (#46169) · db368d5b
  由 minghaoBD 提交于 9月 19, 2022
```
Co-authored-by: NRichardWooSJTU <37864677+RichardWooSJTU@users.noreply.github.com>
```
  db368d5b
- S
  
  fix broadcast kernel (#46158) · 860f6077
  由 sneaxiy 提交于 9月 19, 2022
  
  860f6077
14 9月, 2022 1 次提交
- J
  cherry pick delay tensorrt log (#45958) · 2ca65904
  由 JingZhuangzhuang 提交于 9月 14, 2022
```
* cherry pick delay tensorrt log
* Update trt_plugin.h
```
  2ca65904
08 9月, 2022 1 次提交

[PHI] Migrate cast, clip+grad and pool+grad oneDNN kernels (#45775) · 1a929c31

由 piotrekobi 提交于 9月 08, 2022

* gaussian random

* mkldnn to onednn renaming

* fix merge conflicts

* remove fluid code

* onednn renaming

* Move classes from mkldnn_reuse.h to onednn_reuse.h

* Migrate pool+grad, clip+grad and cast oneDNN kernels to PHI

* Refactor grad kernels into separate files

* Fix CI failures

* Fix Codestyle

* Implement reviewer suggestions

* Add new lines after includes for readability
Co-authored-by: NSilv3S <slawomir.siwek@intel.com>

1a929c31

06 9月, 2022 1 次提交
- W
  
  enable memory optimize when fp16. (#45792) · 1967c6a6
  由 Wilber 提交于 9月 06, 2022
  
  1967c6a6
05 9月, 2022 2 次提交

[PHI] Move oneDNN helper classes to new location (#45626) · 269bd1fe

由 piotrekobi 提交于 9月 05, 2022

* gaussian random

* mkldnn to onednn renaming

* fix merge conflicts

* remove fluid code

* onednn renaming

* Move classes from mkldnn_reuse.h to onednn_reuse.h

* Move more functions from mkldnn_helper.h to onednn_helpper.h

* Change MKLDNN to OneDNN in VLOG message
Co-authored-by: NSilv3S <slawomir.siwek@intel.com>

269bd1fe

S

fix some op int32 exceed range (#45711) · a1dbee23
由 sneaxiy 提交于 9月 05, 2022

a1dbee23

04 9月, 2022 1 次提交

[PHI] Migrate gaussian_random kernel (#45481) · 4e3d222d

由 Sławomir Siwek 提交于 9月 04, 2022

* gaussian random

* mkldnn to onednn renaming

* fix merge conflicts

* remove fluid code

* onednn renaming

* change header path

* change fluid import to phi

4e3d222d

02 9月, 2022 1 次提交
- K
  
  move onednn file from phi/kernels/funcs/onednn to phi/backends/onednn (#45659) · 6813f41e
  由 kangguangli 提交于 9月 02, 2022
  
  6813f41e
01 9月, 2022 1 次提交
- L
  remove circular dependency of device_context and allocator (#45455) · 934171ae
  由 Leo Chen 提交于 9月 01, 2022
```
* refine cmake of framework

* add deps for dense tensor

* fix deps

* remove alloc(ctx)

* add depends on mkldnn
```
  934171ae
24 8月, 2022 1 次提交

【Hackathon No.34】优化 poisson op (#45160) · 3c14b094

由 Rayman 提交于 8月 24, 2022

* 【Hackathon No.34】优化 poisson op

* [poisson] code style fix

* modify code style

* prevent from big number

* modify code style

* modify code style

* modify import

* modify import

* modify code style

3c14b094

23 8月, 2022 1 次提交

[CustomDevice] add profiler apis (#45130) · da51baf2

由 ronnywang 提交于 8月 23, 2022

* [CustomDevice] add profiler apis

* migrate CalculateEstOccupancy into cuda_tracer

* update

* add ut

da51baf2

22 8月, 2022 1 次提交
- R
  
  [CustomDevice] fix custom ccl (#45276) · 307ad60d
  由 ronnywang 提交于 8月 22, 2022
  
  307ad60d
18 8月, 2022 1 次提交

change to async mode for xpu multi-card training in static graph mode, test=kunlun (#45024) · 41bdf41d

由 zhangxiaoci 提交于 8月 18, 2022

* change to async mode for xpu multi-card training in static graph mode

* minor bugfix

* irrelevant. move to another pr

* move change to other pr

* fix stream issue

* fix 'stream not meet with current context' error

* fix branch diverge, test=kunlun

41bdf41d

10 8月, 2022 2 次提交
- Z
  add macro control in enforce_xpu.h, test=kunlun (#45022) · 9e74211f
  由 zhangxiaoci 提交于 8月 10, 2022
```
* add macro control in enforce_xpu.h, test=kunlun

* minor bugfix

* minor bugfix
```
  9e74211f
- L
  [new-exec] set cuda device before run (#44985) · 68b06ba6
  由 Leo Chen 提交于 8月 10, 2022
```
* set cuda device before run

* add header file

* fix compile
```
  68b06ba6
05 8月, 2022 1 次提交
- Q
  
  [DCU] fix hipDeviceAttributeManagedMemory not support on DTK, test=develop (#44816) · 075d7219
  由 Qi Li 提交于 8月 05, 2022
  
  075d7219
01 8月, 2022 2 次提交
- [Sparse] optimize sparse attention (#44743) · 1149a378
  由 zhouweiwei2014 提交于 8月 01, 2022
  
  1149a378
- W
  infer context fix place error. (#44726) · 74e46a93
  由 Wilber 提交于 8月 01, 2022
```
* infer context fix place error.

* update

* update
```
  74e46a93
29 7月, 2022 1 次提交

move CUDAStream to phi (#44529) · da3743fd

由 Leo Chen 提交于 7月 29, 2022

* init

* move CUDAStream to phi

* fix compilation

* merge develop

* add stream_owned_ member

* split cuda_stream.h

* fix cpu compile

* fix constructor

* fix bug

* fix windows compile

* fix inference test_levit

* fix windows tests

da3743fd

26 7月, 2022 2 次提交
- R
  
  [CustomDevice] add blas_axpby api for gradient_accumulator (#44584) · 0d51fcf1
  由 ronnywang 提交于 7月 26, 2022
  
  0d51fcf1
- W
  inference multi stream support handle lazy init. (#44563) · 1892a441
  由 Wilber 提交于 7月 26, 2022
```
* multi stream support handle lazy init.

* support eigen lazy init

* update

* fix ci problem
```
  1892a441
22 7月, 2022 1 次提交
- Y
  
  Add code of occupancy computing on DCU and avoid threadID bug for DCU profiler (#44520) · 8037901b
  由 yuguo 提交于 7月 22, 2022
  
  8037901b
20 7月, 2022 1 次提交
- L
  
  add eigen3 dependency for phi_backends (#44479) · fbfdea51
  由 Leo Chen 提交于 7月 20, 2022
  
  fbfdea51
19 7月, 2022 1 次提交

compile phi/backends into one static library (#44373) · 1047cb17

由 Leo Chen 提交于 7月 19, 2022

* compile into one static library

* fix xpu compile

* fix xpu compile

* fix inference compile

* fix inference compile

* add custom test

* revert one file

1047cb17

18 7月, 2022 2 次提交
- [Sparse] Add sparse matmul kernel(coo*dense->dense) (#44346) · 3f70b1d3
  由 zhouweiwei2014 提交于 7月 18, 2022
  
  3f70b1d3
- R
  
  [CustomDevice] remove unused file (#44358) · fd6dcdfe
  由 ronnywang 提交于 7月 18, 2022
  
  fd6dcdfe
15 7月, 2022 1 次提交
- Z
  support KL2 multi-card training, *test=kunlun (#43889) · 270f25e9
  由 zhangxiaoci 提交于 7月 15, 2022
```
* update xccl lib
    * use separate streams for compute/comm on XPU
    * add broadcast op to xpu2_op_list
```
  270f25e9
14 7月, 2022 2 次提交

[Phi]Improve the mechanism for mkldnn kernel in PHI (#43941) · e9b4d0be

由 YuanRisheng 提交于 7月 14, 2022

* adapt mkldnn kernel in PHI

* fix ci compile bugs

* fix compile bugs

* fix compile bugs

* fix compile bugs

* fix compile bugs

* delete comment

* fix compile bugs in windows-inference

* delete code for converage

* modify code by review

* modify code by review

* add todo

* fix compile bugs

* fix compile bugs

* fix compile bugs

* fix unittest bugsx

e9b4d0be

R
[CustomDevice] add custom ccl 1/2 (#44294) · d88e77a7
由 ronnywang 提交于 7月 14, 2022
```
* [CustomDevice] add custom ccl api

* add ut
```
d88e77a7

13 7月, 2022 1 次提交
- R
  [CustomKernel] capi add eager mode support (#44164) · 033ef5e9
  由 ronnywang 提交于 7月 13, 2022
```
* [CustomKernel] add capi eager mode support

* add ut

* add capi test
```
  033ef5e9
12 7月, 2022 1 次提交
- C
  [PHI] Clean glog header in public header (#44216) · b0c9f24a
  由 Chen Weihang 提交于 7月 12, 2022
```
* clean glog header in public header

* move marco pos
```
  b0c9f24a
06 7月, 2022 1 次提交
- H
  
  minor fix VLOG for xpu. test=kunlun. (#44099) · 502062da
  由 houj04 提交于 7月 06, 2022
  
  502062da
05 7月, 2022 1 次提交
- R
  Dataloader add custom device support (#44013) · a0dc361c
  由 ronnywang 提交于 7月 05, 2022
```
* Dataloader add custom device support

* update test=document_fix
```
  a0dc361c

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致