提交 · bc5bae1681a342037b0bc79edae664efcb958364 · PaddlePaddle / Paddle

30 3月, 2023 1 次提交
- K
  mv paddle/fluid/platform/device/xpu/tests 2 test/xpu/cpp (#52243) · bc5bae16
  由 Kim 提交于 3月 30, 2023
```
* mv paddle/fluid/platform/device/xpu/tests 2 test/xpu/cpp

* add missing cmake
```
  bc5bae16
24 3月, 2023 1 次提交

[PHI Decoupling]Remove memory header (Part3) (#51288) · 3d78e759

由 YuanRisheng 提交于 3月 24, 2023

* decouple memory copy

* fix ci bugs

* fix ci compile bugs

* fix rocm compile

* fix ci bugs

* decouple memory

* deal with conflict

* fix xpu compile bugs

* fix xpu bugs

* deal with xpu bugs

* fix cmake bugs

* fix windows bugs

* fix ci bugs

* fix ci bugs

* delete redundance code

* add code for pybind

* fix py3 bugs

* fix ci bugs

3d78e759

21 3月, 2023 1 次提交

[PHI decoupling] Move DataType* from paddle:experimental to phi namespace (#51716) · 4638a62e

由 iSerendipity 提交于 3月 21, 2023

* move DataType from paddle::experimental to phi

* convert namespace

* convert namespace

* convert namespace

* clarify namespace

* convert more datatype

* Revert "convert more datatype"

This reverts commit 083b462959e6a22d4d8767707b628b95b396642e.

* convert more in auto_code_generator

* fix conflicts for XPU

* fix namespace conflicts

* fix errors

* Revert "fix errors"

This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876.

* fix errors

* fix formatting

4638a62e

16 3月, 2023 1 次提交
- H
  [phi decoupling] remove fluid gpu_info usage in phi (#51699) · 907433a7
  由 Huang Jiyi 提交于 3月 16, 2023
```
* remove fluid thread_data_registry

* update

* fix bug
```
  907433a7
15 3月, 2023 1 次提交
- J
  
  modify cmake rules temporarily (#51644) · 521bba9c
  由 JingZhuangzhuang 提交于 3月 15, 2023
  
  521bba9c
06 3月, 2023 1 次提交

[phi decoupling] decouple dependency to device_context in phi (Part 1) (#50865) · a1006b2b

由 Huang Jiyi 提交于 3月 06, 2023

* move DeviceContextPool to phi

* add EmplaceExternalContextFunc

* update namespace

* update cmake

* fix bugs and create context_pool_impl.h

* replace platform::is_xxx_place

* fix bugs

* update generator

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix enforce usage

* Revert "fix enforce usage"

This reverts commit 5f521f08a69713cee506e64a00ec6d9fba709e27.

* fix bugs

* rm XPUDeviceContext and CustomDeviceContext

* fix bugs

* fix fix context init bug

* fix bugs after merge

* fix bugs

* fix name

* fix mutable_data

* update and fix bugs

* fix bugs

* update

* fix bugs

* fix name

* fix bugs

* merge

* fix bugs

* create context_pool in phi/backends

* create context_pool in phi/backends

* fix bugs

* fix xpu bugs

* fix rocm bugs

* fix bugs

* fix bugs

* fix bugs

* fix xpu bugs

* update

* update

* fix bugs

* fix bugs

a1006b2b

24 2月, 2023 1 次提交
- N
  
  Fix KP operator Kernel selection error (#50178) · 6ef3f2ce
  由 niuliling123 提交于 2月 24, 2023
  
  6ef3f2ce
21 2月, 2023 1 次提交

[PHI Decoupling]Remove memory header (Part1) (#50419) · 1cfcb71d

由 YuanRisheng 提交于 2月 21, 2023

* decouple_memory

* perfect memory utils

* fix ci bugs

* fix inference bugs

* fix custom test bugs

* fix converage bugs

* modify code according comment

* modify namespace

* deal with compile bugs

1cfcb71d

16 2月, 2023 1 次提交
- H
  [XPU] update xccl to 1.0.8 and xdnn to 20230215 (#50247) · b8008580
  由 houj04 提交于 2月 16, 2023
```
* [XPU] update xccl to 1.0.8

* update xdnn. add uint8 for concat and split.

* update xdnn to 20230215.
```
  b8008580
16 1月, 2023 1 次提交

CUDA12.0 integration (#49539) · 1885d55a

由 zlsh80826 提交于 1月 16, 2023

* Update warpctc for cuda-12

* Deprecate cudaProfilerInitialize for CUDA > 11

* Deprecate CUSPARSE_MV_ALG_DEFAULT for CUDA_VERSION >= 11040

* Add the missing thrust header

1885d55a

20 12月, 2022 1 次提交

[PHI decouple] move dropout_impl and cuda_graph_with_memory_pool from fluid to phi (#49139) · 579784e2

由 huangjiyi 提交于 12月 20, 2022

* move dropout_impl from fluid to phi

* move cuda_graph_with_memory_pool from fluid to phi

* update namespace

* remove cuad_graph in fluid

* fix mac-build

* fix bugs

* correct CodeStyle

* fix mac-build

* fix mutable_data

* fix stl include

* fix copy param

579784e2

09 12月, 2022 1 次提交
- P
  
  [PHI decoupling] move "flags.h" from fluid to phi (#48696) · 39ffef0d
  由 PuQing 提交于 12月 09, 2022
  
  39ffef0d
08 12月, 2022 2 次提交

[PHI decoupling] move cuda_graph from fluid to phi (#48686) · a4d9851b

由 huangjiyi 提交于 12月 08, 2022

* move cuda_graph from fluid to phi

* move device_memory_aligment from fluid to phi

* Revert "move device_memory_aligment from fluid to phi"

This reverts commit b92fcd39a0a50fdac13278f49be0237a85f3a13f.

* update xpu cmake

a4d9851b

Q
rm kunlun xpu2_op_list (#48826) · 83c41459
由 QingshuChen 提交于 12月 08, 2022
```
*test=kunlun
```
83c41459

06 12月, 2022 3 次提交
- Q
  add xpu_support op function (#48606) · 06b32b38
  由 QingshuChen 提交于 12月 06, 2022
```
*test=kunlun
```
  06b32b38
- H
  
  [XPU] add tile_grad op (#48720) · 8de336f9
  由 houj04 提交于 12月 06, 2022
  
  8de336f9
- Y
  add xpu centered rmsprop (#48658) · 54b756e2
  由 ykkk2333 提交于 12月 06, 2022
```
* add stat tool

* add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun

* add xpu rmsprop centered, test=kunlun
```
  54b756e2
05 12月, 2022 1 次提交
- H
  
  move device_memory_aligment from fluid to phi (#48694) · 796499fd
  由 huangjiyi 提交于 12月 05, 2022
  
  796499fd
02 12月, 2022 1 次提交

add silu, silu_grad, unfold and unfold_grad xpu kernels (#48325) · f71de378

由 ykkk2333 提交于 12月 02, 2022

* add stat tool

* add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun

* add silu, unfold and their grads,test=kunlun

f71de378

30 11月, 2022 2 次提交

Z
Fix bug of wrong eigen dependency (#48485) · 35902ec6
由 zyfncg 提交于 11月 30, 2022
```
* fix bug of eigen_dependency

* fix xpu compile
```
35902ec6

use correct xpu stream for synchronization (#48470) · 16562a9d

由 james 提交于 11月 30, 2022

some legacy code still use xpu_wait() for stream sync -- it only syncs
default stream. this PR replaces them with dev_ctx.Wait() to ensure
that correct stream is always used

16562a9d

29 11月, 2022 2 次提交
- A
  [PHI decoupling]migrate enforce_custom.h from fluid to phi (#48422) · 9896ac1e
  由 Asthestarsfalll 提交于 11月 29, 2022
```
* migrate enforce_custom.h from fluid to phi

* move to backends/custom/
```
  9896ac1e
- H
  
  add floor fp32 op *test=kunlun (#48458) · 9d4b4be3
  由 haosicheng 提交于 11月 29, 2022
  
  9d4b4be3
28 11月, 2022 4 次提交

[PHI decoupling] move several header files from fluid to phi (#48415) · fd9c91c3

由 huangjiyi 提交于 11月 28, 2022

* decouple cudnn_desc.h from fluid

* move cudnn_desc.h from fluid to phi

* fix bugs

* decouple cudnn_helper.h from fluid

* fix bugs

* move cudnn_helper.h from fluid to phi

* add fluid cudnn_helper.h

* move miopen_desc.h from fluid to phi

* move miopen_helper.h from fluid to phi

* fix bugs

* move gpu_dnn.h from fluid to phi

* fix bugs

* update copyright year

* simplify gpu_dnn.h in fluid

* fix bugs

* fix xpu build bug

* fix compile bug

* fix bug

fd9c91c3

[Phi decouple] remove dependece to "paddle/fluid/platform/device/xpu/xxx.h" in phi (#48420) · 2bae75ed

由 huangjiyi 提交于 11月 28, 2022

* rm fluid “xpu_header.h” deps in phi

* move part of xpu_op_list.h from fluid to phi

* add fluid xpu_op_list deps

* add glog deps for xpu_op_list in phi

* fix PR-CI-Kunlun

2bae75ed

H

add square fp16 *test=kunlun (#48095) · 81d0a3cc
由 haosicheng 提交于 11月 28, 2022

81d0a3cc
张
Remove LoDTensor and Tensor in fluid except operators folder (#48416) · 4527d249
由张春乔提交于 11月 28, 2022
```
* Update communicator.cc

* Update communicator.cc

* remove LoDTensor

* remove LoDTensor and Tensor
```
4527d249

25 11月, 2022 1 次提交
- H
  
  fix xpu compile on phi::enforce. (#48345) · d90469a4
  由 houj04 提交于 11月 25, 2022
  
  d90469a4
24 11月, 2022 3 次提交

Z

add exp_grad, hard_sigmoid and hard_sigmoid_grad for xpu, test=kunlun (#48307) · d2f87d96
由 zhangyikun02 提交于 11月 24, 2022

d2f87d96
Z

add pad3d and pad3d_grad op for xpu, test=kunlun (#48306) · 22555e96
由 zhangyikun02 提交于 11月 24, 2022

22555e96

[PHI decoupling] simplify "convert_utils.h" in fluid (#48168) · de4310e6

由 huangjiyi 提交于 11月 24, 2022

* rm dependence to "convert_utils.h" in some files

* fix bugs

* replace DataType2String with DataTypeToString

* replace framework::DataTypeSize with phi::SizeOf

* mv convert_function from fluid to phi and rm old map

* recommit with pre-commit

* repalce ProtoVarType with ProtoDataType and update comment.

* fix error about include "dnnl.hpp"

* revert add dep mkldnn to convert_utils in phi

* add mkldnn deps in convert_utils.h in phi

* move deps to convert_utils.h in phi

de4310e6

23 11月, 2022 2 次提交
- Y
  add masked_select_grad kernel (#48137) · db0ea0ce
  由 ykkk2333 提交于 11月 23, 2022
```
* add stat tool

* add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun

* add masked_selected_grad kernel,test=kunlun
```
  db0ea0ce
- Z
  
  add warpctc kernel and change cast_v2 to cast for xpu, test=kunlun (#48134) · 25ffe9c2
  由 zhangyikun02 提交于 11月 23, 2022
  
  25ffe9c2
22 11月, 2022 1 次提交

[PHI decoupling] remove "gpu_device_function.h" in fluid. (#48117) · 4da1a0fe

由 huangjiyi 提交于 11月 22, 2022

* move "paddle/phi/backends/gpu/gpu_device_function.h" to phi

* update copyright years

* rm "fluid/platform/device/gpu/gpu_device_function.h" in phi

* rm dependence to "gpu_device_function.h" in fluid

* rm gpu_device_function.h etc in fluid

* fix rocm-complie bugs

* fix cuda_helper_test.cu bugs

4da1a0fe

21 11月, 2022 1 次提交
- T
  
  add adamw suppor xpu, test=kunlun (#48114) · 27e252d9
  由 taixiurong 提交于 11月 21, 2022
  
  27e252d9
18 11月, 2022 4 次提交

Z
Fix bug of zero_allocator in HostAlloc (#48108) · 7f92e27e
由 zyfncg 提交于 11月 18, 2022
```
* fix bug of zero_allocator in host

* fix test compile bug

* add unittest

* update test
```
7f92e27e

CUDNN v8 Implementation of Convolution Kernels (#47454) · 14a6e67b

由 Tian Zheng 提交于 11月 18, 2022

* Refactor conv_kernel and conv_grad_kernel to provide interface for CUDNNv8 implementation

* Fix macro

* Add implementation for conv_kernel and conv_grad_kernel

* Modification after rebase onto latest develop

* Modify plan cache to comply with the API of phi::autotune

* Refactor to reduce duplicate code

* Review fix:
- move functions in  conv_kernel_impl_v8.h and conv_grad_kernel_impl_v8.h to conv_kernel.cu and conv_grad_kernelk.cu
- add const specifier for input tensor
- add logging when plans fail to execute
- move CudnnConvBwdFilterV8 and CudnnConvBwdDataV8 to conv_cudnn_frontend.h

* - move plan building outside of cache

* Fix ROCM build

14a6e67b

W
[PHI decoupling] remove "gpu_primitives.h" in fluid (#48063) · 9918bf9c
由 Wang Xin 提交于 11月 18, 2022
```
* remove "gpu_primitives.h" in fluid namespace

* fix PR-CI-GpuPS fail

* fix PR-CI-GpuPS fail
```
9918bf9c
Z

cast and gradient_accumulator support double for xpu, test=kunlun (#47800) · 982d5ff7
由 zhangyikun02 提交于 11月 18, 2022

982d5ff7

17 11月, 2022 1 次提交
- T
  
  xpu-paddlepaddle-41 [任务] ffn and attention test=kunlun (#46658) · 071708fa
  由 taixiurong 提交于 11月 17, 2022
  
  071708fa

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功