Commits · 27ee6e714046e8cf6dd913854da167233f7f7c41 · PaddlePaddle / Paddle

18 Nov, 2022 2 commits

[PHI decoupling] move "gpu_device_function.h" from fluid to phi (#48097) · 27ee6e71

Committed by huangjiyi on Nov 18, 2022

* move "paddle/phi/backends/gpu/gpu_device_function.h" to phi

* update copyright years

* rm "fluid/platform/device/gpu/gpu_device_function.h" in phi

* fix rocm-compile bugs

27ee6e71

CUDNN v8 Implementation of Convolution Kernels (#47454) · 14a6e67b

Committed by Tian Zheng on Nov 18, 2022

* Refactor conv_kernel and conv_grad_kernel to provide interface for CUDNNv8 implementation

* Fix macro

* Add implementation for conv_kernel and conv_grad_kernel

* Modification after rebase onto latest develop

* Modify plan cache to comply with the API of phi::autotune

* Refactor to reduce duplicate code

* Review fix:
- move functions in conv_kernel_impl_v8.h and conv_grad_kernel_impl_v8.h to conv_kernel.cu and conv_grad_kernel.cu
- add const specifier for input tensor
- add logging when plans fail to execute
- move CudnnConvBwdFilterV8 and CudnnConvBwdDataV8 to conv_cudnn_frontend.h

* move plan building outside of cache

* Fix ROCM build

14a6e67b

16 Nov, 2022 1 commit

move "gpu_primitives.h" to phi (#48015) · 9adca1e7

Committed by Wang Xin on Nov 16, 2022

9adca1e7

11 Nov, 2022 1 commit

remove "paddle/fluid/framework/op_registry.h" from phi (#47868) · 78c8c7de

Committed by Wang Xin on Nov 11, 2022

78c8c7de

10 Nov, 2022 1 commit

[PHI Decoupling] remove "paddle/fluid/platform/float16.h" and... · 8164b97a

Committed by huangjiyi on Nov 10, 2022

[PHI Decoupling] remove "paddle/fluid/platform/float16.h" and "paddle/fluid/platform/for_range.h" in phi. (#47817)

* rm "paddle/fluid/platform/float16.h" in phi

* rm "paddle/fluid/platform/for_range.h" in phi

8164b97a

09 Nov, 2022 1 commit

[PHI decoupling] remove "paddle/fluid/platform/dynload/xxx.h" in phi (#47787) · 7c302538

Committed by huangjiyi on Nov 09, 2022

* rm "paddle/fluid/platform/dynload/cudnn.h" in phi

* rm "paddle/fluid/platform/dynload/mklml.h" in phi

* rm "paddle/fluid/platform/dynload/rocblas.h" in phi

* replace "paddle::platform::dynload::" with "phi::dynload::" in phi

* revert "blas_impl.cu.h"

7c302538

08 Nov, 2022 1 commit

remove dependency on fluid/framework/eigen.h in phi (#47675) · c7cd8d98

Committed by jzhang533 on Nov 08, 2022

* remove dependency on fluid/framework/eigen.h in phi

* more fixes according to PR-CI-Py3 failure

c7cd8d98

07 Nov, 2022 1 commit

Define ConvRunner to wrap the call of cudnn conv functions. (#47576) · c331e2ce

Committed by Yiqun Liu on Nov 07, 2022

* Define ConvRunner to wrap the call of cudnn conv functions.

* Use ConvKind in SearchAlgorithm.

c331e2ce

02 Nov, 2022 1 commit

[PHI]Standardise some C++ API (Part3) (#47532) · fe8c6796

Committed by YuanRisheng on Nov 02, 2022

* Standardise batch norm

* standardize conv3d and depthwise_conv2d

* fix ci bugs

fe8c6796

01 Nov, 2022 1 commit

Adapting device-specific Extra Attributes for the PHI kernel (#46342) · c923e6c9

Committed by Chen Weihang on Oct 31, 2022

* add extra attr property set

* add type_info for all context

* add onednn context to all context

* fix context compile error

* simplify conv kernel args

* pass runtime attr into dev_ctx

* fix macro error

* clear conv_grad_kernel extra args

* merge conv_grad_grad into conv_grad

* clear conv2d_grad_grad extra attrs

* clear yaml and eager extra attr

* fix conv1d error

* change to thread local

* fix npu compile failed

* try to fix windows compile failed

* add conv2d onednn phi kernel

* fix ci bugs (#36)

* fix compile bugs (#38)

* fix extra input transform bug (#39)

* support dynamic created attr (#40)

* reset extra info gen code

* rm conv_grad_grad kernel

* reimpl pass attr adapting

* add int attr support

* remove vector inputnames creating

* fix map at error

* Update paddle/phi/kernels/onednn/conv_grad_kernel.cc
Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>

* remove useless extra attrs

* replace mkldnn_engine by onednn_engine
Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>

c923e6c9

25 Oct, 2022 1 commit

[Zero-Dim] support input 0D Tensor for softmax/log_softmax/gumbel_softmax (#47251) · ac3b882f

Committed by zhouweiwei2014 on Oct 25, 2022

ac3b882f

24 Oct, 2022 2 commits

Enhance the implementation of some conv functions. (#47281) · bc47e7ac

Committed by Yiqun Liu on Oct 24, 2022

bc47e7ac

Move the header file of conv cudnn and miopen to phi directory. (#47248) · 31f57f29

Committed by Yiqun Liu on Oct 24, 2022

31f57f29

19 Oct, 2022 1 commit

Enable to record whether the conv algo is got by exhaustive search to fix... · 3bc4b850

Committed by Yiqun Liu on Oct 19, 2022

Enable to record whether the conv algo is got by exhaustive search to fix autotune cache bug. (#47065)

3bc4b850

13 Oct, 2022 1 commit

fix softmax memory align (#46902) · 71748805

Committed by carryyu on Oct 13, 2022

71748805

29 Sep, 2022 1 commit

Optimize softmax's performance when dim_size >= 100000. (#46535) · 9012787f

Committed by carryyu on Sep 29, 2022

9012787f

09 Sep, 2022 1 commit

Fix softmax op when the input shape is larger than INT32_MAX (#45897) · 38edea9a

Committed by sneaxiy on Sep 09, 2022

* fix softmax int64

* follow comments

38edea9a

07 Sep, 2022 1 commit

[OpAttr]Adapt tensor output_size for conv2d_transpose and depthwise_conv2d_transpose (#45620) · fe169bf1

Committed by WangZhen on Sep 07, 2022

Adapt tensor output_size for conv2d_transpose and depthwise_conv2d_transpose

fe169bf1

05 Sep, 2022 1 commit

[OpAttr]ksize of pool2d support Tensor type of adaptive_avg_pool2d API (#45660) · f3c6d19c

Committed by Aurelius84 on Sep 05, 2022

* [OpAttr]ksize of pool2d support Tensor type

* fix unittest

* add unittest

f3c6d19c

25 Aug, 2022 1 commit

optimize conv algo cache (#41891) · 1cd7e68b

Committed by hong on Aug 25, 2022

* optimize conv algo speed

* code polish

* remove useless code

* fix compile error

* fix cpu compile error

* not use cudnn algo t

* add search cache max number

* polish code

* fix cache test bug

* add groups data format to conv args

* fix cache test bug

* fix cudnn_deterministic bug

* fix test switch auto tune bug

* fix test switch autotune bug;

* fix conv cache bug

* fix cache test error

* fix cache test bug

* fix windows mac compile error

* fix workspace search error

* update cudnn cache

* fix cache test bug; test=develop

* fix autotune switch test error

* polish code

* polish code

1cd7e68b

23 Aug, 2022 1 commit

Delete the template parameter BlockSize in Kernel Primitive API (#45220) · 1a0cd447

Committed by niuliling123 on Aug 23, 2022

1a0cd447

03 Aug, 2022 1 commit

[operator migration] Migrate affine grid op (#44663) · d94b9686

Committed by Thomas Young on Aug 03, 2022

* save change

* save change by YSL

* save change by YSL

* change by YSL

* test pre commit

* Revert "test pre commit"

This reverts commit eee5e116331186cc544de871b4a5174a6431f17c.

* fix code style

* fix ctest

* temp save

* save change

* change by YSL

* final change by ysl

* fix ci

* fix code style

* delete unused code

* change by ysl

d94b9686

21 Jun, 2022 2 commits

resort .cu headers, set clang-format not sort include block and consider .cu... · 829723f2

Committed by Sing_chan on Jun 21, 2022

resort .cu headers, set clang-format not sort include block and consider .cu as main source file (#43633)

829723f2

slice large tensor for cudnn_softmax (#43681) · bd5e97d3

Committed by Zhang Ting on Jun 21, 2022

bd5e97d3

10 Jun, 2022 1 commit

[Phi] Fix depthwise conv yaml error (#43379) · f551d9fe

Committed by Chen Weihang on Jun 10, 2022

* fix depthwise conv yaml error

* fix depthwise conv double grad error

f551d9fe

05 Jun, 2022 1 commit

[code format check upgrade] step2: clang-format (#42840) · a3730dc8

Committed by Sing_chan on Jun 05, 2022

a3730dc8

01 Jun, 2022 1 commit

[Yaml]add conv3d, depthwise_conv2d yaml (#42807) · 5f2c251c

Committed by chentianyu03 on Jun 01, 2022

* add conv3d yaml

* add conv3d_grad, conv3d_double_grad

* add final_state_conv3d test case

* add conv3d double test case

* add depthwise_conv2d grad yaml

* add depthwise_conv2d double grad test case

* modify the order of args

* add depthwise_conv2d_grad_grad config

5f2c251c

30 May, 2022 1 commit

Implement fused_gate_attention operator for AlphaFold. (#42018) · fdcdbec5

Committed by crystal on May 30, 2022

fdcdbec5

27 May, 2022 1 commit

[Phi] Change optional tensor from `optional<const Tensor&>` to `optional<Tensor>` (#42939) · 6d78524c

Committed by zyfncg on May 27, 2022

* refactor the optional tensor

* remove optional<MetaTensor> in InferMeta

* fix bug

* fix optional<vector<Tensor>>

* fix bug

* fix rmsprop

* fix amp of eager_gen

* polish code

* fix deleted code

* fix merge conflict

* polish code

* remove is_nullopt_

* fix merge conflict

* fix merge conflict

6d78524c

15 Apr, 2022 1 commit

[DoubleGrad] Enabled test_imperative_star_gan_with_gradient_penalty.py under eager mode (#41730) · 27f28e82

Committed by Zhanlue Yang on Apr 15, 2022

* [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad

* Fixed elementwise issue

* Addressed CI failures

* [DoubleGrad] Enabled test_imperative_triple_grad test cases under eager_mode

* [DoubleGrad] Enabled test_autograd_functional_dynamic.py under eager mode

* Enabled more test cases

* [DoubleGrad] Enabled test_imperative_star_gan_with_gradient_penalty.py under eager mode

* Adjusted test_imperative_star_gan_with_gradient_penalty.py

27f28e82

12 Apr, 2022 1 commit

fix depthwise dnn bug (#41666) · 7b627dd8

Committed by hong on Apr 12, 2022

7b627dd8

09 Apr, 2022 2 commits

add depthwise conv hip support (#41537) · b3b8d345

Committed by hong on Apr 09, 2022

b3b8d345

Autotune the workspace_size_limit in conv. (#40338) · b937cdc5

Committed by limingshu on Apr 09, 2022

* Use the maximum workspace_size of all algorithms to limit the workspace size in exhaustive search mode.

* Use the system cudaMalloc and cudaFree to allocate workspace during searching.

* Enable switching between the two kinds of workspace setting methods.
Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>

b937cdc5
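
The first bullet of the commit above can be read as follows; this is only a minimal sketch of that reading, not Paddle's code. The `Candidate` record, both function names, and all numbers are hypothetical, and the timing step of an exhaustive search is replaced by precomputed values. The sketch derives the search-time workspace limit from the largest workspace any candidate algorithm requests, then picks the fastest candidate that fits.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """Hypothetical record for one convolution algorithm candidate."""
    name: str
    workspace_bytes: int  # workspace the algorithm asks for
    time_ms: float        # run time, precomputed here for illustration


def search_workspace_limit(candidates):
    """Return the largest workspace requested by any candidate."""
    return max(c.workspace_bytes for c in candidates)


def pick_fastest(candidates, limit_bytes):
    """Pick the fastest candidate whose workspace fits within limit_bytes."""
    feasible = [c for c in candidates if c.workspace_bytes <= limit_bytes]
    if not feasible:
        raise ValueError("no candidate fits within the workspace limit")
    return min(feasible, key=lambda c: c.time_ms)


# Illustrative values only (assumption): three candidates with different needs.
candidates = [
    Candidate("algo_a", workspace_bytes=0, time_ms=1.9),
    Candidate("algo_b", workspace_bytes=64 << 20, time_ms=1.2),
    Candidate("algo_c", workspace_bytes=256 << 20, time_ms=0.8),
]

limit = search_workspace_limit(candidates)
best = pick_fastest(candidates, limit)
```

With a limit taken from the candidates themselves, every candidate is eligible during the search; passing a smaller `limit_bytes` to `pick_fastest` shows how a fixed cap would exclude the larger-workspace candidates.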

06 Apr, 2022 1 commit

Add conv yaml (#41354) · 7ed7c6c7

Committed by hong on Apr 06, 2022

* update

* add conv yaml

* add backward

* remove useless code

* fix bug

* fix bug

* revert fluid dygraph conv2d

* remove useless infermeta function

* fix meta fn duplicate error

* conv using custom impl

* remove amp include

* fix bug

* use cudnn = true

* fix test mkldnn caching bug

7ed7c6c7

22 Mar, 2022 1 commit

Change bn mutable data to phi (#40748) · d9a41fc4

Committed by hong on Mar 22, 2022

* move mutable_data to context alloc

* move mutable_data to context alloc

* remove duplicate code

d9a41fc4

21 Mar, 2022 1 commit

Move conv-transpose OPs to phi (#40675) · 1eb96eec

Committed by From00 on Mar 21, 2022

* Move conv-transpose OPs to phi

* Fix CI errors

* Fix CI errors

1eb96eec

16 Mar, 2022 1 commit

Optimize the computation of log_softmax (#40612) · 2dec25db

Committed by Zhang Zheng on Mar 16, 2022

* Optimize the computation of log_softmax

* modify the var name

2dec25db

14 Mar, 2022 2 commits

Optimize performance of log_softmax (#38992) · 250e254f

Committed by Zhang Zheng on Mar 14, 2022

* Optimize performance of log_softmax

* delete unity build

* modify to phi

* fix

* fixfixfixfix

* fix

* fix

* fix

* fix

* simplify

* fix

* fix enforce

250e254f

Move Pool OPs to phi (#40208) · 88ec08a7

Committed by From00 on Mar 14, 2022

* Move Pool OPs to phi

* Fix CI error

* Fix conflicts

88ec08a7

12 Mar, 2022 1 commit

[Phi] Add softmax infermeta functions (#40471) · ec09ef26

Committed by Chen Weihang on Mar 12, 2022

* rename softmax kernel name

* move softmax infershape

* fix failed test

ec09ef26

PaddlePaddle / Paddle synced successfully about 1 year ago
