1. 18 11月, 2022 2 次提交
    • H
      [PHI decoupling] move "gpu_device_function.h" from fluid to phi (#48097) · 27ee6e71
      huangjiyi 提交于
      * move "paddle/phi/backends/gpu/gpu_device_function.h" to phi
      
      * update copyright years
      
      * rm "fluid/platform/device/gpu/gpu_device_function.h" in phi
      
      * fix rocm-complie bugs
      27ee6e71
    • T
      CUDNN v8 Implementation of Convolution Kernels (#47454) · 14a6e67b
      Tian Zheng 提交于
      * Refactor conv_kernel and conv_grad_kernel to provide interface for CUDNNv8 implementation
      
      * Fix macro
      
      * Add implementation for conv_kernel and conv_grad_kernel
      
      * Modification after rebase onto latest develop
      
      * Modify plan cache to comply with the API of phi::autotune
      
      * Refactor to reduce duplicate code
      
      * Review fix:
      - move functions in  conv_kernel_impl_v8.h and conv_grad_kernel_impl_v8.h to conv_kernel.cu and conv_grad_kernelk.cu
      - add const specifier for input tensor
      - add logging when plans fail to execute
      - move CudnnConvBwdFilterV8 and CudnnConvBwdDataV8 to conv_cudnn_frontend.h
      
      * - move plan building outside of cache
      
      * Fix ROCM build
      14a6e67b
  2. 16 11月, 2022 1 次提交
  3. 11 11月, 2022 1 次提交
  4. 10 11月, 2022 1 次提交
  5. 09 11月, 2022 1 次提交
  6. 08 11月, 2022 1 次提交
  7. 07 11月, 2022 1 次提交
  8. 02 11月, 2022 1 次提交
  9. 01 11月, 2022 1 次提交
    • C
      Adapting device-specific Extra Attributes for the PHI kernel (#46342) · c923e6c9
      Chen Weihang 提交于
      * add extra attr property set
      
      * add type_info for all context
      
      * add onednn context to all context
      
      * fix context compile error
      
      * simplify conv kernel args
      
      * pass runtime attr into dev_ctx
      
      * fix marco error
      
      * clear conv_grad_kernel extra args
      
      * merge conv_grad_grad into conv_grad
      
      * clear conv2d_grad_grad extra attrs
      
      * clear yaml and eager extra attr
      
      * fix conv1d error
      
      * change to thread local
      
      * fix npu compile failed
      
      * try to fix windows compile failed
      
      * add conv2d onednn phi kernel
      
      * fix ci bugs (#36)
      
      * fix compile bugs (#38)
      
      * fix extra input transform bug (#39)
      
      * support dynamic created attr (#40)
      
      * reset extra info gen code
      
      * rm conv_grad_grad kernel
      
      * reimpl pass attr adapting
      
      * add int attr support
      
      * remove vector inputnames creating
      
      * fix map at error
      
      * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc
      Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
      
      * remove useless extra attrs
      
      * replace mkldnn_engine by onednn_engine
      Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
      Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
      c923e6c9
  10. 25 10月, 2022 1 次提交
  11. 24 10月, 2022 2 次提交
  12. 19 10月, 2022 1 次提交
  13. 13 10月, 2022 1 次提交
  14. 29 9月, 2022 1 次提交
  15. 09 9月, 2022 1 次提交
  16. 07 9月, 2022 1 次提交
  17. 05 9月, 2022 1 次提交
  18. 25 8月, 2022 1 次提交
    • H
      optimize conv algo cache (#41891) · 1cd7e68b
      hong 提交于
      * optimizer conv alog speed
      
      * code polish
      
      * remove useless code
      
      * fix compile error
      
      * fix cpu compile error
      
      * not use cudnn alog t
      
      * add search cache max number
      
      * polish code
      
      * fix cache test bug
      
      * add groups data format to conv args
      
      * fix cache test bug
      
      * fix cudnn_deterministic bug
      
      * fix test switch auto tune bug
      
      * fix test swith autotune bug;
      
      * fix conv cache bug
      
      * fix cache test error
      
      * fix cache test bug
      
      * fix windows mac compile error
      
      * fix workspace search error
      
      * update cudnn cache
      
      * fix cache test bug; test=develop
      
      * fix autotune swith test error
      
      * polish code
      
      * oplish code
      1cd7e68b
  19. 23 8月, 2022 1 次提交
  20. 03 8月, 2022 1 次提交
    • T
      [operator migration] Migrate affine grid op (#44663) · d94b9686
      Thomas Young 提交于
      * save change
      
      * save change by YSL
      
      * save change by YSL
      
      * change by YSL
      
      * test pre commit
      
      * Revert "test pre commit"
      
      This reverts commit eee5e116331186cc544de871b4a5174a6431f17c.
      
      * fix code style
      
      * fix ctest
      
      * temp save
      
      * save change
      
      * change by YSL
      
      * final change by ysl
      
      * fix ci
      
      * fix code style
      
      * delete unuse code
      
      * change by ysl
      d94b9686
  21. 21 6月, 2022 2 次提交
  22. 10 6月, 2022 1 次提交
  23. 05 6月, 2022 1 次提交
  24. 01 6月, 2022 1 次提交
    • C
      [Yaml]add conv3d, depthwise_conv2d yaml (#42807) · 5f2c251c
      chentianyu03 提交于
      * add conv3d yaml
      
      * add conv3d_grad, conv3d_double_grad
      
      * add final_state_conv3d test case
      
      * add conv3d double test case
      
      * add depthwise_conv2d grad yaml
      
      * add depthwise_conv2d double grad test case
      
      * modify the order of args
      
      * add depthwise_conv2d_grad_grad config
      5f2c251c
  25. 30 5月, 2022 1 次提交
  26. 27 5月, 2022 1 次提交
  27. 15 4月, 2022 1 次提交
    • Z
      [DoubleGrad] Enabled test_imperative_star_gan_with_gradient_penalty.py under eager mode (#41730) · 27f28e82
      Zhanlue Yang 提交于
      * [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad
      
      * Fixed elementwise issue
      
      * Addressed CI failures
      
      * [DoubleGrad] Enabled test_imperative_triple_grad test cases under eager_mode
      
      * [DoubleGrad] Enabled test_autograd_functional_dynamic.py under eager mode
      
      * Enabled more test cases
      
      * [DoubleGrad] Enabled test_imperative_star_gan_with_gradient_penalty.py under eager mode
      
      * Adjusted test_imperative_star_gan_with_gradient_penalty.py
      27f28e82
  28. 12 4月, 2022 1 次提交
  29. 09 4月, 2022 2 次提交
  30. 06 4月, 2022 1 次提交
    • H
      Add conv yaml (#41354) · 7ed7c6c7
      hong 提交于
      * update
      
      * add conv yaml
      
      * add backward
      
      * remove useless code
      
      * fix bug
      
      * fix bug
      
      * revert fluid dygraph conv2d
      
      * remove useless infermeta function
      
      * fix meta fn deluplicat error
      
      * conv using custom impl
      
      * remove amp include
      
      * fix bug
      
      * use cudnn = true
      
      * fix test mkldnn caching bug
      7ed7c6c7
  31. 22 3月, 2022 1 次提交
  32. 21 3月, 2022 1 次提交
  33. 16 3月, 2022 1 次提交
  34. 14 3月, 2022 2 次提交
  35. 12 3月, 2022 1 次提交