1. 15 Dec, 2022 1 commit
    • [PHI decoupling] Remove fluid imports from MKLDNN code (#48981) · 4d5a5533
      Sławomir Siwek committed
      * fix wrong handler name
      
      * mkldnn_engine -> onednn_engine
      
      * remove fluid/errors.h imports
      
      * remove fluid/enforce.h imports
      
      * remove note and unnecessary import
      
      * remove fluid/pretty_log.h imports
      
      * remove fluid/place.h imports
      
      * remove fluid/data_layout_transform.h imports
      
      * remove fluid/device_context.h imports
      
      * remove mkldnn_helper code
      
      * remove fluid/mkldnn_reuse.h imports
      
      * pretty_log import
  2. 12 Dec, 2022 1 commit
    • Optimization of Eigh op with ssyevj_batched runtime api (#48560) · 16e364d3
      傅剑寒 committed
      * fix codestyle
      
      * add double, complex<float>, and complex<double> dtype support for syevj_batched
      
      * fix use_syevj flag to avoid precision loss when the input dtype of syevj_batch is complex128 in some cases
      
      * optimize eigh for different cases
      
      * fix missing ; bug
      
      * fix use_syevj bug
      
      * fix use_cusolver_syevj_batched flag
  3. 09 Dec, 2022 1 commit
  4. 08 Dec, 2022 3 commits
  5. 07 Dec, 2022 1 commit
  6. 06 Dec, 2022 3 commits
  7. 05 Dec, 2022 2 commits
  8. 02 Dec, 2022 1 commit
  9. 30 Nov, 2022 2 commits
  10. 29 Nov, 2022 4 commits
  11. 28 Nov, 2022 5 commits
  12. 25 Nov, 2022 3 commits
  13. 24 Nov, 2022 4 commits
  14. 23 Nov, 2022 3 commits
  15. 22 Nov, 2022 1 commit
    • [PHI decoupling] remove "gpu_device_function.h" in fluid. (#48117) · 4da1a0fe
      huangjiyi committed
      * move "paddle/phi/backends/gpu/gpu_device_function.h" to phi
      
      * update copyright years
      
      * rm "fluid/platform/device/gpu/gpu_device_function.h" in phi
      
      * rm dependence to "gpu_device_function.h" in fluid
      
      * rm gpu_device_function.h etc in fluid
      
      * fix rocm-compile bugs
      
      * fix cuda_helper_test.cu bugs
  16. 21 Nov, 2022 2 commits
  17. 18 Nov, 2022 3 commits
    • Fix bug of zero_allocator in HostAlloc (#48108) · 7f92e27e
      zyfncg committed
      * fix bug of zero_allocator in host
      
      * fix test compile bug
      
      * add unittest
      
      * update test
    • CUDNN v8 Implementation of Convolution Kernels (#47454) · 14a6e67b
      Tian Zheng committed
      * Refactor conv_kernel and conv_grad_kernel to provide interface for CUDNNv8 implementation
      
      * Fix macro
      
      * Add implementation for conv_kernel and conv_grad_kernel
      
      * Modification after rebase onto latest develop
      
      * Modify plan cache to comply with the API of phi::autotune
      
      * Refactor to reduce duplicate code
      
      * Review fix:
      - move functions in conv_kernel_impl_v8.h and conv_grad_kernel_impl_v8.h to conv_kernel.cu and conv_grad_kernel.cu
      - add const specifier for input tensor
      - add logging when plans fail to execute
      - move CudnnConvBwdFilterV8 and CudnnConvBwdDataV8 to conv_cudnn_frontend.h
      
      * move plan building outside of cache
      
      * Fix ROCM build
    • [PHI decoupling] remove "gpu_primitives.h" in fluid (#48063) · 9918bf9c
      Wang Xin committed
      * remove "gpu_primitives.h" in fluid namespace
      
      * fix PR-CI-GpuPS fail
      
      * fix PR-CI-GpuPS fail