1. 17 Dec, 2022 1 commit
  2. 16 Dec, 2022 1 commit
  3. 06 Dec, 2022 1 commit
    • Z
      Clear extra input (Bias, ResidualData) in OpMaker of conv2d (#47579) · 0a2dfa38
      Committed by zyfncg
      * delete Bias and ResidualData in OpMaker of conv2d
      
      * delete extra input of conv3d
      
      * refactor pass of conv_bias_fusion
      
      * fix mkldnn dependency
      
      * fix mkldnn compile
      
      * fix test_conv_bias_mkldnn_fuse_pass
      
      * polish some code
      
      * remove useless log
      
      * fix analyzer_vit_ocr_tester
      
      * fix conv_activation_mkldnn_fuse_pass
      
      * fix test_analyzer_ocr
      
      * add fused_conv_sig
      
      * fix performance regression
      
      * fix performance regression
  4. 05 Dec, 2022 1 commit
  5. 18 Nov, 2022 1 commit
    • T
      CUDNN v8 Implementation of Convolution Kernels (#47454) · 14a6e67b
      Committed by Tian Zheng
      * Refactor conv_kernel and conv_grad_kernel to provide interface for CUDNNv8 implementation
      
      * Fix macro
      
      * Add implementation for conv_kernel and conv_grad_kernel
      
      * Modification after rebase onto latest develop
      
      * Modify plan cache to comply with the API of phi::autotune
      
      * Refactor to reduce duplicate code
      
      * Review fix:
      - move functions in conv_kernel_impl_v8.h and conv_grad_kernel_impl_v8.h to conv_kernel.cu and conv_grad_kernel.cu
      - add const specifier for input tensor
      - add logging when plans fail to execute
      - move CudnnConvBwdFilterV8 and CudnnConvBwdDataV8 to conv_cudnn_frontend.h
      
      * move plan building outside of cache
      
      * Fix ROCM build
  6. 31 Oct, 2022 1 commit
  7. 20 Oct, 2022 1 commit
  8. 19 Sep, 2022 1 commit
  9. 09 Sep, 2022 1 commit
  10. 06 Sep, 2022 1 commit
  11. 02 Sep, 2022 1 commit
  12. 30 Aug, 2022 1 commit
    • H
      [phi] Transfer coalesce_tensor to phi (#45478) · cf9d651b
      Committed by HongyuJia
      * add coalesce_tensor kernel
      
      * polish coalesce_tensor kernel
      
      * add sig and InferMeta
      
      * add testcase
      
      * add legacy_api.yaml
      
      * fix infermeta
      
      * fix yaml
      
      * fix kernel implementation
      
      * add compile dependency of phi/kernels
      
      * fix MetaConfig
      
      * add python api
      
      * add and fix testcase
      
      * rnn.py add import
      
      * change _C_ops.coalesce_tensor
      
      * remove useless comments
      
      * add SetBackend
      
      * restore XPU kernel temporarily
      
      * fix code according to PR comments
  13. 26 Aug, 2022 1 commit
    • K
      Transfer transfer_layout from fluid to phi (#45261) · 985f2a4a
      Committed by kangguangli
      * remove fluid kernel and activate phi kernel
      
      * fix parameter error
      
      * transfer mkldnn part
      
      * modify header file path
      
      * fix compile error
      
      * transfer special case
      
      * fix lod setting and special case for layout setting
      
      * add testcase and refine code
  14. 12 Aug, 2022 1 commit
  15. 05 Aug, 2022 2 commits
    • Y
      [MKLDNN]Move mkldnn activation kernel to phi (#44365) · 2dfa88d2
      Committed by YuanRisheng
      * move mkldnn activation kernel
      
      * fix compile bugs
      
      * fix compile bugs
      
      * deal with conflict
      
      * fix compile bugs
      
      * fix windows compile bugs
      
      * mkldnn unittest fix
      
      * change mutable to alloc
      
      * fix unittest bugs
      
      * modify code according to comment
    • F
      move fft kernels to phi (#44714) · 153f1138
      Committed by Feiyu Chan
      * move fft kernels to phi, done with cufft, pocketfft, mkl_cdft, hipfft
      * make stft_op use fft from phi/kernels/funcs, clean code
  16. 03 Aug, 2022 1 commit
  17. 01 Aug, 2022 1 commit
  18. 29 Jul, 2022 1 commit
  19. 19 Jul, 2022 1 commit
  20. 16 Jul, 2022 1 commit
    • W
      [Phi] Migrate solve kernel to phi (#44363) · c0a7830f
      Committed by Weilong Wu
      * draft version
      
      * draft version
      
      * draft version
      
      * migrate solve kernel to phi
      
      * polish
      
      * polish
      
      * remove useless header file, fix a bug in grad_kernel_impl
      
      * add header file in need
  21. 14 Jul, 2022 1 commit
    • Y
      [Phi]Improve the mechanism for mkldnn kernel in PHI (#43941) · e9b4d0be
      Committed by YuanRisheng
      * adapt mkldnn kernel in PHI
      
      * fix ci compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * delete comment
      
      * fix compile bugs in windows-inference
      
      * delete code for coverage
      
      * modify code by review
      
      * modify code by review
      
      * add todo
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix unittest bugs
  22. 29 Jun, 2022 1 commit
  23. 24 Jun, 2022 1 commit
    • Y
      [Phi]Change Copy from Kernel to basic component utils (#43622) · 2739bd73
      Committed by YuanRisheng
      * perfect copy
      
      * deal with conflict
      
      * deal with conflict
      
      * fix compile bugs
      
      * fix unittest bugs
      
      * change code format
      
      * deal with conflict
      
      * modify code by review
      
      * fix ce bugs
      
      * fix ce bugs
      
      * add lo
      
      * perfect code format
      
      * deal with conflicts
  24. 23 Jun, 2022 1 commit
  25. 16 Jun, 2022 1 commit
  26. 10 Jun, 2022 1 commit
  27. 04 Jun, 2022 1 commit
  28. 25 Apr, 2022 1 commit
  29. 15 Apr, 2022 2 commits
    • C
      [Phi]Reduce kernels into multiple files (#41747) · 1927aff9
      Committed by chentianyu03
      * split reduce_kernel
      
      * rm reduce_kernel in cmake
      
      * split reduce_grad kernels
      
      * fix cmake build error
      
      * format code
      
      * fix standalone_executor_test error
    • L
      Change cuDNN Conv kernel for auto tune feature (#41313) · 35acfeda
      Committed by limingshu
      * change cudnn helper for auto-tune
      
      * Add FLAGS_use_autotune to set the global status of autotune and change the order of choosing algorithm.
      
      * Fix the bug in calculating and printing current step cache hit rate.
      
      * Improve the autotune cache and fix unittest.
      
      * Change the key from AlgorithmType to int64_t.
      
      * Fix unittest for cpu-only env.
      
      * change ChooseAlgoByWorkspace for heuristic mode
      Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>
  30. 10 Apr, 2022 1 commit
  31. 01 Apr, 2022 1 commit
    • C
      [Phi] Move softmax with cross entropy kernel into phi (#40832) · e6ec98fe
      Committed by Chen Weihang
      * add cross_entropy_with_softmax phi kernel
      
      * remove softmax_with_cross_entropy kernel
      
      * add softmax_with_cross_entropy grad kernel
      
      * remove original op kernel
      
      * refine cross entropy impl
      
      * fix pointer error
      
      * revert kernel cu change
      
      * fix xpu failed
      
      * fix cinn failed
      
      * fix npu failed
      
      * add forward sig
      
      * add check_nan_inf for pt kernel
      
      * remove repeat cmake item
      
      * fix unittest error
  32. 30 Mar, 2022 1 commit
  33. 28 Mar, 2022 2 commits
    • H
      Move meshgrid to phi (#40994) · ca871957
      Committed by hong
      * move momentum, rmsprop to phi; test=develop
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update; test=develop
      
      * fix xpu npu bugs; test=develop
      
      * fix npu bug; test=develop
      
      * fix windows compile error; test=develop
      
      * fix windows compile error; test=develop
      
      * polish code; test=develop
      
      * fix conflict; test=develop
      
      * add meshgrid;
      
      * update
      
      * polish code
      
      * polish code;
      
      * fix bug
      
      * format; remove useless code
      
      * fix npu bug
      
      * fix bug
    • 0
      [Phi] Move warpctc OP to phi (#40023) · cb183762
      Committed by 0x45f
      * moving OP
      
      * move forward
      
      * move grad and infershape
      
      * code format
      
      * format code
      
      * fix code
      
      * fix code
      
      * fix CMakeLists.txt
      
      * fix comments
      
      * Refine CMakeLists for rocm ci
  34. 27 Mar, 2022 1 commit
    • J
      Add StringTensor (#39830) · 0695e1ac
      Committed by Jack Zhou
      * add string tensor and case convert kernels
      
      * Add strings empty kernel; Reorganize the structure of case convert kernel
      
      * Add string infermeta
      
      * Update mutable_data of string tensor
      
      * rename kernel name
      
      * add string copy tmp
      
      * Fix strings copy device bug
      
      * add utf8 gpu converter
      
      * add string tensor c++ api
      
      * Remove mutable_data of string tensor
      
      * update string tensor interface
      
      * remove charcases_flag.h
      
      * remove some fluid headers
      
      * Add make_ddim
      
      * __HIPCC__ -> PADDLE_WITH_HIP
      
      * remove fluid headers
      
      * fix cpu compile
      
      * remove std::hash
      
      * Fix cudaMalloc
      
      * Remove strings/impl directory
      
      * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps
      
      * Add empty kernel test
      
      * Remove some comments
      
      * Modify lower/upper api encoding type: string->bool
      
      * STRING->PSTRING; Add CreateInferLikeMeta
      
      * Add code gen for C++ String API
      
      * remove strings_api_utils.h
      
      * Add ignore file (strings_api.h, strings_api.cc)
      
      * update strings gen script
      
      * change args order of case convert kernels
      
      * Add comments for pstring, StringTensor
      
      * cpstring_internal.h -> cpstring_impl.h
      
      * Update according to comments:
      
      1. Remove fluid headers
      2. paddle::platform::errors -> phi::errors
      3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()'
      4. Use camel code style
      
      * Remove all singletons in strings kernels
      
      * fix rocm compile
      
      * Fix py3 compile
      
      * Fix c++ coverage
      
      * 1. Add pstring proto type
      2. Add StringTensor debug info
      3. Rename case_convert_kernel to strings_lower_upper
      4. Remove serialize deserialize strings kernel
      
      * DataLayout::PSTRING -> DataLayout::PSTRING_UNION
      
      * Register pstring data type
      
      * Fix strings api gen
      
      * Fix dense tensor register pstring dtype
      
      * Fix error messages
      
      * remove line
      
      * add pstring unittest
      
      * remove test string api unittest
      
      * remove empty line
      
      * Remove some headers to decrease the size of executable file
  35. 25 Mar, 2022 1 commit
    • A
      [Phi] Migrate Adam and AdamW into Phi (#40351) · 56cd3407
      Committed by Aurelius84
      * [Phi] Migrate Adam and Adamw into Phi
      
      * fix compile error and unittest ok
      
      * fix compile error and unittest ok
      
      * fix undefined reference to fLI::FLAGS
      
      * test depend on operator
      
      * fix cmake
      
      * fix xpu compile
      
      * fix infrt
      
      * fix amp_type_traits
      
      * fix amp_type_traits
      
      * modify according to reviewer
      
      * modify according to reviewer
      
      * fix dtype float16
      
      * fix typo
      
      * fix Cmake
      
      * fix code style
  36. 24 Mar, 2022 1 commit
  37. 23 Mar, 2022 1 commit