1. 24 11月, 2022 1 次提交
    • H
      [Phi Support CuDNN] Support ALL CuDNN (#47865) · 1623f1b4
      HongyuJia 提交于
      * support default use_gpudnn=True
      
      * fully support cudnn in phi
      
      * add header file
      
      * add white_list, verify accuracy
      
      * phi support all cudnn
      
      * opt affine_grad
      
      * try different arches of pretrained_model
      
      * try different arches of pretrained_model
      
      * add debug string
      
      * debug eager_method
      
      * add debug string, pass all local ctest
      
      * polish all debug code
      
      * delete use_cudnn relevant code autogen
      
      * fix depthwise_conv2d
      
      * Share all other members of Tensor except use_cudnn
      
      * polish codes according to review opinion
      
      * polish codes according to review opinion, fix bug
      
      * polish codes according to review opinion, opt performance
      
      * polish codes according to review opinion, fix pooling.py
      1623f1b4
  2. 28 9月, 2022 1 次提交
  3. 05 9月, 2022 1 次提交
  4. 29 7月, 2022 1 次提交
    • Y
      [Phi] Add yaml for assign_value (#44596) · 88584396
      Yulong Ao 提交于
      * [Phi] Add yaml for assign_value
      
      * [Phi] Fix the bug of the assign api and modify the unittest
      
      * [Phi] Fix the bug when the tensor does not have the backend info
      
      * [Phi] Replace the functional-style cast init by the brace-init
      
      * [Phi] Cast the data explicitly
      88584396
  5. 14 7月, 2022 1 次提交
    • Y
      [Phi]Improve the mechanism for mkldnn kernel in PHI (#43941) · e9b4d0be
      YuanRisheng 提交于
      * adapt mkldnn kernel in PHI
      
      * fix ci compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * delete comment
      
      * fix compile bugs in windows-inference
      
      * delete code for converage
      
      * modify code by review
      
      * modify code by review
      
      * add todo
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix unittest bugsx
      e9b4d0be
  6. 14 4月, 2022 1 次提交
  7. 07 4月, 2022 1 次提交
  8. 28 3月, 2022 1 次提交
  9. 22 3月, 2022 2 次提交
  10. 21 3月, 2022 1 次提交
  11. 28 2月, 2022 1 次提交
  12. 20 2月, 2022 1 次提交
  13. 11 2月, 2022 1 次提交
  14. 29 1月, 2022 1 次提交
    • C
      [PTen] Tidy pten core headers (#39188) · dd990981
      Chen Weihang 提交于
      * open header for custom kernel
      
      * add core utils
      
      * tidy core code
      
      * tify header
      
      * tidy include
      
      * tidy namespace
      
      * resolve conflit
      
      * fix unittest and coverage
      
      * remove platform using
      
      * resolve conflict
      
      * resolve conflict
      
      * fix digamma namespace error
      
      * fix xpu full kernel error
      
      * fix xpu full kernel error
      
      * polish details
      
      * add place for lib storage
      dd990981
  15. 16 12月, 2021 1 次提交
  16. 03 12月, 2021 1 次提交
    • Z
      【PTen】C++ API Code-Generation (#37668) · c58c4ede
      zyfncg 提交于
      * add code-gen file
      
      * add api-gen in cmake
      
      * adjust the code format
      
      * temp save the code
      
      * add genen-api module into pten
      
      * merge the develop code
      
      * fix merge conflict
      
      * fix code conflict with develop
      
      * support reduce_mean/sum
      
      * fix the CI requirement
      
      * fix requirement problem of CI
      
      * remove original api code
      
      * fix bug caused by removing original api
      c58c4ede
  17. 15 11月, 2021 1 次提交
    • C
      [Pten] Refactor the implementation of custom operator (#37122) · 1e598f1a
      Chen Weihang 提交于
      * move extension into pten [no-verify]
      
      * append tensor methods by ext_tensor [no-verify]
      
      * append other tensor methods [no-verify]
      
      * ext related files tidy [no-verify]
      
      * include relation tidy [no-verify]
      
      * add pten tensor test [no-verify]
      
      * replace tensor in custom op & compile success
      
      * refine tensor constructor for unittest
      
      * custom relu jit run success
      
      * fix all custom op unittests
      
      * add inference cmake adapt [no-verify]
      
      * fix failed unittests
      
      * fix windows failed unittests
      
      * try to fix kunlun and inference failed
      
      * fix test_elementwise_api error
      
      * try to fix win compile failed
      
      * fix kunlun fp16 type error
      
      * remove useless haddle error macro
      
      * add custom linear op test
      
      * fix compile failed & add win symbols
      
      * fix non pten kernel cast failed
      
      * add dll decl for api
      
      * polish several deetails
      
      * polish details by review comment
      
      * add dll_decl for register
      1e598f1a