1. 28 Nov, 2022 · 1 commit
  2. 25 Nov, 2022 · 2 commits
  3. 24 Nov, 2022 · 4 commits
    • [PHI decoupling] simplify "convert_utils.h" in fluid (#48168) · de4310e6
      huangjiyi committed
      * rm dependence to "convert_utils.h" in some files
      
      * fix bugs
      
      * replace DataType2String with DataTypeToString
      
      * replace framework::DataTypeSize with phi::SizeOf
      
      * mv convert_function from fluid to phi and rm old map
      
      * recommit with pre-commit
      
      * replace ProtoVarType with ProtoDataType and update comment.
      
      * fix error about include "dnnl.hpp"
      
      * revert add dep mkldnn to convert_utils in phi
      
      * add mkldnn deps in convert_utils.h in phi
      
      * move deps to convert_utils.h in phi
    • [Phi Support CuDNN] Support ALL CuDNN (#47865) · 1623f1b4
      HongyuJia committed
      * support default use_gpudnn=True
      
      * fully support cudnn in phi
      
      * add header file
      
      * add white_list, verify accuracy
      
      * phi support all cudnn
      
      * opt affine_grad
      
      * try different arches of pretrained_model
      
      * try different arches of pretrained_model
      
      * add debug string
      
      * debug eager_method
      
      * add debug string, pass all local ctest
      
      * polish all debug code
      
      * delete use_cudnn relevant code autogen
      
      * fix depthwise_conv2d
      
      * Share all other members of Tensor except use_cudnn
      
      * polish codes according to review opinion
      
      * polish codes according to review opinion, fix bug
      
      * polish codes according to review opinion, opt performance
      
      * polish codes according to review opinion, fix pooling.py
    • do not calc reduce_all in eager mode (#48199) · bcf75132
      wanghuancoder committed
      * do not calc reduce_all in eager mode
      
      * refine python c cast list
      
      * refine
      
      * refine
      
      * refine
      
      * refine
      
      * refine
      
      * refine
      
      * refine
      
      * refine
      
      * refine
    • dense tensor in eager mode support data_ptr (#48235) · 3f265815
      wanghuancoder committed
      * dense tensor in eager mode support data_ptr
  4. 23 Nov, 2022 · 1 commit
  5. 21 Nov, 2022 · 4 commits
  6. 18 Nov, 2022 · 3 commits
  7. 17 Nov, 2022 · 2 commits
  8. 16 Nov, 2022 · 2 commits
  9. 14 Nov, 2022 · 4 commits
  10. 10 Nov, 2022 · 4 commits
  11. 09 Nov, 2022 · 4 commits
    • Get grads from cpp for optimizer to avoid gpu idle time (#47709) · 261ebb0c
      WangZhen committed
      * Get params and grads in cpp to avoid gpu idle time
      
      * Using python param instead of cpp return param to fix test_asp_optimize_dynamic.py
      
      * Get grads from cpp and construct params_grads on python
      
      * Check meta and remove comments
    • Enable fc passes (#45704) · 7e914386
      Paulina Gacek committed
      * Analysis API interface for disabling fc passes
      
      * Unit tests corrected
      
      * Python API added
      
      * test runs only when PADDLE_WITH_MKLDNN
      
      * Fc op changed to relu in matmul_op_test
      
      * Disable fc passes in tests where acc drops
      
      * code formatting
      
      * Unit test for analysisConf added
      
      * Unit test gpu added
      
      * fc passes disabled when iterations=0 in gru test
      
      * style
      
      * passes disabled when fp32 in gru test
      
      * fc passes disabled in lstm test
      
      * Import from inference, not fluid in doc
    • refactor: ProcessGroupNCCL (#47740) · ae14bad1
      Wen Sun committed
    • refine python call error report (#47724) · 5c7fce47
      wanghuancoder committed
      * refine python call error report
  12. 08 Nov, 2022 · 2 commits
  13. 07 Nov, 2022 · 4 commits
  14. 04 Nov, 2022 · 1 commit
  15. 03 Nov, 2022 · 1 commit
  16. 01 Nov, 2022 · 1 commit