1. 24 Nov, 2022 3 commits
    • H
      [Phi Support CuDNN] Support ALL CuDNN (#47865) · 1623f1b4
      HongyuJia committed
      * support default use_gpudnn=True
      
      * fully support cudnn in phi
      
      * add header file
      
      * add white_list, verify accuracy
      
      * phi support all cudnn
      
      * opt affine_grad
      
      * try different arches of pretrained_model
      
      * try different arches of pretrained_model
      
      * add debug string
      
      * debug eager_method
      
      * add debug string, pass all local ctest
      
      * polish all debug code
      
      * delete use_cudnn relevant code autogen
      
      * fix depthwise_conv2d
      
      * Share all other members of Tensor except use_cudnn
      
      * polish codes according to review opinion
      
      * polish codes according to review opinion, fix bug
      
      * polish codes according to review opinion, opt performance
      
      * polish codes according to review opinion, fix pooling.py
    • W
      do not calc reduce_all in eager mode (#48199) · bcf75132
      wanghuancoder committed
      * do not calc reduce_all in eager mode
      
      * refine python c cast list
      
      * refine
      
      * refine
      
      * refine
      
      * refine
      
      * refine
      
      * refine
      
      * refine
      
      * refine
      
      * refine
    • W
      dense tensor in eager mode support data_ptr (#48235) · 3f265815
      wanghuancoder committed
      * dense tensor in eager mode support data_ptr
  2. 23 Nov, 2022 1 commit
  3. 21 Nov, 2022 4 commits
  4. 18 Nov, 2022 3 commits
  5. 17 Nov, 2022 2 commits
  6. 16 Nov, 2022 2 commits
  7. 14 Nov, 2022 4 commits
  8. 10 Nov, 2022 4 commits
  9. 09 Nov, 2022 4 commits
    • W
      Get grads from cpp for optimizer to avoid gpu idle time (#47709) · 261ebb0c
      WangZhen committed
      * Get params and grads in cpp to avoid gpu idle time
      
      * Using python param instead of cpp return param to fix test_asp_optimize_dynamic.py
      
      * Get grads from cpp and construct params_grads on python
      
      * Check meta and remove comments
    • P
      Enable fc passes (#45704) · 7e914386
      Paulina Gacek committed
      * Analysis API interface for disabling fc passes
      
      * Unit tests corrected
      
      * Python API added
      
      * test runs only when PADDLE_WITH_MKLDNN
      
      * Fc op changed to relu in matmul_op_test
      
      * Disable fc passes in tests where acc drops
      
      * code formatting
      
      * Unit test for analysisConf added
      
      * Unit test gpu added
      
      * fc passes disabled when iterations=0 in gru test
      
      * style
      
      * passes disabled when fp32 in gru test
      
      * fc passes disabled in lstm test
      
      * Import from inference, not fluid in doc
    • W
      refactor: ProcessGroupNCCL (#47740) · ae14bad1
      Wen Sun committed
    • W
      refine python call error report (#47724) · 5c7fce47
      wanghuancoder committed
      * refine python call error report
  10. 08 Nov, 2022 2 commits
  11. 07 Nov, 2022 4 commits
  12. 04 Nov, 2022 1 commit
  13. 03 Nov, 2022 1 commit
  14. 01 Nov, 2022 3 commits
  15. 31 Oct, 2022 1 commit
  16. 28 Oct, 2022 1 commit