1. 01 5月, 2022 1 次提交
  2. 28 4月, 2022 1 次提交
  3. 27 4月, 2022 1 次提交
    • Z
      Optimize performance of dygraph (v4) (#42196) · 37e2f027
      zyfncg 提交于
      * optimize performance of dygraph
      
      * optimize performance of dygraph and elementwise_add
      
      * optimize the trace op
      
      * fix bug
      
      * fix bug
      
      * fix unittest bug
      
      * fix code format
      37e2f027
  4. 25 4月, 2022 2 次提交
  5. 18 4月, 2022 2 次提交
  6. 16 4月, 2022 1 次提交
  7. 14 4月, 2022 2 次提交
  8. 13 4月, 2022 1 次提交
  9. 12 4月, 2022 2 次提交
    • L
      [KP] Add Logical/compare/bitwise registry & UT (#40802) · 3749198e
      Lijunhui 提交于
      * init commit no push
      
      * collect comile errors
      
      * bitwise UT
      
      * fix compile problem
      
      * cancel comments
      
      * restore miss deletion
      
      * fix compilation
      
      * fix UT
      
      * NO stash in multiple branch at the same times
      
      * fix error
      
      * combine .cu from gpu and kps
      
      * replace gpu by kps
      
      * fix by Chen-weihang
      
      * Revert "Fix kps compile error in Junhui logic compare bitwise"
      
      * fix backend test
      
      * rm comments
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      3749198e
    • F
      add a inner loop for index_select_grad_init() in index_select op when dealing... · bc01242b
      FlyingQianMM 提交于
      add a inner loop for index_select_grad_init() in index_select op when dealing with large-shape data (#41563)
      
      * replace for with CUDA_KERNEL_LOOP for index_select_grad_init() in index_select op
      
      * use CUDA_KERNEL_LOOP_TYPE
      
      * fix code style
      
      * replace index_select_grad_init with SetConstant
      bc01242b
  10. 07 4月, 2022 1 次提交
  11. 03 4月, 2022 1 次提交
    • F
      add maximum limit for grid of index_select (#41127) · af8d2482
      FlyingQianMM 提交于
      * limit grid dim for index select
      
      * mv LimitGridDim into gpu_launch_config.h
      
      * fix conflicts
      
      * fix conflicts
      
      * fix code style
      
      * set block to 256
      
      * fix grid setting
      
      * set dtype of block_dim to unsigned int
      af8d2482
  12. 02 4月, 2022 2 次提交
  13. 01 4月, 2022 1 次提交
    • C
      [Phi]Interploatd kernels into phi (#40855) · d65a7a46
      chentianyu03 提交于
      * add interploate cpu kernel
      
      * fix nullptr bug
      
      * add interpolate gpu kernel
      
      * fix unit test error
      
      * remove raw kernels
      
      * add cuda kernel impl
      
      * add infermeta
      
      * recover accidentally deleted kernels in interpolate op
      
      * fix grad x_grad name error
      
      * remove interpolate_v2_op.h
      
      * rm unused codes
      
      * fix xpu build error
      
      * fix build error
      
      * fix namespace error
      
      * add register header for nup
      
      * fix infermeta error
      
      * modify by review
      
      * add the missing args in test_trt_convert_nearest_interp_v2
      d65a7a46
  14. 31 3月, 2022 2 次提交
  15. 30 3月, 2022 3 次提交
  16. 29 3月, 2022 3 次提交
  17. 28 3月, 2022 1 次提交
    • H
      Move some activation to phi (#40727) · e77a947e
      hong 提交于
      * update
      
      * add forward case
      
      * update
      
      * update; test=develop
      
      * add some grad kernel; test=develop
      
      * move gpu kernel; test=develop
      
      * update
      
      * update;
      
      * update test;
      
      * fix selected rows bug;
      
      * add mix vector include ;
      
      * add mixed vector depen; test=develop
      
      * add logit grad signature;
      
      * polish code
      
      * fix bug;
      
      * add namespace for abs
      
      * revert code
      
      * not move softsign
      
      * revmove duplate register;
      
      * fix softsign bug
      
      * polish code
      
      * format
      
      * format
      
      * fix bug
      
      * remove cmake dep
      
      * add square sqrt selected rows support
      
      * update
      
      * remove clip norm
      
      * add standalone executor sqrt dep
      
      * standalone exec denp sqrt
      
      * remove sqrt op in cmkaelist
      
      * open some case
      e77a947e
  18. 27 3月, 2022 1 次提交
    • H
      Move slice to phi (#40736) · b8236b7b
      hong 提交于
      * move slice to pten
      
      * merge develop; test=develop
      
      * fix slice bug;
      
      * update
      
      * update
      
      * fix error
      
      * update
      
      * fix bug
      
      * polish code
      
      * polish code
      
      * polish code
      
      * try to fix windows bug
      
      * add gpu compile flag;
      
      * try to fix
      
      * remov template;
      
      * polish code;
      
      * fix npu bug;
      
      * fix npu bug
      
      * fix npu bug; test=develop
      
      * fix slice bug;
      
      * remove no need dep
      b8236b7b
  19. 26 3月, 2022 1 次提交
  20. 25 3月, 2022 5 次提交
  21. 24 3月, 2022 2 次提交
  22. 23 3月, 2022 4 次提交