1. 12 4月, 2022 2 次提交
    • L
      [KP] Add Logical/compare/bitwise registry & UT (#40802) · 3749198e
      Lijunhui 提交于
      * init commit no push
      
      * collect comile errors
      
      * bitwise UT
      
      * fix compile problem
      
      * cancel comments
      
      * restore miss deletion
      
      * fix compilation
      
      * fix UT
      
      * NO stash in multiple branch at the same times
      
      * fix error
      
      * combine .cu from gpu and kps
      
      * replace gpu by kps
      
      * fix by Chen-weihang
      
      * Revert "Fix kps compile error in Junhui logic compare bitwise"
      
      * fix backend test
      
      * rm comments
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      3749198e
    • F
      add a inner loop for index_select_grad_init() in index_select op when dealing... · bc01242b
      FlyingQianMM 提交于
      add a inner loop for index_select_grad_init() in index_select op when dealing with large-shape data (#41563)
      
      * replace for with CUDA_KERNEL_LOOP for index_select_grad_init() in index_select op
      
      * use CUDA_KERNEL_LOOP_TYPE
      
      * fix code style
      
      * replace index_select_grad_init with SetConstant
      bc01242b
  2. 07 4月, 2022 1 次提交
  3. 03 4月, 2022 1 次提交
    • F
      add maximum limit for grid of index_select (#41127) · af8d2482
      FlyingQianMM 提交于
      * limit grid dim for index select
      
      * mv LimitGridDim into gpu_launch_config.h
      
      * fix conflicts
      
      * fix conflicts
      
      * fix code style
      
      * set block to 256
      
      * fix grid setting
      
      * set dtype of block_dim to unsigned int
      af8d2482
  4. 02 4月, 2022 2 次提交
  5. 01 4月, 2022 1 次提交
    • C
      [Phi]Interploatd kernels into phi (#40855) · d65a7a46
      chentianyu03 提交于
      * add interploate cpu kernel
      
      * fix nullptr bug
      
      * add interpolate gpu kernel
      
      * fix unit test error
      
      * remove raw kernels
      
      * add cuda kernel impl
      
      * add infermeta
      
      * recover accidentally deleted kernels in interpolate op
      
      * fix grad x_grad name error
      
      * remove interpolate_v2_op.h
      
      * rm unused codes
      
      * fix xpu build error
      
      * fix build error
      
      * fix namespace error
      
      * add register header for nup
      
      * fix infermeta error
      
      * modify by review
      
      * add the missing args in test_trt_convert_nearest_interp_v2
      d65a7a46
  6. 31 3月, 2022 2 次提交
  7. 30 3月, 2022 3 次提交
  8. 29 3月, 2022 3 次提交
  9. 28 3月, 2022 1 次提交
    • H
      Move some activation to phi (#40727) · e77a947e
      hong 提交于
      * update
      
      * add forward case
      
      * update
      
      * update; test=develop
      
      * add some grad kernel; test=develop
      
      * move gpu kernel; test=develop
      
      * update
      
      * update;
      
      * update test;
      
      * fix selected rows bug;
      
      * add mix vector include ;
      
      * add mixed vector depen; test=develop
      
      * add logit grad signature;
      
      * polish code
      
      * fix bug;
      
      * add namespace for abs
      
      * revert code
      
      * not move softsign
      
      * revmove duplate register;
      
      * fix softsign bug
      
      * polish code
      
      * format
      
      * format
      
      * fix bug
      
      * remove cmake dep
      
      * add square sqrt selected rows support
      
      * update
      
      * remove clip norm
      
      * add standalone executor sqrt dep
      
      * standalone exec denp sqrt
      
      * remove sqrt op in cmkaelist
      
      * open some case
      e77a947e
  10. 27 3月, 2022 1 次提交
    • H
      Move slice to phi (#40736) · b8236b7b
      hong 提交于
      * move slice to pten
      
      * merge develop; test=develop
      
      * fix slice bug;
      
      * update
      
      * update
      
      * fix error
      
      * update
      
      * fix bug
      
      * polish code
      
      * polish code
      
      * polish code
      
      * try to fix windows bug
      
      * add gpu compile flag;
      
      * try to fix
      
      * remov template;
      
      * polish code;
      
      * fix npu bug;
      
      * fix npu bug
      
      * fix npu bug; test=develop
      
      * fix slice bug;
      
      * remove no need dep
      b8236b7b
  11. 26 3月, 2022 1 次提交
  12. 25 3月, 2022 5 次提交
  13. 24 3月, 2022 2 次提交
  14. 23 3月, 2022 5 次提交
  15. 22 3月, 2022 2 次提交
    • H
      Change bn muable data to phi (#40748) · d9a41fc4
      hong 提交于
      * move mutable_data to context alloc
      
      * move mutable_data to context alloc
      
      * remvoe duplicate code
      d9a41fc4
    • H
      Move embedding to phi (#39901) · 0331cfda
      hong 提交于
      * move embeding to phi;
      
      * update sig; test=develop
      
      * move reset impl to phi; test=develop
      
      * remove old register; test=develop
      
      * fix cpu bf16 bug; test=develop
      
      * fix lookup speed error
      
      * polish code
      
      * fix paddle throw type
      0331cfda
  16. 21 3月, 2022 4 次提交
  17. 18 3月, 2022 2 次提交
  18. 17 3月, 2022 2 次提交
    • N
    • H
      Move layer norm to phi (#40193) · 681a6865
      hong 提交于
      * update
      
      * fix bugs; test=develop
      
      * update; test=develop
      
      * fix test compile error; test=develop
      
      * fix cpu compile error; test=develop
      
      * fix test error; test=develo
      
      * fix layer_norm_op plugin error; test=develop
      
      * fix error; test=develop
      
      * fix test bug; test=develop
      
      * update; test=develop
      
      * polish code; test=develop
      
      * fix bugs; test=develop
      
      * remove unused depency; test=develop
      
      * polish code; test=develop
      681a6865