1. 07 4月, 2022 3 次提交
  2. 06 4月, 2022 1 次提交
  3. 05 4月, 2022 2 次提交
  4. 04 4月, 2022 2 次提交
  5. 03 4月, 2022 1 次提交
    • C
      [Phi]Concat grad (#41112) · 3f57ef7a
      chentianyu03 提交于
      * add concat_grad kernel
      
      * fix error
      
      * remove comment code
      
      * fix outs nullptr error
      
      * change to phi header
      
      * add concat_grad declare for standalone_executor_test
      3f57ef7a
  6. 02 4月, 2022 5 次提交
  7. 01 4月, 2022 6 次提交
    • L
      fix mac c++ version (#41172) · a2c01db1
      liutiexing 提交于
      * fix mac c++ version
      
      * update
      
      * fix apple systems
      a2c01db1
    • C
      [Phi] Move softmax with cross entropy kernel into phi (#40832) · e6ec98fe
      Chen Weihang 提交于
      * add cross_entropy_with_softmax phi kernel
      
      * remove softmax_with_cross_entropy kernel
      
      * add softmax_with_cross_entropy grad kernel
      
      * remove original op kernel
      
      * refine cross entropy impl
      
      * fix pointer error
      
      * revert kernel cu change
      
      * fix xpu failed
      
      * fix cinn failed
      
      * fix npu failed
      
      * add forward sig
      
      * add check_nan_inf for pt kernel
      
      * remove repeat cmake item
      
      * fix unittest error
      e6ec98fe
    • C
      [Phi]Interploatd kernels into phi (#40855) · d65a7a46
      chentianyu03 提交于
      * add interploate cpu kernel
      
      * fix nullptr bug
      
      * add interpolate gpu kernel
      
      * fix unit test error
      
      * remove raw kernels
      
      * add cuda kernel impl
      
      * add infermeta
      
      * recover accidentally deleted kernels in interpolate op
      
      * fix grad x_grad name error
      
      * remove interpolate_v2_op.h
      
      * rm unused codes
      
      * fix xpu build error
      
      * fix build error
      
      * fix namespace error
      
      * add register header for nup
      
      * fix infermeta error
      
      * modify by review
      
      * add the missing args in test_trt_convert_nearest_interp_v2
      d65a7a46
    • Z
      [GPUPS]fix CMakeLists with pslib (#41225) · 4da4265a
      zmxdream 提交于
      * fix cmake. test=develop
      
      * fix. test=develop
      
      * fix dep for graphs_ps_gpu. test=develop
      
      * update. test=develop
      
      * update. test=develop
      4da4265a
    • A
      [custom kernel] support fallback (#41212) · 9c2a9afd
      Aganlengzi 提交于
      9c2a9afd
    • L
      [new-exec] move WaitEvent/RecordEvent into try-catch (#41222) · 5dae6da0
      Leo Chen 提交于
      * move WaitEvent/RecordEvent into try-catch
      
      * refine supportNpu
      5dae6da0
  8. 31 3月, 2022 8 次提交
  9. 30 3月, 2022 6 次提交
  10. 29 3月, 2022 3 次提交
  11. 28 3月, 2022 3 次提交
    • H
      Move some activation to phi (#40727) · e77a947e
      hong 提交于
      * update
      
      * add forward case
      
      * update
      
      * update; test=develop
      
      * add some grad kernel; test=develop
      
      * move gpu kernel; test=develop
      
      * update
      
      * update;
      
      * update test;
      
      * fix selected rows bug;
      
      * add mix vector include ;
      
      * add mixed vector depen; test=develop
      
      * add logit grad signature;
      
      * polish code
      
      * fix bug;
      
      * add namespace for abs
      
      * revert code
      
      * not move softsign
      
      * revmove duplate register;
      
      * fix softsign bug
      
      * polish code
      
      * format
      
      * format
      
      * fix bug
      
      * remove cmake dep
      
      * add square sqrt selected rows support
      
      * update
      
      * remove clip norm
      
      * add standalone executor sqrt dep
      
      * standalone exec denp sqrt
      
      * remove sqrt op in cmkaelist
      
      * open some case
      e77a947e
    • W
      delete commonsparsetable and communicator from gpups (#40973) · 630f5b89
      wangguanqun 提交于
      * trainer and worker
      
      * delete commonsparsetable from gpups
      
      * delete vlog
      
      * codestyle
      
      * delete communicator from gpups
      630f5b89
    • J
      [Auto parallel] Mixed Precision FP16 Pass (#40615) · b99c1d07
      JZ-LIANG 提交于
      *  add FP16 Pass 
      
      * Support the auto completion of while_op
      
      *  acc aligned
      b99c1d07