1. 05 Apr, 2022 1 commit
    • [new-exec] enable the new standalone executor by default (#41179) · 93ea1297
      Committed by Leo Chen
      * enable new executor by default
      
      * enable stream safe allocator
      
      * test=document_fix;test=coverage
      
      * do not use scope in op kernel
      
      * fit empty program for new executor
      
      * fix communication depend
      
      * fix test_sync_batch_norm
      
      * skip unsupported place
      
      * refine datatransfer
      
      * fit for distributed program
      
      * fix dependency
      
      * fix some ut
  2. 04 Apr, 2022 2 commits
  3. 03 Apr, 2022 1 commit
    • [Phi]Concat grad (#41112) · 3f57ef7a
      Committed by chentianyu03
      * add concat_grad kernel
      
      * fix error
      
      * remove commented-out code
      
      * fix outs nullptr error
      
      * change to phi header
      
      * add concat_grad declare for standalone_executor_test
  4. 02 Apr, 2022 5 commits
  5. 01 Apr, 2022 6 commits
    • fix mac c++ version (#41172) · a2c01db1
      Committed by liutiexing
      * fix mac c++ version
      
      * update
      
      * fix apple systems
    • [Phi] Move softmax with cross entropy kernel into phi (#40832) · e6ec98fe
      Committed by Chen Weihang
      * add cross_entropy_with_softmax phi kernel
      
      * remove softmax_with_cross_entropy kernel
      
      * add softmax_with_cross_entropy grad kernel
      
      * remove original op kernel
      
      * refine cross entropy impl
      
      * fix pointer error
      
      * revert kernel cu change
      
      * fix xpu failed
      
      * fix cinn failed
      
      * fix npu failed
      
      * add forward sig
      
      * add check_nan_inf for pt kernel
      
      * remove repeated cmake item
      
      * fix unittest error
    • [Phi]Interploatd kernels into phi (#40855) · d65a7a46
      Committed by chentianyu03
      * add interpolate cpu kernel
      
      * fix nullptr bug
      
      * add interpolate gpu kernel
      
      * fix unit test error
      
      * remove raw kernels
      
      * add cuda kernel impl
      
      * add infermeta
      
      * recover accidentally deleted kernels in interpolate op
      
      * fix grad x_grad name error
      
      * remove interpolate_v2_op.h
      
      * rm unused codes
      
      * fix xpu build error
      
      * fix build error
      
      * fix namespace error
      
      * add register header for npu
      
      * fix infermeta error
      
      * modify by review
      
      * add the missing args in test_trt_convert_nearest_interp_v2
    • [GPUPS]fix CMakeLists with pslib (#41225) · 4da4265a
      Committed by zmxdream
      * fix cmake. test=develop
      
      * fix. test=develop
      
      * fix dep for graphs_ps_gpu. test=develop
      
      * update. test=develop
      
      * update. test=develop
    • [custom kernel] support fallback (#41212) · 9c2a9afd
      Committed by Aganlengzi
    • [new-exec] move WaitEvent/RecordEvent into try-catch (#41222) · 5dae6da0
      Committed by Leo Chen
      * move WaitEvent/RecordEvent into try-catch
      
      * refine supportNpu
  6. 31 Mar, 2022 8 commits
  7. 30 Mar, 2022 6 commits
  8. 29 Mar, 2022 3 commits
  9. 28 Mar, 2022 5 commits
  10. 27 Mar, 2022 3 commits
    • [new-exec] fit for mkldnn and inplace op (#40955) · afa0e82c
      Committed by Leo Chen
      * fit for mkldnn and inplace op
      
      * fix compile
      
      * refine ut
      
      * register op version
      
      * fix inplace op
      
      * fix transfer_layout
    • add check of data type and support mutable_data with compiled infos (#40920) · 6a94adbe
      Committed by TeFeng Chen
      * support check data type and mutable_data with compiled infos in paddle with cinn
      
      * update cinn_instruction_run_op_test with multi data type
    • Move slice to phi (#40736) · b8236b7b
      Committed by hong
      * move slice to pten
      
      * merge develop; test=develop
      
      * fix slice bug;
      
      * update
      
      * update
      
      * fix error
      
      * update
      
      * fix bug
      
      * polish code
      
      * polish code
      
      * polish code
      
      * try to fix windows bug
      
      * add gpu compile flag;
      
      * try to fix
      
      * remove template;
      
      * polish code;
      
      * fix npu bug;
      
      * fix npu bug
      
      * fix npu bug; test=develop
      
      * fix slice bug;
      
      * remove no need dep