1. 21 1月, 2022 1 次提交
    • W
      [PTEN] Add cpu context (#38979) · 064bc4b8
      Wilber 提交于
      * add cpu_context.
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix ci problem
      
      * fix npu ci problem
      
      * update
      
      * fix ci compile
      064bc4b8
  2. 18 1月, 2022 1 次提交
  3. 15 1月, 2022 1 次提交
  4. 04 2月, 2021 1 次提交
  5. 24 9月, 2020 1 次提交
    • W
      use iwyu clean include (#27267) · df43905f
      wanghuancoder 提交于
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
      df43905f
  6. 02 9月, 2020 1 次提交
    • W
      Add FetchAsyncOpHandle, and use it in FastThreadedExecutor (#26643) · 2d2c31a6
      wanghuancoder 提交于
      * optimized transformation form tensor to numpy, test=develop
      
      * Modify fetch op handle, from memcpy Sync to memcpy Async, test=develop
      
      * modify CUDAPinnedPlace to CPUPlace, test=develop
      
      * modify CPUPlace to CUDAPinnedPlace, and set default inplace to false, test=develop
      
      * revert fetch_op_handle, add fetch_async_op_handle, test=develop
      
      * revert fetch_op_handle, add fetch_async_op_handle, test=develop
      
      * fix error msg report, test=develop
      
      * fix bug in cpuplace, test=develop
      
      * fix bug in unmerge and tensorarray modle, test=develop
      
      * fix bug, double copy gpu memory, test=develop
      
      * fix chenweihang¡¯s review advice, test=develop
      2d2c31a6
  7. 19 4月, 2020 1 次提交
    • G
      Support LoDTensorArray in fetch (#23645) · 2b896c1f
      guofei 提交于
      * Support LoDTEnsorArray in fetch op
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      2b896c1f
  8. 02 3月, 2020 1 次提交
    • Z
      Unmerged fetch list (#22635) · 89cfa491
      Zhen Wang 提交于
      * update ScopeBufferedSSAGraphExecutor&AsyncSSAGraphExecutor&ThreadedSSAGraphExecutor&FastThreadedSSAGraphExecutor&ParallelSSAGraphExecutor&ParallelExecutor for fetching unmerged results.
      
      * add the unit test for fetch_unmerged.
      
      * update ut for multi-card and multi-cpu.
      
      * add the error message and the user suggestion in FetchOpHandle. test=develop
      89cfa491
  9. 11 7月, 2019 1 次提交
    • Z
      Feature/buffer_shared_inplace (#17911) · d3003a16
      Zeng Jinle 提交于
      * feature/buffer_shared_inplace, test=develop
      
      * refine code, test=develop
      
      * fix elementwise_add op cpu inplace and sum inplace bug, test=develop
      
      * add unittest and debug log, test=develop
      
      * fix parallel_executor scope bug, polish code, test=develop
      
      * fix sum op, activation op, single_in_place_inference bug, test=develop
      
      * remove kLocalExecScopeName, test=develop
      
      * fix unittest,test=develop
      
      * fix out_var first version bug, test=develop
      
      * follow comments,test=develop
      d3003a16
  10. 22 3月, 2019 1 次提交
    • C
      [Speed]Refine ParallelExecutor (#16190) · a6a3b2fb
      chengduo 提交于
      * refine parallelExecutor
      test=develop
      
      * Polish op_handle
      test=develop
      
      * Remove unnecessary op_handle
      test=develop
      
      * Fix Travis CI
      test=develop
      
      * Fix fetch bug
      test=develop
      
      * Remove WaitInputVarGenerated
      
      * Fix OpHandleBase::Run
      test=develop
      
      * debug
      test=develop
      
      * use origin fetch_op_handle
      test=develop
      
      * Revert op_handle_base.cc
      test=develop
      
      * Polish code
      test=develop
      
      * Fix OpHandleBase::Run
      test=develop
      
      * code refine
      
      * test CI and CE
      test=develop
      
      * fix OpHandle::Run
      test=develop
      
      * refine AllReduceOpHandle
      test=develop
      
      * Polish code
      test=develop
      a6a3b2fb
  11. 10 9月, 2018 2 次提交
  12. 18 7月, 2018 1 次提交
  13. 11 5月, 2018 1 次提交
  14. 09 5月, 2018 1 次提交
  15. 19 4月, 2018 1 次提交
  16. 26 3月, 2018 1 次提交
  17. 21 3月, 2018 3 次提交
  18. 07 3月, 2018 2 次提交
  19. 06 3月, 2018 2 次提交
  20. 15 2月, 2018 1 次提交
    • Y
      Update tensor_util.h (#8422) · cfffb1a3
      Yi Wang 提交于
      * Update tensor_util.h
      
      * Update with moved TensorDesc
      
      * Fix tensur_utils.cu
      
      * Update
      
      * Update
      
      * Update
      
      * Update
      
      * Make tensor_util.cu a symbolic link
      cfffb1a3
  21. 10 2月, 2018 2 次提交
  22. 07 2月, 2018 1 次提交
  23. 06 2月, 2018 2 次提交
  24. 01 2月, 2018 1 次提交
  25. 31 1月, 2018 1 次提交
  26. 30 1月, 2018 1 次提交