1. 01 4月, 2021 1 次提交
    • L
      [NPU] support npu profiler (#31684) · 6503ef56
      Leo Chen 提交于
      * support npu profiler
      
      * add python api
      
      * fix bugs
      
      * add wrapper for incomplete type
      
      * update profile proto
      
      * record npu wait
      
      * add xpu placeholder
      6503ef56
  2. 23 3月, 2021 1 次提交
  3. 08 3月, 2021 2 次提交
    • L
      [NPU] add npu kernel for communication op (#31437) · 15823bb0
      lw921014 提交于
      * add allreduce and broadcast without test
      
      * add c_broadcast_test case
      
      * build c_comm_init and c_create_group operators
      
      * make the whole thing compile
      
      * add broadcast and init op test case but run failed
      
      * make unit test compile
      
      * fix broadcast test bug and change into hcom for ccl
      
      * change c_comm_init and c_create_group ops accordingly
      
      * make tests compile
      
      * transfer code to 27
      
      * compiled successfully in 28, but run failed
      
      * test broadcast in 28, but failed
      
      * make hcom primitives work
      
      * change hccl data type for base.h
      
      * fix broadcast bug
      
      * make attributes work
      
      * fix group name bug
      
      * add allreduce but test failed
      
      * allreduce bug for qiuliang
      
      * allreduce finished
      
      * add allgather and reducescatter
      
      * merge all op code
      
      * add allgather test
      
      * finish run all ccl op test exclude send/recv
      
      * all all op and test exclude send/recv
      
      * send_v2_npu.cc recv_v2_npiu.cc compiled
      
      * fix ccl core dump bug and test allgather, reducescatter, broadcast op
      
      * fix allreduce bug just for test
      
      * hcom send&recv test pass, without hcom_destroy
      
      * for qiuliang test
      
      * Ascend Send&Recv Test Pass
      
      * all op (ex send/recv) ok
      
      * fix bug
      
      * merge all ccl op
      
      * style merge to PaddlePaddle
      
      * merge style
      
      * new merge style
      
      * merge style 2
      
      * insert an empty at the end
      
      * disable ctest for hcom to pass ci
      Co-authored-by: Nvoid-main <voidmain1313113@gmail.com>
      Co-authored-by: Nf2hkop <f2huestc@outlook.com>
      15823bb0
    • L
      c956c035
  4. 01 3月, 2021 1 次提交
  5. 23 2月, 2021 2 次提交
  6. 09 2月, 2021 3 次提交
    • L
      [feature] support npu allocator, part 2 (#30972) · 1201cd2e
      Leo Chen 提交于
      * support npu allocator
      
      * add npu device context
      
      * fix some compile problem
      
      * fix some compile problem
      
      * add npu info
      
      * compile ok
      
      * fix include dir
      
      * support naive_best_fit_allocator
      
      * run ut ok, bug failed to exit
      
      * call aclrtResetDevice before exit
      
      * fix aclFinilize
      
      * add system allocatot test
      
      * add selected_gpus in gtest
      
      * add tensor_test for npu
      
      * support npu op, initial commit
      
      * add npu stream
      
      * add elementwise_add_op
      
      * compile ok
      
      * fix typo
      
      * fix elementwise_add_op_npu_test
      
      * support op run
      
      * test can run but failed
      
      * change aclopExecuteV2 to aclopCompileAndExecute
      1201cd2e
    • L
      [feature] support npu operator (#30951) · 7e049108
      Leo Chen 提交于
      [feature] support npu operator
      7e049108
    • L
      [feature] support npu allocator (#30840) · 81138239
      Leo Chen 提交于
      [feature] support npu allocator
      81138239
  7. 03 2月, 2021 1 次提交
  8. 27 1月, 2021 1 次提交
  9. 21 1月, 2021 1 次提交
  10. 15 1月, 2021 3 次提交
  11. 14 1月, 2021 2 次提交
  12. 12 1月, 2021 3 次提交
  13. 11 1月, 2021 1 次提交
  14. 07 1月, 2021 1 次提交
  15. 06 1月, 2021 1 次提交
  16. 04 1月, 2021 1 次提交
  17. 28 12月, 2020 2 次提交
  18. 26 12月, 2020 1 次提交
  19. 24 12月, 2020 2 次提交
  20. 19 12月, 2020 1 次提交
  21. 16 12月, 2020 2 次提交
    • T
      update activation op on kunlun (#29577) · af8ded77
      TTerror 提交于
      * fix expand && concat/transpose to new api
      
      * update xpu_header
      
      * update activation op on kunlun
      
      * update activation op on kunlun
      
      * update activation op on kunlun
      
      * update activation op on kunlun
      
      * update activation op on kunlun
      
      * add nearest_interp on kunlun
      
      * update error message
      af8ded77
    • Y
      添加rocm平台支持代码 (#29342) · 76738504
      Y_Xuan 提交于
      * 添加rocm平台支持代码
      
      * 修改一些问题
      
      * 修改一些歧义并添加备注
      
      * 修改代码格式
      
      * 解决冲突后的代码修改
      
      * 修改operators.cmake
      
      * 修改格式
      
      * 修正错误
      
      * 统一接口
      
      * 修改日期
      76738504
  22. 15 12月, 2020 1 次提交
    • Y
      New UT should not exceed 15s (#29492) · 2926e743
      YUNSHEN XIE 提交于
      * added UT should not exceed 15s
      
      * fix error
      
      * UT limit of 15s is the first to be executed
      
      * fix error
      
      * fix error with CI_SKIP_CPP_TEST
      
      * modfied tiemout setting
      
      * fix error
      2926e743
  23. 14 12月, 2020 1 次提交
  24. 11 12月, 2020 2 次提交
    • W
      update for xpu ci. (#29568) · 740c0d58
      Wilber 提交于
      740c0d58
    • L
      Add the strategy of skipping cc/cu test compilation and execution in CI (#29499) · b5d4a1f3
      LoveAn 提交于
      * Add the strategy of skipping cc/cu test compilation and execution in CI, test=develop
      
      * fix if error with CI_SKIP_TEST, test=develop
      
      * fix add properties to test error on Linux/MAC, test=develop
      
      * fix set test properties of test_code_generator error, test=develop
      
      * remove test codes and advance judgment of file modification on Linux, test=develop
      
      * rename CI_SKIP_TEST to CI_SKIP_CPP_TEST, test=document_fix
      
      * Add branch judgement on Linux, test=develop
      b5d4a1f3
  25. 10 12月, 2020 1 次提交
  26. 09 12月, 2020 1 次提交
  27. 08 12月, 2020 1 次提交