1. 12 3月, 2021 3 次提交
  2. 11 3月, 2021 1 次提交
  3. 10 3月, 2021 4 次提交
  4. 09 3月, 2021 1 次提交
  5. 08 3月, 2021 3 次提交
    • L
      [NPU] add npu kernel for communication op (#31437) · 15823bb0
      lw921014 提交于
      * add allreduce and broadcast without test
      
      * add c_broadcast_test case
      
      * build c_comm_init and c_create_group operators
      
      * make the whole thing compile
      
      * add broadcast and init op test case but run failed
      
      * make unit test compile
      
      * fix broadcast test bug and change into hcom for ccl
      
      * change c_comm_init and c_create_group ops accordingly
      
      * make tests compile
      
      * transfer code to 27
      
      * compiled successfully in 28, but run failed
      
      * test broadcast in 28, but failed
      
      * make hcom primitives work
      
      * change hccl data type for base.h
      
      * fix broadcast bug
      
      * make attributes work
      
      * fix group name bug
      
      * add allreduce but test failed
      
      * allreduce bug for qiuliang
      
      * allreduce finished
      
      * add allgather and reducescatter
      
      * merge all op code
      
      * add allgather test
      
      * finish run all ccl op test exclude send/recv
      
      * all all op and test exclude send/recv
      
      * send_v2_npu.cc recv_v2_npiu.cc compiled
      
      * fix ccl core dump bug and test allgather, reducescatter, broadcast op
      
      * fix allreduce bug just for test
      
      * hcom send&recv test pass, without hcom_destroy
      
      * for qiuliang test
      
      * Ascend Send&Recv Test Pass
      
      * all op (ex send/recv) ok
      
      * fix bug
      
      * merge all ccl op
      
      * style merge to PaddlePaddle
      
      * merge style
      
      * new merge style
      
      * merge style 2
      
      * insert an empty at the end
      
      * disable ctest for hcom to pass ci
      Co-authored-by: Nvoid-main <voidmain1313113@gmail.com>
      Co-authored-by: Nf2hkop <f2huestc@outlook.com>
      15823bb0
    • R
      [NPU] squeeze and unsqueeze op for ascend (#31452) · 388c69f2
      Reventon_L 提交于
      Co-authored-by: Nroot <xiayanming@baidu.com>
      388c69f2
    • L
      c956c035
  6. 05 3月, 2021 1 次提交
  7. 04 3月, 2021 4 次提交
  8. 02 3月, 2021 3 次提交
  9. 01 3月, 2021 4 次提交
  10. 26 2月, 2021 3 次提交
  11. 25 2月, 2021 3 次提交
  12. 23 2月, 2021 3 次提交
  13. 22 2月, 2021 1 次提交
  14. 20 2月, 2021 1 次提交
  15. 18 2月, 2021 1 次提交
  16. 09 2月, 2021 3 次提交
    • L
      [feature] support npu allocator, part 2 (#30972) · 1201cd2e
      Leo Chen 提交于
      * support npu allocator
      
      * add npu device context
      
      * fix some compile problem
      
      * fix some compile problem
      
      * add npu info
      
      * compile ok
      
      * fix include dir
      
      * support naive_best_fit_allocator
      
      * run ut ok, bug failed to exit
      
      * call aclrtResetDevice before exit
      
      * fix aclFinilize
      
      * add system allocatot test
      
      * add selected_gpus in gtest
      
      * add tensor_test for npu
      
      * support npu op, initial commit
      
      * add npu stream
      
      * add elementwise_add_op
      
      * compile ok
      
      * fix typo
      
      * fix elementwise_add_op_npu_test
      
      * support op run
      
      * test can run but failed
      
      * change aclopExecuteV2 to aclopCompileAndExecute
      1201cd2e
    • L
      [feature] support npu operator (#30951) · 7e049108
      Leo Chen 提交于
      [feature] support npu operator
      7e049108
    • L
      [feature] support npu allocator (#30840) · 81138239
      Leo Chen 提交于
      [feature] support npu allocator
      81138239
  17. 08 2月, 2021 1 次提交