1. 28 Jun, 2021 4 commits
  2. 21 Jun, 2021 1 commit
  3. 11 Jun, 2021 1 commit
  4. 09 Jun, 2021 1 commit
  5. 07 Jun, 2021 1 commit
  6. 29 Apr, 2021 1 commit
  7. 26 Apr, 2021 2 commits
  8. 23 Apr, 2021 1 commit
  9. 21 Apr, 2021 1 commit
    • Z
      【NPU】Merge NPU ccl code (#32381) · c3158527
      zhang wenhui committed
      * add allreduce and broadcast without test (#31024)
      
      add allreduce and broadcast without test
      
      * Refactor HCCLCommContext to be compatible with Paddle (#31359)
      
      Refactor HCCLCommContext to be compatible with Paddle (#31359)
      
      * [NPU] add npu kernel for communication op (#31437)
      
      * add allreduce and broadcast without test
      
      * add c_broadcast_test case
      
      * build c_comm_init and c_create_group operators
      
      * make the whole thing compile
      
      * add broadcast and init op test case but run failed
      
      * make unit test compile
      
      * fix broadcast test bug and change into hcom for ccl
      
      * change c_comm_init and c_create_group ops accordingly
      
      * make tests compile
      
      * transfer code to 27
      
      * compiled successfully in 28, but run failed
      
      * test broadcast in 28, but failed
      
      * make hcom primitives work
      
      * change hccl data type for base.h
      
      * fix broadcast bug
      
      * make attributes work
      
      * fix group name bug
      
      * add allreduce but test failed
      
      * allreduce bug for qiuliang
      
      * allreduce finished
      
      * add allgather and reducescatter
      
      * merge all op code
      
      * add allgather test
      
      * finish run all ccl op test exclude send/recv
      
      * all all op and test exclude send/recv
      
      * send_v2_npu.cc recv_v2_npiu.cc compiled
      
      * fix ccl core dump bug and test allgather, reducescatter, broadcast op
      
      * fix allreduce bug just for test
      
      * hcom send&recv test pass, without hcom_destroy
      
      * for qiuliang test
      
      * Ascend Send&Recv Test Pass
      
      * all op (ex send/recv) ok
      
      * fix bug
      
      * merge all ccl op
      
      * style merge to PaddlePaddle
      
      * merge style
      
      * new merge style
      
      * merge style 2
      
      * insert an empty at the end
      
      * disable ctest for hcom to pass ci
      Co-authored-by: void-main <voidmain1313113@gmail.com>
      Co-authored-by: f2hkop <f2huestc@outlook.com>
      
      * Add auto-increasing tag id for Hcom OPs (#31702)
      
      * add c_reduce_sum op (#31793)
      
      add c_reduce_sum op
      
      * update Ascendrc hccl to 20.3 (#32126)
      
      update Ascendrc hccl to 20.3 (#32126)
      
      * fix merge code
      
      * change cmake.txt1
      
      * [NPU] Support npu kernel for c sync stream op (#31386)
      
      * sync stream npu op
      
      * add with_ascend_acl
      
      * update c++ unittest
      
      * compile all failed
      
      * try to pre commit
      
      * after pre commit
      
      * merge&compile&test hccl successfully!
      
      * fix code style
      
      * fix code style
      
      * fix bugs about hccl
      
      * fix some bugs
      
      * fix code style
      
      * fix style
      
      * fix style
      
      * fix
      
      * fixed
      
      * merge develop
      Co-authored-by: lw921014 <liuwei921014@yeah.net>
      Co-authored-by: Void Main <voidmain1313113@gmail.com>
      Co-authored-by: f2hkop <f2huestc@outlook.com>
      Co-authored-by: xiayanming <41795079@qq.com>
  10. 15 Apr, 2021 1 commit
    • J
      Update hapi to support AMP (#31417) · fabdb43c
      Jiaqi Liu committed
      * make hapi support amp, and add unittest
      
      * make unittest only support GPU
      
      * update parameters for amp in hapi.Model
      
      * update hapi.Model.prepare interface, and update unittest
      
      * fix test_model.py unittest bug
      
      * add grad clear in dygraph
      
      * use_fp16_guard defaults to True, which could avoid nan
      
      * add input check, and add internal doc link to low level api
      
      * update doc, and decrease the sample num of dataset to avoid timeout
      
      * make hapi amp param support str 'O1' or 'O2'
      
      * resume calling , modify the code of the check part
      
      * upgrade the usage of Fleet API, and disable 'pure_fp16' param
  11. 11 Jan, 2021 1 commit
  12. 07 Jan, 2021 1 commit
  13. 27 Nov, 2020 2 commits
    • S
      Support dynamic graph distributed (#28997) · e2d01eb6
      ShenLiang committed
      * add reducer
      
      * refine event for memorycopy
      
      * add concat&split for allreduce
      
      * apply concat & split for fuse tensor
      
      * fix nccl dep
      
      * fix the unittest, compile problem and ddp initialize problem
      
      * fix unittest for mac & add some comments & solve the repeated param in sublayers
      
      * fix unittest for windows & fix document
    • L
      Enhance logger callback for benchmark (#29106) · 9f53f3d0
      LielinJiang committed
      * enhance logger callback for benchmark
  14. 25 Nov, 2020 1 commit
  15. 23 Nov, 2020 3 commits
    • L
      Add EarlyStopping (#28691) · 70385518
      LiuChiachi committed
      * add early stopping
      
      * add doc for early stopping
      
      * fix sample code bugs
      
      * update infer of mode, update doc, add unittests to increase coverage rate
      
      * fix sample code for early stopping
      
      * update sample code and unittests
      
      * reduce time cost of test_callbacks unittest
      
      * fix model.py code style error
    • L
      Update path name of saving in hapi (#28462) · 8c8b42f2
      LiuChiachi committed
      * update hapi save_inference_model output pathname
      
      * update hapi save_inference_model output pathname
      
      * use new 2.0-api paddle.static.io.load_inference_model
      
      * add unittests to increase coverage rate
    • L
      Add lr scheduler callback for high level api (#28737) · 00e55ded
      LielinJiang committed
      * add lr scheduler
  16. 13 Nov, 2020 1 commit
  17. 30 Oct, 2020 1 commit
  18. 29 Oct, 2020 1 commit
  19. 22 Oct, 2020 1 commit
    • L
      Update hapi predict interface (#28180) · 68449d19
      LielinJiang committed
      * update hapi predict interface
      
      * fix code style
      
      * fix docs
      
      * fix docs
      
      * fix docs
      
      * update docs
      
      * fix codes style
      
      * fix unittest
      
      * fix unittest
      
      * fix coverage
  20. 19 Oct, 2020 1 commit
  21. 15 Oct, 2020 1 commit
  22. 12 Oct, 2020 1 commit
  23. 10 Oct, 2020 1 commit
  24. 30 Sep, 2020 1 commit
    • L
      Remove Input requirement in dygraph for Model (#27557) · d15cbe70
      LiuChiachi committed
      * remove input requirement in dygraph Model
      
      * correct unittest
      
      * update save inference model in dygraph without input
      
      * fix unittests for test_model.py
      
      * solve conflicts
      
      * solve conflicts
      
      * delete http.log
      
      * fix test_model.py bug, correct initialization of MyModel
      
      * fix unittests bugs
      
      * set paddle manual seed for unittest
      
      * fix Model bugs, because inputs can be list or dict when it is provided.
      
      * add random seed for test_export_deploy_model
      
      * delete redundant codes, because  calls
      
      * Code optimization, error information optimization
  25. 25 Sep, 2020 1 commit
  26. 24 Sep, 2020 1 commit
  27. 17 Sep, 2020 2 commits
  28. 04 Sep, 2020 1 commit
  29. 03 Sep, 2020 1 commit
  30. 02 Sep, 2020 1 commit
    • L
      Correct 2.0 API usage in hapi.model.load (#26829) · 9ee4e3dc
      LiuChiachi committed
      * replace fluid.optimizer.set_dict with optimizer.set_state_dict
      
      * replace fluid.optimizer.set_dict with optimizer.set_state_dict
      
      * add coverage rate
      
      * Increase coverage rate, fix code style
      
      * Increase coverage rate, fix code style
      
      * add fit to generate optimizer.state_dict() to save .pdopt to increase coverage rate
      
      * delete http.log
  31. 31 Aug, 2020 1 commit
    • Q
      Move hapi to python/paddle root dir. (#26442) · f7fb4c22
      qingqing01 committed
      * Move hapi from paddle/incubate to paddle
      
      * Remove vision/datasets/utils.py and clean code
      
      * Add sample code for conll05
      
      * Print pull path when saving model
      
      * Fix sample code after paramter_list of SGD is changed to parameters
      
      * Fix bug in wmt16 dataset
  32. 28 Aug, 2020 1 commit
    • L
      fix sample code for hapi.model.save (#26667) · 25dad426
      LiuChiachi committed
      * fix sample code for hapi.model.save, test=document_fix
      
      * test=document_fix
      
      * update usage of 2.0 API, test=document_fix
      
      * fix bugs, return dygraph back to users while using model.save in dygraph
      
      * fix code style