1. 21 4月, 2021 1 次提交
    • Z
      【NPU】Merge NPU ccl code (#32381) · c3158527
      zhang wenhui 提交于
      * add allreduce and broadcast without test (#31024)
      
      add allreduce and broadcast without test
      
      * Refactor HCCLCommContext to be compatible with Paddle (#31359)
      
      Refactor HCCLCommContext to be compatible with Paddle (#31359)
      
      * [NPU] add npu kernel for communication op (#31437)
      
      * add allreduce and broadcast without test
      
      * add c_broadcast_test case
      
      * build c_comm_init and c_create_group operators
      
      * make the whole thing compile
      
      * add broadcast and init op test case but run failed
      
      * make unit test compile
      
      * fix broadcast test bug and change into hcom for ccl
      
      * change c_comm_init and c_create_group ops accordingly
      
      * make tests compile
      
      * transfer code to 27
      
      * compiled successfully in 28, but run failed
      
      * test broadcast in 28, but failed
      
      * make hcom primitives work
      
      * change hccl data type for base.h
      
      * fix broadcast bug
      
      * make attributes work
      
      * fix group name bug
      
      * add allreduce but test failed
      
      * allreduce bug for qiuliang
      
      * allreduce finished
      
      * add allgather and reducescatter
      
      * merge all op code
      
      * add allgather test
      
      * finish run all ccl op test exclude send/recv
      
      * all all op and test exclude send/recv
      
      * send_v2_npu.cc recv_v2_npiu.cc compiled
      
      * fix ccl core dump bug and test allgather, reducescatter, broadcast op
      
      * fix allreduce bug just for test
      
      * hcom send&recv test pass, without hcom_destroy
      
      * for qiuliang test
      
      * Ascend Send&Recv Test Pass
      
      * all op (ex send/recv) ok
      
      * fix bug
      
      * merge all ccl op
      
      * style merge to PaddlePaddle
      
      * merge style
      
      * new merge style
      
      * merge style 2
      
      * insert an empty at the end
      
      * disable ctest for hcom to pass ci
      Co-authored-by: Nvoid-main <voidmain1313113@gmail.com>
      Co-authored-by: Nf2hkop <f2huestc@outlook.com>
      
      * Add auto-increasing tag id for Hcom OPs (#31702)
      
      * add c_reduce_sum op (#31793)
      
      add c_reduce_sum op
      
      * update Ascendrc hccl to 20.3 (#32126)
      
      update Ascendrc hccl to 20.3 (#32126)
      
      * fix merge code
      
      * change cmake.txt1
      
      * [NPU] Support npu kernel for c sync stream op (#31386)
      
      * sync stream npu op
      
      * add with_ascend_acl
      
      * update c++ unittest
      
      * compile all failed
      
      * try to pre commit
      
      * after pre commit
      
      * merge&compile&test hccl successfully!
      
      * fix code style
      
      * fix code style
      
      * fix bugs about hccl
      
      * fix some bugs
      
      * fix code style
      
      * fix style
      
      * fix style
      
      * fix
      
      * fixed
      
      * merge develop
      Co-authored-by: Nlw921014 <liuwei921014@yeah.net>
      Co-authored-by: NVoid Main <voidmain1313113@gmail.com>
      Co-authored-by: Nf2hkop <f2huestc@outlook.com>
      Co-authored-by: Nxiayanming <41795079@qq.com>
      c3158527
  2. 15 4月, 2021 1 次提交
    • J
      Update hapi to support AMP (#31417) · fabdb43c
      Jiaqi Liu 提交于
      * make hapi support amp, and add unittest
      
      * make unittest only support GPU
      
      * update parameters for amp in hapi.Model
      
      * update hapi.Model.prepare interface, and update unittest
      
      * fix test_model.py unittest bug
      
      * add grad clear in dygraph
      
      * use_fp16_guard defaults to True, which could avoid nan
      
      * add input check, and add internal doc link to low level api
      
      * update doc, and decrease the sample num of dataset to avoid timeout
      
      * make hapi amp param  support str 'O1' or 'O2'
      
      * resume calling , modify the code of the check part
      
      * upgrade the usage of Fleet API, and disable 'pure_fp16' param
      fabdb43c
  3. 11 1月, 2021 1 次提交
  4. 07 1月, 2021 1 次提交
  5. 27 11月, 2020 2 次提交
    • S
      Support dynamic graph distributed (#28997) · e2d01eb6
      ShenLiang 提交于
      * add reducer
      
      * refine envent for memorycopy
      
      * add concat&split for allreduce
      
      * apply concat & split for fuse tensor
      
      * fix nccl dep
      
      * fix the untest, compile problem and ddp initialize problem
      
      * fix untest for mac & add some comments & solve the repeated param in sublayers
      
      * fix untest for windows & fix document
      e2d01eb6
    • L
      Enhance logger callback for benchmark (#29106) · 9f53f3d0
      LielinJiang 提交于
      * enhance logger callback for benchmark
      9f53f3d0
  6. 25 11月, 2020 1 次提交
  7. 23 11月, 2020 3 次提交
    • L
      Add EarlyStopping (#28691) · 70385518
      LiuChiachi 提交于
      * add early stopping
      
      * add doc for early stopping
      
      * fix sample code bugs
      
      * update infer of mode, update doc, add unittests to increase coverage rate
      
      * fix sample code for early stopping
      
      * update sample code and unittests
      
      * reduce time cost of test_callbacks unittest
      
      * fix model.py code style error
      70385518
    • L
      Update path name of saving in hapi (#28462) · 8c8b42f2
      LiuChiachi 提交于
      * update hapi save_inference_model output pathname
      
      * update hapi save_inference_model output pathname
      
      * use new 2.0-api paddle.static.io.load_inference_model
      
      * add unittests to increase coverage rate
      8c8b42f2
    • L
      Add lr scheduler callback for high level api (#28737) · 00e55ded
      LielinJiang 提交于
      * add lr scheduler
      00e55ded
  8. 13 11月, 2020 1 次提交
  9. 30 10月, 2020 1 次提交
  10. 29 10月, 2020 1 次提交
  11. 22 10月, 2020 1 次提交
    • L
      Update hapi predict interface (#28180) · 68449d19
      LielinJiang 提交于
      * update hapi predict interface
      
      * fix code style
      
      * fix docs
      
      * fix docs
      
      * fix docs
      
      * update docs
      
      * fix codes style
      
      * fix unittest
      
      * fix unittest
      
      * fix coverage
      68449d19
  12. 19 10月, 2020 1 次提交
  13. 15 10月, 2020 1 次提交
  14. 12 10月, 2020 1 次提交
  15. 10 10月, 2020 1 次提交
  16. 30 9月, 2020 1 次提交
    • L
      Remove Input requirement in dygraph for Model (#27557) · d15cbe70
      LiuChiachi 提交于
      * remove input requirment in dygraph Model
      
      * correct unittest
      
      * upadte save inference model in dygraph without input
      
      * fix unittets for test_model.py
      
      * solve conflicts
      
      * solve conflicts
      
      * delete http.log
      
      * fix test_model.py bug, correct initialization of MyModel
      
      * fix unittests bugs
      
      * set paddle manual seed for unittest
      
      * fix Model bugs, because inputs can be list or dict when it is provided.
      
      * add random seed for test_export_deploy_model
      
      * delete redundant codes, because  calls
      
      * Code optimization, error information optimization
      d15cbe70
  17. 25 9月, 2020 1 次提交
  18. 24 9月, 2020 1 次提交
  19. 17 9月, 2020 2 次提交
  20. 04 9月, 2020 1 次提交
  21. 03 9月, 2020 1 次提交
  22. 02 9月, 2020 1 次提交
    • L
      Correct 2.0 API usage in hapi.model.load (#26829) · 9ee4e3dc
      LiuChiachi 提交于
      * replace fluid.optimizer.set_dict with optimizer.set_state_dict
      
      * replace fluid.optimizer.set_dict with optimizer.set_state_dict
      
      * add coverage rate
      
      * Increase coverage rate, fix code style
      
      * Increase coverage rate, fix code style
      
      * add fit to generate optimizer.state_dict() to save .pdopt to increase coverage rate
      
      * delete http.log
      9ee4e3dc
  23. 31 8月, 2020 1 次提交
    • Q
      Move hapi to python/paddle root dir. (#26442) · f7fb4c22
      qingqing01 提交于
      * Move hapi form paddle/incubate to paddle
      
      * Remove vision/datasets/utils.py and clean code
      
      * Add sample code for conll05
      
      * Print pull path when saving model
      
      * Fix sample code after paramter_list of SGD is changed to parameters
      
      * Fix bug in wmt16 datase
      f7fb4c22
  24. 28 8月, 2020 1 次提交
    • L
      fix sample code for hapi.model.save (#26667) · 25dad426
      LiuChiachi 提交于
      * fix sample code for hapi.model.save, test=document_fix
      
      * test=document_fix
      
      * update usage of 2.0 API, test=document_fix
      
      * fix bugs, return dygraph back to users while using model.save in dygraph
      
      * fix code style
      25dad426
  25. 27 8月, 2020 1 次提交
  26. 25 8月, 2020 1 次提交
    • L
      Update save inference model to support dygraph (#25894) · 761ed17d
      LiuChiachi 提交于
      * update save_inference_model for hapi
      
      * update save_inference_model to support dygraph
      
      * fix comments
      
      * fix comments
      
      * test=develop
      
      * test, test=develop
      
      * fix dim test, test=develop
      
      * test, test=develop
      
      * add test_export_deploy_model_dynamic
      
      * fix unittest for hapi: save_inference_model
      
      * fix code style
      
      * accept review by guoshengCS
      
      * fix coverage rate
      
      * update doc for save_inference_model and copyright
      
      * change test model back to LeNet() in test_export_deploy_model
      
      * copy jit.save, use LeNet() to test export deploy model
      
      * add return value for dygraph, and fix doc error
      
      * corrected the doc writing
      
      * Delete redundant import and correct import order in sample code.
      
      * remove 'fluid' and add prepare() and fit() in sample code
      
      * correct usage of API 2.0 in sample code
      
      * fix sample code bugs
      
      * fix code style bugs
      
      * fix test_model.py bugs
      
      * set for_inference=True
      
      * correct usage for static.InputSpec
      
      * update doc for model.save
      
      * correct usage of API 2.0
      
      * rename param name for model.save
      
      * correct for_inference as training
      761ed17d
  27. 24 8月, 2020 1 次提交
  28. 20 8月, 2020 1 次提交
  29. 30 7月, 2020 1 次提交
  30. 24 7月, 2020 1 次提交
    • Q
      Refine Model of high level API (#25559) · b5f8784c
      qingqing01 提交于
      * Refine Model
      
      1. Take the network (instance of Layer) as the input of Model.
      2. Refine set_dict/load_dict of Layer.
      3. Refine Input interface, so update code sample about Input
      b5f8784c
  31. 13 5月, 2020 1 次提交
    • L
      Add vision api for hapi (#24404) · 1a0d26a4
      LielinJiang 提交于
      * add vision
      
      * fix predict, test=develop
      
      * add unittest for vision apis, test=develop
      
      * fix typos
      
      * add hapi models api, test=develop
      
      * fix code format, test=develop
      
      * fix typos, test=develop
      
      * fix sample code import, test=develop
      
      * fix sample codes, test=develop
      
      * add decompress, test=develop
      
      * rm darknet, test=develop
      
      * rm debug code, test=develop
      1a0d26a4
  32. 11 5月, 2020 1 次提交
    • Q
      Add a high-level API with traning and inference into Paddle. (#24293) · 43625bda
      qingqing01 提交于
      * Merge hapi into Paddle
      
      Hapi is a high level API for training and inference.
      The main modules include Model, Loss, Metrics, Dataset.
      Also includes common modules and models in NLP and computer vision, such as BERT, ResNet.
      
      These modules are developed by:
      0YuanZhang0, guoshengCS heavengate, LielinJiang, qingqing01, xyzhou-puck huangjun12, wangxiao1021, zhangyang.
      43625bda