1. 07 Apr, 2021 1 commit
    • 【NPU】Merge ascend GE&distributed code by 0208 from ascendrc (#31957) · 8c7c53b3
      zhang wenhui committed
      * Ascend rc (#30483)
      
      * Fix compilation on CANN20.1 and older (#30494)
      
      Fix compilation on CANN20.1 and older
      
      * Add distribution supported (#30578)
      
      Add distribution supported
      
      * Build parser for Hcom* operators (#30627)
      
      Build parser for Hcom* operators
      
      * Pass device_ids info from launch to trainer. (#30632)
      
      Pass device_ids info from launch to trainer
      
      * Add Hccl program group (#30642)
      
      Add Hccl program group
      
      * Add startup bash files of test_ascend_group. (#30645)
      
      Add startup bash files of test_ascend_group
      
      * cleanup (#30646)
      
      cleanup test_ascend_group.py
      
      * [Feature] Build parser to support distributed training (#30658)
      
      [Feature] Build parser to support distributed training
      
      * fix compilation on ascend-20.1 (#30722)
      
      fix compilation on ascend-20.1
      
      * Dev/fix ascend string (#30749)
      
      Dev/fix ascend string
      
      * code style (#30781)
      
      code style
      
      * Merge ascend_optimizer and ascend_parser. (#30776)
      
      Merge ascend_optimizer and ascend_parser.
      
      * Ascendrc add converted op : [range/equal/range/uniform_random/expand/squeeze], fix cast op bug  (#30797)
      
      Ascendrc add converted op : [range/equal/range/uniform_random/expand/squeeze], fix cast op bug
      
      * Add paddle ascend distribution training supported (#30796)
      
      Add paddle ascend distribution training supported
      
      * pass cxx_flags to gloo cmake (#30857)
      
      * Destroy session first. (#30954)
      
      Destroy session first.
      
      * merge
      
      * fix, test=develop
      
      * fix, test=develop
      
      * fix style, test=develop
      
      * fix, test=develop
      
      * fix
      
      * fix log fatal, test=develop
      
      * fix enforce style, test=develop
      
      * fix, test=develop
      
      * fix, test=develop
      
      * fix rccl, test=develop
      
      * fix test, test=develop
      
      * fix, test=develop
      
      * fix, test=develop
      
      * fix, test=develop
      
      * fix node_num, test=develop
      
      * fix ids str, test=develop
      
      * fix ids str, test=develop
      
      * fix ids str, test=develop
      
      * fix, test=develop
      
      * fix, test=develop
      
      * fix, test=develop
      
      * fix, test=develop
      
      * fix, test=develop
      
      * fix, test=develop
      
      * fix, test=develop
      
      * fix, test=develop
      
      * fix style code, test=develop
      
      * fix style code, test=develop
      
      * fix style code, test=develop
      
      * fix style code, test=develop
      Co-authored-by: hutuxian <hutuxian2011@sina.cn>
      Co-authored-by: gongweibao <weibao.gong@gmail.com>
      Co-authored-by: Void Main <voidmain1313113@gmail.com>
      Co-authored-by: Leo Chen <chenqiuliang@baidu.com>
      Co-authored-by: dingsiyu <18369187719@163.com>
      Co-authored-by: OleNet <olenet@126.com>
  2. 05 Feb, 2021 1 commit
  3. 01 Feb, 2021 1 commit
  4. 17 Dec, 2020 1 commit
  5. 26 Nov, 2020 1 commit
  6. 20 Sep, 2020 1 commit
    • 【paddle.fleet】Fix/role maker api fix (#27326) · d6b54de4
      tangwei12 committed
      * fix fleet util and gloo
      
      * fix worker endpoints
      
      * fix
      
      * fix UT
      
      * fix gloo
      
      * fix gloo
      
      * update gloo
      
      * update gloo
      
      * update gloo
      
      * update gloo
      
      * update gloo
      
      * fix gloo wrapper for hdfs
      
      * add file gloo and UT
      
      * fix UT
      
      * fix UT
      
      * fix UT
      
      * hide public method of RoleMaker
      
      * fix UT
      
      * GPU fleetrun support gloo
      
      * parameterserver fleetrun support gloo
      
      * add UT
      
      * add UT
      
      * fix UT
      
      * fix get server endpoint
      
      * fix get server endpoint
      
      * fix UT
      
      * hide public method of rolemaker
      
      * hide public method of rolemaker
      
      * hide public method of rolemaker
      
      * Update test_fleet_rolemaker_new.py
      
      * hide public method of rolemaker
      
      * hide public method of rolemaker
  7. 10 Sep, 2020 1 commit
  8. 07 Sep, 2020 1 commit
  9. 21 Aug, 2020 1 commit
  10. 18 Aug, 2020 1 commit
  11. 13 Aug, 2020 1 commit
  12. 11 Aug, 2020 1 commit
    • Paddle-2.0 API directory migration (#25898) · 2efcb481
      pangyoki committed
      * Directory migration, test=develop
      
      * Change imperative from paddle init to paddle framework, test=develop
      
      * Fixed jit bug, test=develop
      
      * default static mode, test=develop
      
      * fixed format and create parameter belongs to framework, test=develop
      
      * Fixed import package, test=develop
      
      * fix __init__ format, test=develop
      
      * fixed alias problem
      
      * fixed paddle.enable_imperative problems, test=develop
      
      * Add unittest
      
      * delete install_check comment
      
      * Fixed unittest timeout
      
      * fixed unittest error
      
      * move Program default_xx_program to static package
      
      * optimize unittest method
      
      * fixed framework __init__ format
      
      * fixed jit path
      
      * delete alias
      
      * move jit to paddle
      
      * Fixed unittest format
      
      * fixed paddle.default_main_program
      
      * Fixed save load API in paddle __init__.py
      
      * fixed ci paddle.imperative.to_variable
  13. 03 Aug, 2020 1 commit
  14. 30 Jul, 2020 1 commit
  15. 29 Jul, 2020 1 commit
  16. 28 Jul, 2020 1 commit
    • add more settings for distributed strategy (#25685) · 920d998f
      Dong Daxiang committed
      * add more settings for distributed strategy
      Basically, DistributedStrategy has several parts of configurations:
      - BuildStrategy: the same as paddle.fluid.BuildStrategy, but the distributed arguments are moved out of BuildStrategy
      - ExecutionStrategy: the same as paddle.fluid.ExecutionStrategy
      - collective communication configs: nccl_comm_num, hierarchical allreduce and so on
      - distributed algorithms: async_update(mainly used in PS), lars, lamb and so on
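
The four configuration groups listed above can be pictured as nested settings objects. What follows is a minimal sketch using plain Python dataclasses, not Paddle's actual DistributedStrategy class. The class names (`StrategySketch`, `BuildOptions`, `ExecutionOptions`, `CollectiveOptions`, `AlgorithmOptions`), the example fields inside the build and execution groups, and all default values are assumptions made for illustration; only `nccl_comm_num`, hierarchical allreduce, `async_update`, `lars`, and `lamb` are named in the commit message.

```python
from dataclasses import dataclass, field, asdict


@dataclass
class BuildOptions:
    # Illustrative stand-in for graph-build settings; the commit says the
    # distributed arguments are moved out of this group.
    fuse_elewise_add_act_ops: bool = False


@dataclass
class ExecutionOptions:
    # Illustrative stand-in for executor settings.
    num_threads: int = 1


@dataclass
class CollectiveOptions:
    # Collective communication configs named in the commit message.
    nccl_comm_num: int = 1
    hierarchical_allreduce: bool = False


@dataclass
class AlgorithmOptions:
    # Distributed algorithms named in the commit message.
    async_update: bool = False  # mainly used in parameter-server mode
    lars: bool = False
    lamb: bool = False


@dataclass
class StrategySketch:
    # Hypothetical container grouping the four parts of the configuration.
    build: BuildOptions = field(default_factory=BuildOptions)
    execution: ExecutionOptions = field(default_factory=ExecutionOptions)
    collective: CollectiveOptions = field(default_factory=CollectiveOptions)
    algorithms: AlgorithmOptions = field(default_factory=AlgorithmOptions)


# Configure one collective setting and one algorithm, then flatten to a dict.
strategy = StrategySketch()
strategy.collective.nccl_comm_num = 2
strategy.algorithms.lamb = True
config = asdict(strategy)
```

Keeping each group in its own object mirrors the separation the commit describes: build and execution settings stay independent of the distributed ones, so a single-process run could reuse them unchanged.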
  17. 23 Jul, 2020 1 commit
  18. 20 Jul, 2020 1 commit