1. 01 Nov, 2022 3 commits
    • N
      [CodeStyle][E712] use `if cond`/`if cond is True` for comparison with `True` (#47464) · 5a2ab683
      Nyakku Shigure committed
      * [CodeStyle][E712] use `if cond`/`if cond is True` for comparison with `True`
      
      * revert changes in fluid
      
      * revert unrelated file
      
      * revert changes in norm
      
      * revert changes in auto_parallel_amp
      
      * fix norm and auto_parallel_amp
      
      * revert a typo fix because it was already fixed in #47477
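The three spellings touched by the E712 cleanup above are not interchangeable, which may be why some of the changes had to be reverted. The sketch below is illustrative only and is not taken from the Paddle sources; the function names are hypothetical.

```python
def check_equality(flag):
    # The pattern E712 flags: equality with the literal True.
    # 1 and 1.0 compare equal to True, so they pass this check.
    return flag == True  # noqa: E712


def check_truthy(flag):
    # `if cond`: any truthy value passes (non-zero numbers, non-empty containers).
    return bool(flag)


def check_identity(flag):
    # `if cond is True`: only the bool singleton True passes.
    return flag is True


# Print how each check treats a few representative values.
for value in (True, 1, 2, "yes", [], None):
    print(repr(value), check_equality(value), check_truthy(value), check_identity(value))
```

Because the checks disagree on values such as `1` and `2`, a mechanical replacement can change behavior when the operand is not a plain `bool`.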
    • N
      [CodeStyle][py2] remove `six` package (part2) (#47334) · 3592ba8c
      Nyakku Shigure committed
      * [CodeStyle][py2] remove `six` package (part2)
      
      * six.ensure_str
      
      * remove unused `import six`
      
      * remove six from BUILTIN_LIKELY_MODULES
      
      * remove six in example code
      
      * remove some decode
      
      * try to fix example code
      
      * fix MockEtcdClient get/get_prefix returns data type
      
      * fix MockEtcdClient get_prefix returns data
      
      * fix MockEtcdClient get returns data
      
      * remove `six` in pypi and conda requirements
      
      * fix MockEtcdClient add_watch_callback/add_watch_prefix_callback returns data type
      
      * refine MockEtcdClient
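For readers unfamiliar with this kind of cleanup, the sketch below shows typical Python 3-only replacements for common `six` helpers. It is an assumption about the general pattern, not a copy of the changes in #47334; `ensure_str` here is a hypothetical local helper, and `config` is made-up sample data.

```python
def ensure_str(value, encoding="utf-8"):
    """Python 3-only stand-in for six.ensure_str (hypothetical helper)."""
    if isinstance(value, bytes):
        # bytes are decoded to text
        return value.decode(encoding)
    if isinstance(value, str):
        # text is returned unchanged
        return value
    raise TypeError(f"not expecting type {type(value)!r}")


config = {"b": 2, "a": 1}

# six.iteritems(config) becomes config.items()
pairs = sorted(config.items())

# isinstance(x, six.string_types) becomes isinstance(x, str)
is_text = isinstance("paddle", str)

# six.moves.range becomes the built-in range
indices = list(range(3))
```

On Python 3 the built-ins already behave the way the `six` wrappers did, so most call sites reduce to a rename; only bytes/str conversions need explicit handling.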
    • S
      add missing scale parameter (#47519) · ad251cb5
      sneaxiy committed
  2. 31 Oct, 2022 1 commit
  3. 28 Oct, 2022 3 commits
  4. 26 Oct, 2022 1 commit
  5. 24 Oct, 2022 3 commits
  6. 23 Oct, 2022 1 commit
  7. 21 Oct, 2022 3 commits
  8. 20 Oct, 2022 4 commits
  9. 19 Oct, 2022 3 commits
  10. 18 Oct, 2022 6 commits
  11. 17 Oct, 2022 3 commits
    • G
      Add enable_partial_send_recv switch in pipeline_configs (#46992) · b9a2f29c
      Ghost Screaming committed
      * Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
      is wrong.
      
      * Support allow_partial switch, which can be configured in
      pipeline_configs. If the tensors sent from different hosts
      are not the same, they shouldn't be sent partially and
      then concatenated into a whole tensor.
      
      * Change name allow_partial to enable_partial_send_recv.
      
      * Add global variable _enable_partial_send_recv
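The reduce_sum note above says results are wrong once `input.numel()` exceeds INT32_MAX. The commit message does not state the cause; the sketch below only illustrates, under the assumption that an element count is held in a 32-bit signed integer somewhere, why such sizes are hazardous. The helper names are hypothetical.

```python
import ctypes

INT32_MAX = 2**31 - 1


def as_int32(n):
    # ctypes integer types do no overflow checking, so the value
    # wraps around the same way a C int32_t would.
    return ctypes.c_int32(n).value


def as_int64(n):
    # A 64-bit signed integer holds the same count without wrapping.
    return ctypes.c_int64(n).value


numel = INT32_MAX + 1  # one element more than a 32-bit counter can hold

print("as int32:", as_int32(numel))
print("as int64:", as_int64(numel))
```

A count that wraps to a negative number would corrupt any loop bound or offset derived from it, which is why large-tensor code paths generally use 64-bit indices.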
    • G
      Support BF16 training for sharding (#46846) · 0b39b244
      Ghost Screaming committed
      * Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
      is wrong.
      
      * support pure bfloat16
      
      * support bf16 linear
      
      * update PR to pass CI
      
      * tiny fix where_grad_kernel.cu
      
      * Support bfloat16 type for reducer and sharding.
      
      * Fix some bugs.
      
      * Polish code.
      
      * Polish code.
      
      * Add bfloat16 datatype in fill_grad kernels.
      Co-authored-by: sneaxiy <sneaxiy@126.com>
    • Y
      [Auto Parallel] Fix the bug of completion (#47056) · f0af2708
      Yulong Ao committed
      * [Auto Parallel] Fix the bug for None labels
      
      * [Auto Parallel] Fix the completion bug
  12. 14 Oct, 2022 3 commits
  13. 13 Oct, 2022 3 commits
    • W
      combine dp and stage2 hybrid parallel. (#46795) · a95b6f33
      wuhuachaocoding committed
      * combine dp and stage2 hybrid parallel.
      
      * update condition.
    • X
      [WIP] PaddlePaddle distributed reinforcement learning feature development (#45998) · f0afcabc
      Xinger committed
      * add rpc module in cpp side
      
      * add rpc module in python side
      
      * support win32 and mac for rpc
      
      * code optimization
      
      * optimize code
      
      * update rpc
      
      * update rpc launch
      
      * rpc remove rank and world_size api
      
      * fix logger import bug
      
      * remove support for win and mac
      
      * remove support for xpu, npu, cinn and rocm
      
      * remove support for xpu, npu, cinn and rocm
      
      * fix shutdown barrier timeout bug
      
      * update:python_rpc_handler to shared ptr
      
      * fix master shutdown first bug
      
      * tests support for cpu
      
      * update log to vlog
      
      * update get service info api
      
      * add single process test case
      
      * remove process group
      
      * remove some useless dependencies
      
      * update rpc api comments
      
      * update rpc comments: Example to Examples
      
      * update rpc api comments
      
      * update rpc api comments
      
      * update launch api comments
      
      * update init_rpc comments
      
      * update rpc sync and async comments
      
      * fix bug: init_rpc can't be called repeatedly in a process
      
      * update rpc api comment: make master endpoint unique
      
      * update rpc api:service to worker, timeout_ms to timeout
      
      * rename ServiceInfo to WorkerInfo
      
      * refactor: rename server to worker, log to vlog
      
      * add launch test
      
      * remove unused codes
      
      * refine
    • N
  14. 12 Oct, 2022 3 commits
    • J
      bugfix (#46921) · acdaa4fb
      JZ-LIANG committed
    • Y
      [Auto Parallel] Improve the fine-grained APIs (#46552) · 686fa07a
      Yulong Ao committed
      * [Auto Parallel] Support different dataloaders
      
      * [Auto Parallel] Add num_shards config for dataset
      
      * [Auto Parallel] Unify the logger and outputs of Engine API
      
      * [Auto Parallel] Fix the bugs of to_static
      
      * [Auto Parallel] Adjust the test_to_static.py
      
      * [Auto Parallel] Add the prepare API and replace __call__ with run
      
      * [Auto Parallel] Improve the private implementations of Engine
      
      * [Auto Parallel] Set capacity of dataloader for opt tuning
      
      * [Auto Parallel] [WIP] Change the fine-grained API
      
      * [Auto Parallel] Improve APIs to support different use cases
      
      * [Auto Parallel] Add removed config
      
      * [Auto Parallel] Add imports
      
      * [Auto Parallel] Fix bugs for to_static
      
      * [Auto Parallel] Remove unnecessary imports
    • Z
      [Zero-Dim] support input 0D Tensor for some unary api (#45992) · 05c2b9ba
      zhouweiwei2014 committed
      * [Zero-Dim] support input 0D Tensor for unary api
      
      * fix CI