1. 14 Oct, 2022 1 commit
  2. 12 Oct, 2022 2 commits
    • [Auto Parallel] Improve the fine-grained APIs (#46552) · 686fa07a
      Committed by Yulong Ao
      * [Auto Parallel] Support different dataloaders
      
      * [Auto Parallel] Add num_shards config for dataset
      
      * [Auto Parallel] Unify the logger and outputs of Engine API
      
      * [Auto Parallel] Fix the bugs of to_static
      
      * [Auto Parallel] Adjust the test_to_static.py
      
      * [Auto Parallel] Add the prepare API and replace __call__ with run
      
      * [Auto Parallel] Improve the private implementations of Engine
      
      * [Auto Parallel] Set capacity of dataloader for opt tuning
      
      * [Auto Parallel] [WIP] Change the fine-grained API
      
      * [Auto Parallel] Improve APIs to support different use cases
      
      * [Auto Parallel] Add removed config
      
      * [Auto Parallel] Add imports
      
      * [Auto Parallel] Fix bugs for to_static
      
      * [Auto Parallel] Remove unnecessary imports
  3. 10 Oct, 2022 1 commit
  4. 08 Oct, 2022 1 commit
  5. 28 Sep, 2022 2 commits
  6. 27 Sep, 2022 2 commits
  7. 26 Sep, 2022 1 commit
  8. 19 Sep, 2022 1 commit
  9. 17 Sep, 2022 1 commit
  10. 15 Sep, 2022 1 commit
    • [Auto Parallel] Improve the APIs (#45776) · b042a3b1
      Committed by Yulong Ao
      * [Auto Parallel] Use c++ dist attr in the completion process
      
      * [Auto Parallel] Add minor changes
      
      * [Auto Parallel] Use c++ dist attr in the completion process
      
      * [Auto Parallel] Add minor changes
      
      * [Auto Parallel] Add the serialization process for dist attrs
      
      * [Auto Parallel] Remove unnecessary comments
      
      * [Auto Parallel] Fix some bugs
      
      * [Auto Parallel] Fix the code style
      
      * [Auto Parallel] Remove unnecessary impls
      
      * [Auto Parallel] Fix the importing error
      
      * [Auto Parallel] Fix the copy from bugs of op dist attr
      
      * [Auto Parallel] Replace the use of constexpr if
      
      * [Auto Parallel] Redesign the shard_tensor, shard_op and ProcessMesh
      
      * [Auto Parallel] Change API of the completion unittest
      
      * [Auto Parallel] Fix the bug when set_attr an int
      
      * [Auto Parallel] Add the unittest for the serialization
      
      * [Auto Parallel] Add some unit tests
      
      * [Auto Parallel] Unify the strategy
      
      * [Auto Parallel] Improve the engine api
      
      * [Auto Parallel] Reset the changes made to the framework
      
      * [Auto Parallel] Change the engine unittest
      
      * [Auto Parallel] Update API of the completion and partitioner
      
      * [Auto Parallel] Update unit tests using engine api
      
      * update shard annotation
      
      * [Auto Parallel] Remove the modifications of other modules
      
      * [Auto Parallel] Add docs for APIs
      
      * add new strategy
      
      * [Auto Parallel] Replace the logger
      
      * [Auto Parallel] Restore the test_program.py
      
      * [Auto Parallel] Change the import rules
      
      * [Auto Parallel] Add the examples for Engine
      
      * [Auto Parallel] Do some minor changes
      
      * [Auto Parallel] Remove yaml dependency
      
      * [Auto Parallel] Fix the unittests
      
      * add valid after train
      
      * bug fix
      Co-authored-by: zhaoyingli <zhaoyingli@baidu.com>
      Co-authored-by: caozhou <caozhou@radi.ac.cn>
      Co-authored-by: caozhou <48191911+Caozhou1995@users.noreply.github.com>
  11. 14 Sep, 2022 2 commits
  12. 09 Sep, 2022 1 commit
  13. 07 Sep, 2022 1 commit
  14. 05 Sep, 2022 1 commit
  15. 31 Aug, 2022 2 commits
  16. 25 Aug, 2022 1 commit
  17. 23 Aug, 2022 1 commit
  18. 18 Aug, 2022 1 commit
  19. 16 Aug, 2022 1 commit
  20. 15 Aug, 2022 2 commits
    • [AutoParallel] add collate_fn for dist_loader (#45053) · 3649099f
      Committed by zhaoyingli
      * add collate_fn
      
      * fix number of inputs
    • [Auto Parallel] Move the distributed info from python to c++ (#44510) · a52357fe
      Committed by Yulong Ao
      * [Auto Parallel] Move the distributed info from python to c++
      
      * [Auto Parallel] Add dist_attrs for VarDesc and OpDesc
      
      * [Auto Parallel] Add the lost file
      
      * [Auto Parallel] Make the dist attr be unique_ptr
      
      * [Auto Parallel] Add the proto conversion
      
      * [Auto Parallel] Improve the proto support
      
      * [Auto Parallel] Fix the bugs for adding a device or a link
      
      * [Auto Parallel] Add the C++ ProcessMesh and DistributedMapper
      
      * [Auto Parallel] Improve the impl of these dist attrs
      
      * [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh
      
      * [Auto Parallel] Fix the unittest problem
      
      * [Auto Parallel] Explicitly add the src file for auto_parallel target
      
      * [Auto Parallel] Add the proto dependency explicitly
      
      * [Auto Parallel] Fix the cmake bug on windows and mac
      
      * [Auto Parallel] Remove the pybind11 header file in process_mesh.h
      
      * [Auto Parallel] Remove unused codes
      
      * [Auto Parallel] Check whether the dist attr is null
      
      * [Auto Parallel] Implement the assign operator for OpDesc explicitly
  21. 12 Aug, 2022 1 commit
    • [Auto Parallel] Pybind ProcessMesh and DeviceMesh (#45013) · 5bf3dec9
      Committed by Yulong Ao
      * [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh
      
      * [Auto Parallel] Fix the unittest problem
      
      * [Auto Parallel] Explicitly add the src file for auto_parallel target
      
      * [Auto Parallel] Add the proto dependency explicitly
      
      * [Auto Parallel] Fix the cmake bug on windows and mac
      
      * [Auto Parallel] Remove the pybind11 header file in process_mesh.h
  22. 09 Aug, 2022 1 commit
  23. 03 Aug, 2022 1 commit
  24. 29 Jul, 2022 2 commits
  25. 27 Jul, 2022 1 commit
  26. 25 Jul, 2022 1 commit
    • [Auto Parallel] Add dist op cost (#44146) · d0f4465d
      Committed by caozhou
      * update comp cost
      
      * add dist default op cost
      
      * add dist fill constant batch size like op cost
      
      * add elementwise op cost
      
      * add fill_constant_batch_size_like op cost unittest
      
      * add unittest and remove fill_constant_batch_size_like grad op cost
      
      * add to cmakelist
      
      * fix unittest bug
  27. 21 Jul, 2022 1 commit
  28. 13 Jul, 2022 1 commit
  29. 12 Jul, 2022 1 commit
  30. 11 Jul, 2022 1 commit
  31. 24 Jun, 2022 1 commit
    • [Auto Parallel] Use a fast completion for data parallelism (#43585) · e64823c1
      Committed by Yulong Ao
      * [Auto Parallel] Use a fast completion for data parallelism
      
      * remove unused cuSparse function
      
      * [Auto Parallel] Fix some bugs of the fast dp completion
      
      * [Auto Parallel] Add the cmake statements
      
      * [Auto Parallel] Make the unittest adapt to the new interface
      
      * [Auto Parallel] Modify the timeout of the unittest
      
      * [Auto Parallel] Remove unnecessary comments
      Co-authored-by: zhouwei25 <zhouwei25@baidu.com>
  32. 13 Jun, 2022 1 commit
  33. 09 Jun, 2022 1 commit