1. 06 1月, 2023 1 次提交
    • Y
      [Auto Parallel] Merge dist attrs from python into c++ (#49214) · c7899074
      Yulong Ao 提交于
      * [Auto Parallel] Rename methods of ProcessMesh
      
      * [Auto Parallel] Impl the python process_mesh by the c++ one
      
      * [Auto Parallel] Add some minor modifications
      
      * [Auto Parallel] Rename some methods
      
      * [Auto Parallel] Remove unnecessary codes
      
      * [Auto Parallel] Add back some removed files
      
      * [Auto Parallel] Fix bugs
      
      * [Auto Parallel] Fix a bug
      
      * Update process_mesh.cc
      
      * [Auto Parallel] Merge dist attrs of Python into C++
      
      * [Auto Parallel] Add back deleted importing
      
      * [Auto Parallel] Add back removed unittest
      
      * [Auto Parallel] Remove type qualifiers of return types
      
      * [Auto Parallel] Fix some bugs
      
      * [Auto Parallel] Fix a bug of the quant pass
      
      * [Auto Parallel] Fix the code style
      c7899074
  2. 04 1月, 2023 1 次提交
    • J
      [Auto Parallel-Performance] Sharding Comm Optimization (#48604) · 5592f8ad
      JZ-LIANG 提交于
      * remove deps and prior comm
      
      * grad comm fuse
      
      * add deps for amp&global norm
      
      * stage2 broadcast prior deps
      
      * stage2 grad overlap
      
      * stream_analyzer bugfix
      
      * overlap enable
      
      * dep op namescope
      
      * depend support multiple inputs
      
      * check finite deps
      
      * stage2 param comm overlap
      
      * Set kD2HStream
      
      * grad comm hierarchical
      
      * grad comm hierarchical
      
      * new unitest
      Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
      5592f8ad
  3. 29 12月, 2022 1 次提交
  4. 28 12月, 2022 1 次提交
    • Z
      [AutoParallel] adapt for clip (#49249) · df944772
      zhaoyingli 提交于
      * [AutoParallel] adapt for clip
      
      * fix unittest
      
      * enable_static
      
      * fix dist_fill_constant_batch_size_like
      
      * fix process_mesh.shape
      
      * update cond of modifying shape_list
      df944772
  5. 27 12月, 2022 2 次提交
  6. 26 12月, 2022 1 次提交
    • Y
      [Auto Parallel] Merge the python and c++ impls of ProcessMesh (#47503) · 1c0afa79
      Yulong Ao 提交于
      * [Auto Parallel] Rename methods of ProcessMesh
      
      * [Auto Parallel] Impl the python process_mesh by the c++ one
      
      * [Auto Parallel] Add some minor modifications
      
      * [Auto Parallel] Rename some methods
      
      * [Auto Parallel] Remove unnecessary codes
      
      * [Auto Parallel] Add back some removed files
      
      * [Auto Parallel] Fix bugs
      
      * [Auto Parallel] Fix a bug
      
      * Update process_mesh.cc
      
      * [Auto Parallel] Fix a bug
      1c0afa79
  7. 19 12月, 2022 1 次提交
  8. 14 12月, 2022 2 次提交
    • F
      fluid API magration : array_read, array_write (#49022) · 18f921e9
      feifei-111 提交于
      * del array_write & array_read
      
      * fix import err
      
      * fix import err
      
      * fix example codes
      18f921e9
    • Z
      [AutoParallel] recompute tuning (#48608) · 170a31f9
      zhaoyingli 提交于
      * [AutoParallel] recompute tuning
      
      * fix conflict
      
      * update comment
      
      * bug fix
      
      * update rc algo
      
      * tiny fix
      
      * fix clear process_group
      
      * remove comment
      
      * update segment print
      
      * fix import OpRole
      
      * adapt amp pass and grad_clip pass for opt_tuner
      
      * update tuning config
      
      * fix import
      
      * annotate recompute info on ops and upgrade recompute pass
      
      * add op_namescope for seed op
      
      * record reserved vars
      
      * fix recompute var's dist_attr
      
      * fix strategy unittest
      
      * adapt for fp16
      
      * update unittest
      
      * revert copy opt
      
      * update unittest
      
      * rename set_recompute_segments
      
      * fix unittest
      170a31f9
  9. 12 12月, 2022 1 次提交
  10. 08 12月, 2022 2 次提交
  11. 06 12月, 2022 1 次提交
    • K
      Remove fluid matmul (#47988) · 8fb829ba
      kangguangli 提交于
      * remove layers.matmul in nets.py
      
      * remove layers.matmul in rnn_impl/test_quantization_pass/auto_parallel_gpt_model/test_auto_parallel_completion_gpt
      
      * remove layers.matmul in other files
      
      * fix
      
      * fix
      
      * remove layers.matmul itself
      
      * remove ref in CMakeLists.txt and tools directory
      
      * remove matmul in fluid.layers.nn.py
      
      * remove matmul in fluid.dygraph.rnn.py && resotre test_matmul_op.py
      
      * replace matmul in fluid.dygraph.rnn.py && clean api_test in test_matmul_op.py
      
      * fix error && restore empty test_auto_search_dist_matmul_op.py
      
      * fix check in test_auto_parallel_partitioner.py
      
      * fix test_dist_matmul && test_flags_mkldnn_ops_on_off
      
      * fix test_fused_attention_op_xpu.py && test_matmul_op_xpu.py
      
      * remove test_auto_search_dist_matmul_op.py
      
      * remove layers.matmul in auto_parallel_gpt_model.py && fix doc in fluid/io.py
      
      * fix for matmul_grad
      
      * fix codestyle
      
      * fix codestyle
      
      * resolve conflicts error
      
      * restore unit test file but not compiled it for later remove
      
      * fix codestyle
      
      * fix wrong unittest skip
      
      * fix unittest delete
      
      * fix scale cost
      
      * fix scale cost
      
      * resolve conflicts error
      
      * resolve conflicts error
      Co-authored-by: Njakpiase <jakpia21@gmail.com>
      8fb829ba
  12. 02 12月, 2022 2 次提交
  13. 01 12月, 2022 1 次提交
  14. 29 11月, 2022 1 次提交
  15. 28 11月, 2022 1 次提交
  16. 25 11月, 2022 1 次提交
  17. 24 11月, 2022 1 次提交
  18. 23 11月, 2022 1 次提交
    • X
      【fluid api clear】remove transpose (#47917) · b994c89d
      xiaoguoguo626807 提交于
      * remove transpose
      
      * codestyle check
      
      * modify CI_STATIC
      
      * modify CI_STATIC
      
      * modify enable static()
      
      * remove unused import
      
      * fix conflict about stack
      
      * fix conflict about stack
      
      * fix conflict about stack
      
      * endless conflict
      b994c89d
  19. 22 11月, 2022 1 次提交
  20. 18 11月, 2022 1 次提交
  21. 10 11月, 2022 1 次提交
  22. 08 11月, 2022 1 次提交
  23. 07 11月, 2022 1 次提交
  24. 03 11月, 2022 1 次提交
  25. 01 11月, 2022 2 次提交
    • N
      [CodeStyle][E711] use `is`/`is not` for comparison with `None` (#47452) · a35a4a53
      Nyakku Shigure 提交于
      * [CodeStyle][E711] use `is`/`is not` for comparison with `None`
      
      * `self.assertTrue($A is None)` -> `self.assertIsNone($A)`
      
      * `self.assertTrue($A is not None)` -> `self.assertIsNotNone($A)`
      
      * `self.assertFalse($A is None)` -> `self.assertIsNotNone($A)`
      
      * `self.assertEqual($A, None)` -> `self.assertIsNone($A)`
      
      * `self.assertNotEqual($A, None)` -> `self.assertIsNotNone($A)`
      a35a4a53
    • N
      [CodeStyle][E712] use `if cond`/`if cond is True` for comparison with `True` (#47464) · 5a2ab683
      Nyakku Shigure 提交于
      * [CodeStyle][E712] use `if cond`/`if cond is True` for comparison with `True`
      
      * revert changes in fluid
      
      * revert unrelated file
      
      * revert changes in norm
      
      * revert changes in auto_parallel_amp
      
      * fix norm and auto_parallel_amp
      
      * revert a typo fix due to fixed at #47477
      5a2ab683
  26. 31 10月, 2022 1 次提交
  27. 28 10月, 2022 1 次提交
  28. 23 10月, 2022 1 次提交
  29. 18 10月, 2022 3 次提交
    • C
      [Auto Parallel]Add parallel tuner (#46189) · 3108ba11
      caozhou 提交于
      * add parallel tuner
      
      * add unittest
      
      * fix unittest
      
      * set timeout of unittest
      
      * set unittest timeout
      
      * fix auto_mode setting
      
      * update unittest
      
      * sync from develop and update unittest
      
      * remove unused import
      
      * update unittest
      
      * update cmakelist
      
      * add unittests
      3108ba11
    • C
      [Auto Parallel] Add cost interface (#47043) · da051350
      caozhou 提交于
      * add cost interface
      
      * update inferface and add unittest
      
      * update unittest
      
      * update inferface
      da051350
    • Z
      [AutoParallel] add callbacks (#47014) · 7c92177c
      zhaoyingli 提交于
      * [AutoParallel] add callbacks
      
      * fix unittest
      
      * fix dist_context
      
      * fix engine
      
      * fix cmakelist
      
      * fix unittest's returns
      
      * fix cmakelist
      7c92177c
  30. 14 10月, 2022 1 次提交
  31. 12 10月, 2022 2 次提交
    • Y
      [Auto Parallel] Improve the fine-grained APIs (#46552) · 686fa07a
      Yulong Ao 提交于
      * [Auto Parallel] Suppport different dataloaders
      
      * [Auto Parallel] Add num_shards config for dataset
      
      * [Auto Parallel] Unify the logger and outputs of Engine API
      
      * [Auto Parallel] Fix the bugs of to_static
      
      * [Auto Parallel] Adjust the test_to_static.py
      
      * [Auto Parallel] Add the prepare API and replace __call__ with run
      
      * [Auto Parallel] Improve the private implementations of Engine
      
      * [Auto Parallel] Set capacity of dataloader for opt tuning
      
      * [Auto Parallel] [WIP] Change the fine-grained API
      
      * [Auto Parallel] Improve APIs to support different user cases
      
      * [Auto Parallel] Add removed config
      
      * [Auto Parallel] Add imports
      
      * [Auto Parallel] Fix bugs for to_static
      
      * [Auto Parallel] Remove unnecessary imports
      686fa07a
    • N
  32. 10 10月, 2022 1 次提交