1. 19 Sep, 2022 · 1 commit
    • [cherry-pick] add abs,mean,sum,ge,gt,pow,etc higher-order differentiation operators (#46184) · ad8beaaf
      Committed by Xiaoxu Chen
      * [cherry-pick] extend reduce_sum,reduce_sum,eq,ne,ge,abs,pow,etc higher order operators
      
      * add reduce_mean,reduce_sum primitive ops
      * add ne_p gt_p primitive operators
      * add ge_p abs_p primitive operators
      * add cast primitive operators
      * add pow,square prim2orig rules
      * add elementwise_div orig2prim rule
      
      * [cherry-pick] add mean,sum,ge,gt,ne,abs,etc higher-order differentiation operators(#45888)
      
      * add reduce_mean,reduce_sum primitive ops
      
      * add ne_p gt_p primitive operators
      
      * add ge_p abs_p primitive operators
  2. 15 Sep, 2022 · 1 commit
  3. 05 Sep, 2022 · 1 commit
  4. 31 Aug, 2022 · 1 commit
  5. 25 Aug, 2022 · 1 commit
  6. 23 Aug, 2022 · 1 commit
  7. 16 Aug, 2022 · 2 commits
  8. 12 Aug, 2022 · 1 commit
  9. 09 Aug, 2022 · 1 commit
  10. 03 Aug, 2022 · 1 commit
  11. 29 Jul, 2022 · 1 commit
  12. 28 Jul, 2022 · 1 commit
  13. 25 Jul, 2022 · 1 commit
    • [Auto Parallel] Add dist op cost (#44146) · d0f4465d
      Committed by caozhou
      * update comp cost
      
      * add dist default op cost
      
      * add dist fill constant batch size like op cost
      
      * add elementwise op cost
      
      * add fill_constant_batch_size_like op cost unittest
      
      * add unittest and remove fill_constant_batch_size_like grad op cost
      
      * add to cmakelist
      
      * fix unittest bug
  14. 13 Jul, 2022 · 1 commit
  15. 07 Jul, 2022 · 1 commit
  16. 29 Jun, 2022 · 1 commit
  17. 05 Jun, 2022 · 1 commit
    • 【code format check upgrade】 step2:yapf (#42944) · a072fca8
      Committed by Sing_chan
      * use yapf to format all python files
      
      * exclude two unittest files from yapf because they rely on writing and reading files, and formatting would break them
      
      * disable diff_py_file because too many diff files cause the following command to fail
  18. 01 Jun, 2022 · 2 commits
    • [AutoParallel & Science] Miscellaneous improvements (#43139) · f59bcb1c
      Committed by JZ-LIANG
      * adapt for 10 loss
      
      * partitioner support optimizer
    • [Auto Parallel] Add miscellaneous improvements (#43108) · 010aba33
      Committed by Yulong Ao
      * [Auto Parallel] Add the parallel tuner
      
      * [Auto Parallel] Improve the parallel tuner and fix some bugs
      
      * update cost model
      
      * update import Resharder by dist op
      
      * update cost model
      
      * fix comp cost bug
      
      * update cost model
      
      * [Auto Parallel] Amend the dist attr for #processes=1
      
      * update cost model and tuner
      
      * update cost model and tuner
      
      * update cost model and tuner
      
      * update cluster
      
      * update reshard
      
      * [Auto Parallel] Add the estimation from the cost model
      
      * [Auto Parallel] Reimplement the backup and restore functions
      
      * [Auto Parallel] Fix the bugs of the parallel tuner
      
      * [Auto Parallel] Update the engine api and dist context
      
      * [Auto Parallel] Work around the high order grad problem
      
      * [Auto Parallel] Add some miscellaneous improvements
      
      * [Auto Parallel] Add a unittest for DistributedContext
      Co-authored-by: caozhou <caozhou@radi.ac.cn>
  19. 19 May, 2022 · 2 commits
  20. 10 May, 2022 · 1 commit
  21. 07 May, 2022 · 1 commit
  22. 06 May, 2022 · 1 commit
    • [AutoParallel] adapt for 2d laplace (#41601) · c043a21b
      Committed by zhaoyingli
      * add default_ctx in backward.py
      
      * record grad_var_to_var with grad_times
      
      * fix backward
      
      * update annotation
      
      * add complete_high_order_grad in complete_forward
      
      * add dist slice op
      
      * update grad_var_to_var type
      
      * update partition_block init mapping before loss op
      
      * update compatible for 'XShape' & update 'allreduce_vars'
      
      * add dist reshape op when input dim equal to output dim
      
      * update 'set_grad_var_shape' with grad_var_to_var
      
      * fix dist slice
      
      * fix set_grad_var_shape
      
      * add dist pnorm op
      
      * fix dist pnorm dist_attr
      
      * fix engine startprogram & adapt highorder grad
      
      * fix set_grad_var_shape when mp
      
      * update unittest
      
      * update cmakelist
      
      * default strategy in engine: dp
      
      * bug fix
      
      * tiny fix
      
      * flatten outputs
      
      * fix default strategy
      
      * init default ctx
      
      * tiny fix
      
      * test=allcase
  23. 19 Apr, 2022 · 2 commits
  24. 18 Apr, 2022 · 2 commits
  25. 25 Mar, 2022 · 2 commits
  26. 23 Mar, 2022 · 1 commit
  27. 16 Mar, 2022 · 1 commit
    • [Auto Parallel] Add the support for the auto completion of while_op (#39939) · ec6b8fbd
      Committed by Yulong Ao
      * [Auto Parallel] Support the auto completion of while_op
      
      * [Auto Parallel] Improve the completion algorithms
      
      * [Auto Parallel] Fix bugs for ernie inference
      
      * [Auto Parallel] Remove attrs which cannot be pickled
      
      * [Auto Parallel] make the dims_mappings of LodTensorArray vars empty
      
      * [Auto Parallel] Fix bugs for the ernie inference in the pipeline parallel
      
      * [Auto Parallel] Remove unnecessary comments
      
      * [Auto Parallel] Fix a bug of the CMakeLists
      
      * [Auto Parallel] Use the newest APIs to write the unit test
      
      * [Auto Parallel] Remove unnecessary statements
  28. 02 Mar, 2022 · 1 commit
  29. 22 Feb, 2022 · 1 commit
    • Auto Parallel support conditional block (#39612) · a08ee62a
      Committed by JZ-LIANG
      * add subblock logic for context and partitioner
      
      * partitioner support sub blocks
      
      * revise typos
      
      * fixed param init bug for while
      
      * chmod 644
      
      * add unittest
      
      * mv forward parser
      
      * update unittest
      
      * update dist op ctx
      
      * update dist op ctx
      
      * fixed bug in dist op ctx
      
      * fixed bug for recompute subblock
  30. 18 Feb, 2022 · 1 commit
  31. 29 Jan, 2022 · 1 commit
    • Auto parallel/qkv fuse (#39080) · fdedf909
      Committed by JZ-LIANG
      * support qkv fuse
      
      * support qkv fuse
      
      * update completion
      
      * update completion
      
      * update dist_split
      
      * rerun ci
      
      * is_auto_compatible added
      
      * is_auto_compatible added
  32. 27 Jan, 2022 · 1 commit
  33. 20 Jan, 2022 · 1 commit
  34. 18 Jan, 2022 · 1 commit