1. 25 9月, 2018 1 次提交
  2. 20 9月, 2018 1 次提交
    • C
      Feature/op_fuse_pass (#12440) · d402234b
      chengduo 提交于
      * Add Preface
      
      * Add demo code
      
      * Save file
      
      * Refine code
      
      * seems can work
      
      * use elementwise strategy
      
      * Use ElementwiseComputeEx
      
      * Add comments
      
      * extract functions from operator
      
      * Refine code
      
      * Follow comment
      
      * code refine
      
      * add op_fuse  pass
      
      * add backward
      
      * code refine
      
      * use TopologySortOperations
      
      * follow comments
      
      * refine IsFusible
      
      * code enhance
      
      * fix op_fusion_pass
      
      * refine code
      
      * refine fuse_elemwise_act_op
      
      * adjust the input and output
      
      * refine logic
      
      * add intermediate_edge
      
      * disable inplace
      
      * follow comments
      
      * refine logic
      
      * follow comments
      
      * Remove the removable IntermediateOut
      
      * change strategy
      
      * code refine
      
      * enable fuse backward
      
      * code refine
      
      * code refine
      
      * rename unit test
      
      * follow comments
      d402234b
  3. 17 9月, 2018 2 次提交
  4. 15 9月, 2018 1 次提交
  5. 10 9月, 2018 2 次提交
  6. 14 8月, 2018 1 次提交
  7. 09 8月, 2018 1 次提交
  8. 27 7月, 2018 1 次提交
  9. 26 7月, 2018 5 次提交
  10. 22 7月, 2018 1 次提交
  11. 18 7月, 2018 5 次提交
  12. 15 7月, 2018 1 次提交
  13. 13 7月, 2018 1 次提交
    • C
      Refine multi thread cpu parallel exe (#11406) · 86b0a725
      chengduo 提交于
      * refine multi-thread CPU Parallel exe
      
      * refine multi thread CPU Parallel exe
      
      * Refine CPU version for ParallelExecutor
      
      * add share_parameter_between_cards_
      
      * Fix ParallelExecutor bug
      
      * Fix unit test
      
      * Fix parameter opt balance
      
      * Fix with opti (param->grad)
      
      * Add grad to op var
      
      * Remove shard_param_between_cards
      86b0a725
  14. 12 7月, 2018 2 次提交
  15. 29 6月, 2018 1 次提交
  16. 28 6月, 2018 1 次提交
  17. 26 6月, 2018 4 次提交
  18. 21 6月, 2018 1 次提交
  19. 20 6月, 2018 1 次提交
  20. 14 6月, 2018 1 次提交
    • Q
      Fix NCCLBcast hang up bug in Parallel Executor (#11377) · 046bb5c8
      Qiyang Min 提交于
      * 1. Create buddy allocator in each places before NcclBcast the variables
      2. Check the memory usage of ALL gpus rather than the first one
      
      * 1. Make NCCLGroupGuard guards only the ncclBcast part, which avoid ncclGroupEnd blocking the exception throwing
      2. NOTE the usage of NCCLGroupGuard
      
      * Remove the memory usage check of gpus
      
      * Fix code style
      046bb5c8
  21. 12 6月, 2018 1 次提交
  22. 11 6月, 2018 1 次提交
  23. 10 6月, 2018 2 次提交
  24. 08 6月, 2018 1 次提交
  25. 07 6月, 2018 1 次提交