1. 08 Jan, 2021 1 commit
  2. 05 Jan, 2021 1 commit
    • [cherry pick] Set FLAGS_selected_gpus for spawn (#29962) (#30097) · cda7397f
      Chen Weihang committed
      Set FLAGS_selected_gpus for spawn.
      
      When the child process starts, it inherits the configuration of the main process and sets the FLAGS once. The environment variable has not been set at that point, so FLAGS_selected_gpus keeps the same value as in the main process (usually empty). The flags are therefore updated manually here.
      
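      Note: a unit test was added and then removed. Its printed output showed that nvidia-smi on the CI machine reports only two GPUs, and more than two GPUs are needed to test this problem.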
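The ordering problem described in this commit message can be illustrated with a minimal, framework-free sketch. It does not use Paddle: `_flags`, `load_flags_from_env` and `child_entry` are hypothetical names that stand in for the flag registry and the child start-up path, and the real fix in the spawn code may look different.

```python
import os

FLAG_NAME = "FLAGS_selected_gpus"

# Hypothetical stand-in for the framework's global flag registry.
_flags = {FLAG_NAME: ""}


def load_flags_from_env():
    """Copy the flag value from the environment, as done once at start-up."""
    _flags[FLAG_NAME] = os.environ.get(FLAG_NAME, "")


def child_entry(env_dict, refresh_flags):
    """Simulate the start of a spawned child and return the flag it ends up with."""
    # The main process usually has the variable unset.
    os.environ.pop(FLAG_NAME, None)
    # 1. Flags are initialized first, before the per-process environment exists.
    load_flags_from_env()
    # 2. Only now does the child receive its own environment variables.
    os.environ.update(env_dict)
    # 3. The fix: push the value into the flag registry by hand.
    if refresh_flags:
        _flags[FLAG_NAME] = env_dict[FLAG_NAME]
    return _flags[FLAG_NAME]


print(repr(child_entry({FLAG_NAME: "1"}, refresh_flags=False)))  # prints ''
print(repr(child_entry({FLAG_NAME: "1"}, refresh_flags=True)))   # prints '1'
```

Without the manual refresh the simulated child keeps the empty value inherited from the main process; with it, the child sees its own device id.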
  3. 26 Nov, 2020 1 commit
  4. 24 Nov, 2020 1 commit
  5. 14 Oct, 2020 1 commit
  6. 29 Sep, 2020 1 commit
  7. 28 Sep, 2020 2 commits
  8. 31 Aug, 2020 1 commit
  9. 28 Aug, 2020 1 commit
    • Add interface to launch parallel dygraph by multiprocessing (#26044) · 31f422ae
      Chen Weihang committed
      * add dygraph parallel run interface
      
      * polish implementation & unify env property name
      
      * add print config arg
      
      * refactor init_parallel_env function
      
      * Compatible with multiprocessing and launch modes
      
      * set default trainer start port
      
      * support run in python 2
      
      * polish python2 support code
      
      * remove python2 support
      
      * refine launch import
      
      * polish demo design details
      
      * refactor api implementation & path
      
      * use new method _set_expected_place
      
      * add spawn unittest framework & mnist test
      
      * add more unittests & doc
      
      * fix failing unittest
      
      * polish English doc
      
      * self review and polish details
      
      * refactor code by reviewer's comments
      
      * fix failing unittest
      
      * fix parallel_env unittest
      
      * fix several typos
      
      * fix error introduced when fixing typos
      
      * add non-public note for start_processes
      
      * polish details by xiaoguang's comment
      
      * verify correctly when spawn nprocs=-1
      
      * refactor spawn & init_parallel_env design
      
      * polish doc details
      
      * open spawn unittests
      
      * try to fix doc compile error
      
      * try to fix unknown doc format error
      
      * add skip unittest when not gpu
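One item in this commit concerns checking the process count when spawn is called with `nprocs=-1`. The sketch below shows one plausible way such a check could work, assuming that -1 means "one process per visible device" and that visibility is taken from a `CUDA_VISIBLE_DEVICES`-style string. The function `resolve_nprocs` and its parameters are hypothetical and are not taken from the Paddle source.

```python
def resolve_nprocs(nprocs, visible_devices, device_count):
    """Return the number of worker processes to start.

    nprocs: requested process count, or -1 for "use every visible device".
    visible_devices: a CUDA_VISIBLE_DEVICES-style string, or None if unset.
    device_count: devices on the machine, used when visible_devices is None.
    """
    if visible_devices is None:
        available = device_count
    else:
        # Count the non-empty, comma-separated device ids.
        available = len([d for d in visible_devices.split(",") if d.strip()])
    if nprocs == -1:
        return available
    if nprocs < 1 or nprocs > available:
        raise ValueError(
            "nprocs must be -1 or between 1 and %d, got %d" % (available, nprocs)
        )
    return nprocs


print(resolve_nprocs(-1, "0,1,3", device_count=8))  # prints 3
print(resolve_nprocs(2, None, device_count=4))      # prints 2
```

Validating the count before any process starts keeps the failure in the parent, where it is easier to report than an error raised inside a child.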