1. 05 Jan, 2021 3 commits
    • [cherry pick] Set FLAGS_selected_gpus for spawn (#29962) (#30097) · cda7397f
      Chen Weihang committed
      Set FLAGS_selected_gpus for spawn.
      
      When the child process starts, it inherits the configuration of the main process and sets the FLAGS once, but the environment variable has not been set at that point. As a result, FLAGS_selected_gpus stays the same as in the main process (usually empty), so the flags are updated manually here.
      
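      Note: a unit test was added and then removed. The test output showed that nvidia-smi on the CI machine reports only two GPUs, and testing this issue requires more than two.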
      cda7397f
    • fix large scale memory (#30035) (#30085) · e3975223
      tangwei12 committed
      * memory holder optimize
      
      Change-Id: Ic91af8ac6f2853336d28a9fbbc5e8d0c57b5d05e
      
      * memory holder optimize
      
      Change-Id: I2fd1c14ecc17f5d5ce88b87890381ea801e6367f
      
      * fix large scale memory holder
      
      Change-Id: Ief0992b02b00220e16c72cc637a56e7b5788140f
      
      * fix large scale memory holder
      
      Change-Id: I910142a3952ead643a5604f8f80955f3e6efe655
      e3975223
  2. 04 Jan, 2021 3 commits
  3. 31 Dec, 2020 6 commits
  4. 30 Dec, 2020 3 commits
  5. 29 Dec, 2020 10 commits
    • [Kunlun] 2.0 cherry-pick: Support for Baidu Kunlun XPU multi card training (#29713) · 847aa172
      liuyuhui committed
      * [Kunlun] PR1:Support one Kunlun card training in parallel executor (#29337)
      
      * [Kunlun] PR2: Support MultiDevicePass and BKCL in parallel executor (#29574)
      
      * [Kunlun] bug fix of PR2: Support MultiDevicePass and BKCL in parallel executor  (#29926)
      
      * add bkcl.so in whl for kunlun (#29947)
      
      * [Kunlun] bug fix of PR2: Support MultiDevicePass and BKCL in parallel executor  (#29961)
      Co-authored-by: QingshuChen <qingshu.chen714@gmail.com>
      847aa172
    • [Cherry-pick] Complex network execute support (#29905) · 91ebc460
      Chen Weihang committed
      * [Complex] Add support for complex grad accumulated (#29889)
      
      * add support for complex grad accumulated
      
      * add unittest for coverage
      
      * update test dtype
      
      * remove useless blank line
      
      * [Complex] Handle complex to real after type promotion (#29855)
      
      * try to add fwd op input dtypes
      
      * refactor base impl
      
      * return tmp_ins after dygraph prepare data
      
      * fix typo found in debug
      
      * polish comment & add complex net test
      
      * revert detail change
      
      * fix unittest failed
      
      * add complex kernel condition control
      
      * fix xpu test failed & polish comment
      
      * polish details by review comments
      
      * Complex op test (#29753)
      
      * delete no need to calculate inputs in dygraph op_test
      
      * delete no need to calculate inputs in dygraph op_test
      
      * change grad elementwise_mul for complex types (#29757)
      
      * add conj op for complex types
      
      * add conj for complex types
      
      * add more test case
      
      * add conj_op test
      
      * modify conj api and impl
      
      * add complex type for fill_constant_op xpu
      
      * add setConstant for complex type
      
      * remove complex conj test file
      
      * user define grad for test_conj_op
      
      * add test case for static mode of conj api
      
      * modify conj doc
      
      * change input args name to x
      
      * remove useless codes
      
      * conj support real types
      
      * add conj test case for real number
      
      * delete no need to calculate inputs in dygraph op_test
      
      * delete no need to calculate inputs in dygraph op_test
      
      * modify grad of mul for complex types
      
      * fix the grads of inputs args order not match bug
      
      * change the grad of div when complex types (#29804)
      
      * change the grad of div when complex types
      
      * fix the grads of inputs args order not match bug
      Co-authored-by: chentianyu03 <chentianyu03@baidu.com>
      91ebc460
    • [cherry-pick] #26920 , #22924 (#29948) · bea300dd
      石晓伟 committed
      bea300dd
    • Support mips (#29943) · 5a8d43bb
      Wilber committed
      5a8d43bb
    • cherry pick heter ps (#29955) · a839ddca
      Thunderbrook committed
      * cherry pick heter ps
      
      *  CMakeList
      a839ddca
    • [Inference] FLAGS_call_statck is turned on by default when ON_INFER=ON (#29800) · fae406ae
      Wilber committed
      * [Inference] FLAGS_call_statck is turned on by default when ON_INFER=ON
      
      * cherry-pick 29828
      fae406ae
    • Fix Conv2DTranspose bug when padding='same' (#29915) (#29936) · acb29ff8
      LielinJiang committed
      * fix conv_transpose bug when padding=same
      acb29ff8
    • [cherry-pick] clean redundant API alias in 2.0 - part 1 #29928 (#29960) · c9c835b5
      XiaoguangHu committed
      * [cherry-pick] cherry-pick of PR#29928
      
      * delete paddle.metric.chunk_eval and paddle.metric.mean_iou
      
      * delete paddle.nn.clip and paddle.nn.clip_by_norm
      
      * delete paddle.nn.functional.activation.hard_sigmoid and paddle.nn.functional.activation.hard_swish
      
      * [cherry-pick] cherry-pick of PR#29928
      
      * fix extension import error
      c9c835b5
  6. 28 Dec, 2020 3 commits
    • support some shape for matmul and cast in xpu place (#29900) (#29907) · d84b8e83
      taixiurong committed
      * support some shape in matmul and cast
      
      * modify matmul
      d84b8e83
    • [Cherry-Pick 2.0][Dy2Stat] 1. Fix bug of for-range stmts. 2. Support that step... · a8b6dd86
      liym27 committed
      [Cherry-Pick 2.0][Dy2Stat] 1. Fix bug of for-range stmts. 2. Support that step value is negative in for-range stmts (#29519) (#29874)
      
      1. Fix error in _build_cond_stmt of for-range stmts.
      
      2. Support that step value is negative in for-range stmts
      
      3. Fix code because of the diff between Py2 and Py3
      a8b6dd86
    • [Cherry-pick] Cherry-pick of PR#29579 and PR#29617 (#29904) · 63939597
      Huihuang Zheng committed
      * [Dy2stat] Enable jit.save to Save Without Running (#29579)
      
      Enable jit.save to Save Without Running.
      
      * Modify CublasHandleHolder to Fix Random Unittest Failure. test=develop (#29617)
      
      Modify CublasHandleHolder to use PADDLE_RETRY_CUDA_SUCCESS instead of PADDLE_ENFORCE_CUDA_SUCCESS to fix a random unittest failure. The unittest log showed a CUDA allocation error in this file, which may be due to insufficient GPU resources. We fixed similar failures in the past, so we applied PADDLE_RETRY_CUDA_SUCCESS here.
      63939597
  7. 25 Dec, 2020 5 commits
  8. 24 Dec, 2020 1 commit
    • 2 0 ps core 1 (#29883) · 57843b2c
      tangwei12 committed
      * add ps table (#29463)
      
      * add ps table
      
      Change-Id: I468a04bd071d21ff52654926fcf4d5f3da19e178
      
      * add service (#29560)
      
      * add service, remove ut on mac
      
      * fix heter_profiler & add heter stop method
      
      * fix code style
      57843b2c
  9. 23 Dec, 2020 2 commits
  10. 22 Dec, 2020 4 commits