1. 25 Aug, 2022 11 commits
    • H
      optimize conv algo cache (#41891) · 1cd7e68b
      hong committed
      * optimize conv algo speed
      
      * code polish
      
      * remove useless code
      
      * fix compile error
      
      * fix cpu compile error
      
      * not use cudnn algo t
      
      * add search cache max number
      
      * polish code
      
      * fix cache test bug
      
      * add groups data format to conv args
      
      * fix cache test bug
      
      * fix cudnn_deterministic bug
      
      * fix test switch auto tune bug
      
      * fix test switch autotune bug
      
      * fix conv cache bug
      
      * fix cache test error
      
      * fix cache test bug
      
      * fix windows mac compile error
      
      * fix workspace search error
      
      * update cudnn cache
      
      * fix cache test bug; test=develop
      
      * fix autotune switch test error
      
      * polish code
      
      * polish code
    • Z
      Fl-PS bug fix (#45413) · f2f3f6e7
      ziyoujiyi committed
      * back fl
      
      * delete ssl cert
      
      * .
      
      * make warning
      
      * .
      
      * unittest paral degree
      
      * solve unittest
      
      * heter & multi cloud comm ready
      
      * .
      
      * .
      
      * fl-ps v1.0
      
      * .
      
      * support N + N mode
      
      * .
      
      * .
      
      * .
      
      * .
      
      * delete print
      
      * .
      
      * .
      
      * .
      
      * .
      
      * fix bug
      
      * .
      
      * .
      
      * fl-ps with coordinator ready
      
      * merge dev
      
      * update message parse only
      
      * update fl client scheduler
      
      * fix bug
      
      * update multithreads sync
      
      * fix ci errors
      
      * update role_maker.py
      
      * update role_maker.py
      
      * fix ci error: windows py import error
      
      * fix ci error: windows py import error
      
      * fix windows ci pylib import error
      
      * add dump fields & params
      
      * try to fix windows import fleet error
      
      * fix ps FLAGS error
      
      * fix logging risk
      
      * fix logging possible risk
      
      * write trainer_desc file
      
      * support split sparse params in local & remote
      
      * fix import paddle.fluid.core.PSGPU
      
      * fix import paddle.fluid.core.PSGPU
      
      * add remote_sparse & local_sparse config
      
      * fix unittest
      
      * fix test_dist_fleet_geo table error
      
      * fix PADDLE_ENFORCE error
      
      * fix other's pr conflict
      
      * forbidden ssd table
      
      * .
      
      * recover ssd table code
      
      * recover file mode
    • R
      [triu_indices] add triu_indices_op (#45168) · a410c397
      Rayman committed
    • W
      fix params sync multi times problem (#45406) · 20d38664
      Wilber committed
    • H
    • J
      [Auto Parallel] Support High Order Differential with Data Parallel Calc-Comm Overlapping (#45388) · bdd0b0f1
      JZ-LIANG committed
      * support high order differential with data parallel overlap
      
      * update unittest
    • U
      fix roi_align_op_npu to pass the unittest (#45310) · 256bf6ff
      USTCKAY committed
    • S
      Fix unique_kernel bugs (#45032) · ea1f4702
      sprouteer committed
      * fix unique_kernel bugs
      
      * fix unique kernel cu bugs
    • H
      Fix relu python call (#45082) · 839fac65
      hong committed
      * add python final state
      
      * fix bug
      
      * fix bugs
      
      * fix bug
      
      * fix bug
      
      * revert impl, final state mul not support selected rows
      
      * fix softmax use cudnn error
      
      * add softlabel false unittest
      
      * revert loss.py
    • H
      add temporal shift and grad *test=kunlun (#45300) · 63d9a175
      haosicheng committed
    • Z
      enforce_reshape (#45386) · 0bf40070
      zhoutianzi666 committed
  2. 24 Aug, 2022 27 commits
  3. 23 Aug, 2022 2 commits