1. 02 Mar, 2021 8 commits
    • fix sycn training error (#31357) · 5d7a8b05
      tangwei12 committed
      * fix sycn training error
      
      Change-Id: Ie2feebcf0b5b2984fd59cfcdde0c817840e203d2
    • fix ELU output for nan, test=develop (#31132) · ec72f5b2
      Qi Li committed
    • [ROCM] update fluid operators for rocm (part5), test=develop (#31258) · 65bcaeb0
      Qi Li committed
      * [ROCM] update fluid operators for rocm (part5), test=develop
      
      * address review comments, test=develop
      
      * fix typo, test=develop
    • add n-d input support for trt scale converter (#31316) · 2e9e3fad
      Pei Yang committed
      * add n-d input support for trt scale converter
      
      * add flatten for ut
      
      * fix dims
    • support trt serialize when load model from memory (#31342) · 6404c438
      Shang Zhizhou committed
      * support trt serialize when load model from memory
      
      * delete conv_bn_fuse_pass before tensorrt, with which trt serialize engine id is not stable
      
      * Revert "delete conv_bn_fuse_pass before tensorrt, with which trt serialize engine id is not stable"
      
      performance degradation, fix in the future
      
      This reverts commit fa6cd17e60b15df351efda379ddd00e9e9c1fea9.
      
      * add delete conv_bn
      
      * delete path when delete_cache_files
    • lamb_op_xpu;test=kunlun (#31012) · d79fdc3d
      Gradie committed
      * lamb_op_xpu;test=kunlun
      
      * modify lamb_op_xpu.cc;test=kunlun
      
      * delete atol lamb_op_xpu; test=kunlun
      
      * update xpu.cmake;test=kunlun
      
      * test_error 1e-5,lamb_op_xpu;test=kunlun
      
      * error1e-5,lamb_op_xpu,test=kunlun
      
      * delete atol lamb_xpu;test=kunlun
      
      * modify atol,lamb_op_xpy;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu, XPUOptest;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu,modify xpu_cmake; test=kunlun
      
      * lamb_op_xpu;test=kunlun
      
      * lamb_op_xpu,modify xpucmake;test=kunlun
    • topo and memory performance for heterps (#30440) · d1075df2
      danleifeng committed
      * topo and memory performance for heterps; test=develop
      * add trainwithprofiler in heter trainier; test=develop
    • 72d99c5d
  2. 01 Mar, 2021 10 commits
  3. 28 Feb, 2021 1 commit
  4. 27 Feb, 2021 2 commits
  5. 26 Feb, 2021 7 commits
  6. 25 Feb, 2021 8 commits
  7. 24 Feb, 2021 4 commits