1. 09 4月, 2022 3 次提交
    • L
      Autotune the workspace_size_limit in conv. (#40338) · b937cdc5
      limingshu 提交于
      * Using the maximum workspace_size of all alogirhms to limit the workspace size in exhaustive search mode.
      
      * Use the system cudaMalloc and cudaFree to allocate workspace during searching.
      
      * Enable switch of two kind of workspace setting methods.
      Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
      b937cdc5
    • C
      Add get profiler from config (#41532) · e1792a31
      chenjian 提交于
      * no
      
      * maintain old profiler
      
      * add get profiler from serialization config
      
      * add unit test
      
      * improve coverage
      
      * fix
      
      * Revert "improve coverage"
      
      This reverts commit 4a980bfda48adadee551d0e1c5740bc5b7389200.
      
      * fix unit
      
      * fix
      
      * fix
      e1792a31
    • J
      fix_ci_problem3 (#41484) · 9cb2287c
      Jiabin Yang 提交于
      * fix_ci_problem3
      
      * support windows no default error
      9cb2287c
  2. 08 4月, 2022 8 次提交
  3. 07 4月, 2022 20 次提交
  4. 06 4月, 2022 9 次提交