1. 24 6月, 2022 1 次提交
    • Y
      [Phi]Change Copy from Kernel to basic component utils (#43622) · 2739bd73
      YuanRisheng 提交于
      * perfect copy
      
      * deal with conflict
      
      * deal with conflict
      
      * fix compile bugs
      
      * fix unittest bugs
      
      * change code format
      
      * deal with conflict
      
      * modify code by review
      
      * fix ce bugs
      
      * fix ce bugs
      
      * add lo
      
      * perfect code format
      
      * deal with conflicts
      2739bd73
  2. 07 6月, 2022 1 次提交
  3. 05 6月, 2022 1 次提交
  4. 04 6月, 2022 1 次提交
  5. 15 4月, 2022 1 次提交
    • L
      Change cuDNN Conv kernel for auto tune feature (#41313) · 35acfeda
      limingshu 提交于
      * change cudnn helper for auto-tune
      
      * Add FLAGS_use_autotune to set the global status of autotune and change the order of choosing algorithm.
      
      * Fix the bug in calculating and printing current step cache hit rate.
      
      * Improve the autotune cache and fix unittest.
      
      * Change the key from AlgorithmType to int64_t.
      
      * Fix unittest for cpu-only env.
      
      * change ChooseAlgoByWorkspace for heuristic mode
      Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
      35acfeda
  6. 09 4月, 2022 1 次提交
  7. 06 4月, 2022 1 次提交
  8. 05 4月, 2022 1 次提交
    • Z
      Implement AutoTuneStatus class for Kernel Auto Tune (#41218) · b0f8000e
      Zhang Ting 提交于
      * switch autotune
      
      * implement AutoTuneCache
      
      * implement AutoTuneCache class
      
      * add pybind api
      
      * add dygraph test
      
      * support static mode and eager mode and improve unittests
      
      * rename the SwitchAutoTune Class and improve tests
      
      * improve AutoTuneStatus and reduce the cost of tests
      b0f8000e
  9. 31 3月, 2022 2 次提交
    • Z
      7dfd3846
    • L
      add_autotune_kernel_tool (#40658) · 7c5dca9f
      limingshu 提交于
      * for 1st time interface combine.
      
      * modification with kernel factory
      
      * first auto_tune version.
      
      * first version.
      
      * basic version
      
      * add warm up step.
      
      * a debug version.
      
      * optimize the functionality of class auto_tuner.
      
      * add some quotes for optimized auto_tuner class.
      
      * add some quotes for optimized auto_tuner class.
      
      * add namespace.
      
      * modification according to the advices
      
      * replace fluid header with phi header.
      
      * replace fluid header with phi header.
      7c5dca9f
  10. 25 3月, 2022 1 次提交
  11. 23 3月, 2022 1 次提交