1. 29 3月, 2020 1 次提交
    • Z
      Improve elementwise performance. (#23001) · 58615a62
      zhaoyuchen2018 提交于
      * Improve elementwise performance.
      
      Elementwise performace is poor as walk into CommonGradBroadcastCUDA, add some new kernels for different data pattern.
      
      * Add some cuda kernel to speedup common broadcast cases. test=develop
      
      * Add more test cases and fix cuda kernel bug. test=develop
      
      * Remove tests as cpu percision fails.test=develop
      
      * Refine SplitDims, test=develop
      
      * Change file mode, test=develop
      58615a62
  2. 28 3月, 2020 2 次提交
  3. 27 3月, 2020 6 次提交
  4. 26 3月, 2020 14 次提交
  5. 25 3月, 2020 14 次提交
  6. 24 3月, 2020 3 次提交