1. 29 5月, 2019 1 次提交
  2. 22 5月, 2019 1 次提交
    • G
      Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130) · 2281ebf0
      guomingz 提交于
      * Relu6 is the bottleneck op for Mobilenet-v2. As the mkldnn supports the conv/relu6 fusion, we implement it fusion via cpass way. Due to the int8 enabling for this fusion will be supported in MKLDNN v0.20, so this PR is focused on the fp32 optimization.
      
      Below table shows the benchmark(FPS) which measured on skx-8180(28 cores)
      Batch size | with fusion | without fusion
      -- | -- | --
      1 | 214.7 | 53.4
      50 | 1219.727 | 137.280
      
      test=develop
      
      * Fix the format issue
      
      test=develop
      
      * Add the missing nolint comments.
      
      test=develop
      
      * Fix the typos.
      
      test=develop
      
      * Register the conv_brelu_mkldnn_fuse_pass for the MKLDNN engine.
      
      test=develop
      
      * Adjust the indentation.
      
      test=develop
      
      * Add the test_conv_brelu_mkldnn_fuse_pass case.
      
      test=develop
      
      * Slightly update the code per Baidu comments.
      Let the parameter definition embedded into the code.
      That's will make the code easy to understand.
      
      test=develop
      2281ebf0
  3. 28 3月, 2019 1 次提交
    • C
      Fix the interface of Pass::Apply (#16484) · ed61d67c
      chengduo 提交于
      * modify the interface of Pass::Allay
      test=develop
      
      * Polish code
      test=develop
      
      * Fix Travis CI
      test=develop
      
      * fix Pass::Apply interface
      test=develop
      
      * Fix Travis CI
      test=develop
      ed61d67c
  4. 21 2月, 2019 1 次提交
  5. 31 8月, 2018 1 次提交
  6. 29 8月, 2018 1 次提交
  7. 21 8月, 2018 1 次提交
  8. 09 8月, 2018 1 次提交
  9. 26 7月, 2018 4 次提交
  10. 22 7月, 2018 1 次提交
  11. 19 7月, 2018 1 次提交
  12. 18 7月, 2018 4 次提交
  13. 26 6月, 2018 1 次提交
  14. 11 6月, 2018 1 次提交
  15. 08 6月, 2018 1 次提交
  16. 06 6月, 2018 1 次提交
    • Y
      SSA Graph Builder Factory · d9af1532
      yuyang18 提交于
      * Use Builder Chain to decorate new builders. It is easy to extend
        builders.
      * Make graphviz path as a build strategy, not a FLAGS.
      d9af1532
  17. 15 5月, 2018 1 次提交
  18. 14 5月, 2018 2 次提交
  19. 07 4月, 2018 1 次提交
  20. 07 3月, 2018 2 次提交
  21. 06 3月, 2018 2 次提交
  22. 15 2月, 2018 1 次提交
    • Y
      Update tensor_util.h (#8422) · cfffb1a3
      Yi Wang 提交于
      * Update tensor_util.h
      
      * Update with moved TensorDesc
      
      * Fix tensur_utils.cu
      
      * Update
      
      * Update
      
      * Update
      
      * Update
      
      * Make tensor_util.cu a symbolic link
      cfffb1a3
  23. 10 2月, 2018 2 次提交
  24. 07 2月, 2018 1 次提交
  25. 06 2月, 2018 2 次提交
  26. 01 2月, 2018 1 次提交
  27. 31 1月, 2018 1 次提交
  28. 30 1月, 2018 1 次提交