1. 22 5月, 2019 1 次提交
    • G
      Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130) · 2281ebf0
      guomingz 提交于
      * Relu6 is the bottleneck op for Mobilenet-v2. As the mkldnn supports the conv/relu6 fusion, we implement it fusion via cpass way. Due to the int8 enabling for this fusion will be supported in MKLDNN v0.20, so this PR is focused on the fp32 optimization.
      
      Below table shows the benchmark(FPS) which measured on skx-8180(28 cores)
      Batch size | with fusion | without fusion
      -- | -- | --
      1 | 214.7 | 53.4
      50 | 1219.727 | 137.280
      
      test=develop
      
      * Fix the format issue
      
      test=develop
      
      * Add the missing nolint comments.
      
      test=develop
      
      * Fix the typos.
      
      test=develop
      
      * Register the conv_brelu_mkldnn_fuse_pass for the MKLDNN engine.
      
      test=develop
      
      * Adjust the indentation.
      
      test=develop
      
      * Add the test_conv_brelu_mkldnn_fuse_pass case.
      
      test=develop
      
      * Slightly update the code per Baidu comments.
      Let the parameter definition embedded into the code.
      That's will make the code easy to understand.
      
      test=develop
      2281ebf0
  2. 21 5月, 2019 6 次提交
  3. 20 5月, 2019 2 次提交
    • Q
      Optimize communicator flags (#17494) · 287de41c
      Qiao Longfei 提交于
      * optimize communicator flag
      
      * change flags in init py test=develop
      287de41c
    • L
      Double backward elementwise div (#17416) · 10b23a72
      lvmengsi 提交于
      * double backward, elementwise_div
      
      * fix dx empty. test=develop
      
      * bug fix (#17392)
      
      fix secure bug
      
      * Eanble stack operator for a Ngraph, test=develop (#17406)
      
      * fix sqrt_grad_grad unittest. test=develop (#17410)
      
      * fix sqrt_grad_grad unittest. test=develop
      
      * disable sqrt_grad_grad unittest. test=develop
      
      * test=develop, fix unittest
      
      * test=develop, fix unittest
      
      * test=develop, fix unittest
      
      * test=develop, fix bug
      
      * fix unittest. test=develop
      
      * fix unittest dx. test=develop
      
      * tmp fix! for test... test=develop
      
      * reduce tmp, test=develop
      
      * test=develop, reduce tmp
      
      * fix broadcast unittest. test=develop
      
      * fix format. test=develop
      
      * refine code. test=develop
      
      * refine code. test=develop
      
      * refine GetDoubleGradSafeTensor. test=develop
      
      * fix format. test=develop
      10b23a72
  4. 19 5月, 2019 1 次提交
  5. 18 5月, 2019 1 次提交
  6. 17 5月, 2019 3 次提交
  7. 16 5月, 2019 2 次提交
  8. 15 5月, 2019 6 次提交
  9. 14 5月, 2019 6 次提交
  10. 13 5月, 2019 4 次提交
    • Y
      Optimize the computing kernel of sequence_reverse operator (#17349) · 218d8d8f
      Yihua Xu 提交于
      * Optimize the computing kernel of sequence_reverse operator.
      
      test=develop
      
      * Clean code
      
      test=develop
      
      * Fix for cpplint syntax checking.
      
      test=develop
      
      * Fix the compile warning issue.
      
      test=develop
      218d8d8f
    • Y
      Optimize the elementwise op using eigen (#15494) · dcda2023
      Yiqun Liu 提交于
      * Optimize the elementwise op with CUDA kernels.
      test=develop
      
      * Support setting of attr in op config file.
      test=develop
      
      * Add the support the setting dtype and initializer in config.
      test=develop
      
      * Save workspace.
      
      * Add initializer "zeros".
      test=develop
      
      * Fix compiling error.
      
      * Support the use of existed file to initailize tensor in op_tester.
      
      * Use eigen to optimize the elementwise_add/mul for the case that x and y have the same dims.
      test=develop
      dcda2023
    • K
      add double grad for elementwise_mul op (#17255) · 8bae8590
      Kaipeng Deng 提交于
      * add double grad for elementwise_mul. test=develop
      
      * remove comment. test=develop
      
      * fix grad sum. test=develop
      
      * fix for axis expand. test=develop
      
      * add test for axis expand. test=develop
      8bae8590
    • K
      add double grad for square op (#17173) · 11d3a38f
      Kaipeng Deng 提交于
      * add double grad for square. test=develop
      
      * formax code. test=develop
      
      * fix for grad sum. test=develop
      
      * refine shape. test=develop
      
      * refine extract. test=develop
      11d3a38f
  11. 10 5月, 2019 4 次提交
  12. 09 5月, 2019 2 次提交
  13. 08 5月, 2019 2 次提交