1. 23 5月, 2019 3 次提交
  2. 22 5月, 2019 3 次提交
    • K
      Enable square operator for the nGraph Bridge. (#17551) · 43d15b9d
      Krzysztof Binias 提交于
      test=develop
      43d15b9d
    • S
      [NGraph] add increment op to ngraph engine (#16929) · f86f49e7
      Sevin F. Varoglu 提交于
      * add increment op to ngraph engine
      
      test=develop
      
      * fix style errors
      
      test=develop
      f86f49e7
    • G
      Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130) · 2281ebf0
      guomingz 提交于
      * Relu6 is the bottleneck op for Mobilenet-v2. As the mkldnn supports the conv/relu6 fusion, we implement it fusion via cpass way. Due to the int8 enabling for this fusion will be supported in MKLDNN v0.20, so this PR is focused on the fp32 optimization.
      
      Below table shows the benchmark(FPS) which measured on skx-8180(28 cores)
      Batch size | with fusion | without fusion
      -- | -- | --
      1 | 214.7 | 53.4
      50 | 1219.727 | 137.280
      
      test=develop
      
      * Fix the format issue
      
      test=develop
      
      * Add the missing nolint comments.
      
      test=develop
      
      * Fix the typos.
      
      test=develop
      
      * Register the conv_brelu_mkldnn_fuse_pass for the MKLDNN engine.
      
      test=develop
      
      * Adjust the indentation.
      
      test=develop
      
      * Add the test_conv_brelu_mkldnn_fuse_pass case.
      
      test=develop
      
      * Slightly update the code per Baidu comments.
      Let the parameter definition embedded into the code.
      That's will make the code easy to understand.
      
      test=develop
      2281ebf0
  3. 21 5月, 2019 5 次提交
  4. 20 5月, 2019 1 次提交
    • L
      Double backward elementwise div (#17416) · 10b23a72
      lvmengsi 提交于
      * double backward, elementwise_div
      
      * fix dx empty. test=develop
      
      * bug fix (#17392)
      
      fix secure bug
      
      * Eanble stack operator for a Ngraph, test=develop (#17406)
      
      * fix sqrt_grad_grad unittest. test=develop (#17410)
      
      * fix sqrt_grad_grad unittest. test=develop
      
      * disable sqrt_grad_grad unittest. test=develop
      
      * test=develop, fix unittest
      
      * test=develop, fix unittest
      
      * test=develop, fix unittest
      
      * test=develop, fix bug
      
      * fix unittest. test=develop
      
      * fix unittest dx. test=develop
      
      * tmp fix! for test... test=develop
      
      * reduce tmp, test=develop
      
      * test=develop, reduce tmp
      
      * fix broadcast unittest. test=develop
      
      * fix format. test=develop
      
      * refine code. test=develop
      
      * refine code. test=develop
      
      * refine GetDoubleGradSafeTensor. test=develop
      
      * fix format. test=develop
      10b23a72
  5. 19 5月, 2019 1 次提交
  6. 18 5月, 2019 1 次提交
  7. 17 5月, 2019 1 次提交
  8. 16 5月, 2019 1 次提交
  9. 15 5月, 2019 6 次提交
  10. 14 5月, 2019 4 次提交
  11. 13 5月, 2019 4 次提交
  12. 12 5月, 2019 1 次提交
  13. 10 5月, 2019 4 次提交
  14. 08 5月, 2019 1 次提交
  15. 07 5月, 2019 4 次提交
    • Z
      Enhance inplace/mem-opt pass and enhance softmax_with_cross_entropy op inplace (#17225) · 4f859408
      Zeng Jinle 提交于
      * add use_cuda to inplace pass,test=develop
      
      * add test softmax_with_xe_inplace test,test=develop
      
      * fix potential inplace bug
      test=develop
      
      * add more skip vars in mem opt pass,test=develop
      
      * follow comment,test=develop
      
      * follow comments,move duplicate out arg check to program->graph,test=develop
      4f859408
    • B
      update sofmax with axis arg test=develop (#17190) · e782b54b
      baojun 提交于
      e782b54b
    • T
      remove unused FLAGS_warpctc_dir (#17162) · ff1661f1
      Tao Luo 提交于
      * remove unused FLAGS_warpctc_dir
      
      test=develop
      
      * remove FLAGS_warpctc_dir
      
      test=develop
      ff1661f1
    • K
      Softmax_cross_entropy op add axis (#16806) · a71d8fdb
      Kaipeng Deng 提交于
      * add attr axis infershape. test=develop
      
      * add CUDA kernel. test=develop
      
      * fix unittest. test=develop
      
      * fix unittest for soft_label. test=develop
      
      * fix fp16 unittest. test=develop
      
      * remove comment code. test=develop
      
      * refine test for axis. test=develop
      
      * add python api. test=develop
      
      * fix doc. test=develop
      
      * fix fp16 unittest. test=develop
      
      * fix ngraph test. test=develop
      
      * fix ENFORCE for test_imperative_transformer. test=develop
      
      * fit for ngraph test. test=develop
      
      * fix after rebase develop. test=develop
      
      * fix doc. test=develop
      
      * fix API.spec. test=develop
      
      * fix test_layers. test=develop
      
      * fix format. test=develop
      a71d8fdb