1. 29 May, 2019 1 commit
  2. 27 May, 2019 1 commit
  3. 24 May, 2019 2 commits
  4. 23 May, 2019 2 commits
  5. 22 May, 2019 1 commit
    • G
      Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130) · 2281ebf0
      guomingz committed
      * Relu6 is the bottleneck op for Mobilenet-v2. Since MKLDNN supports the conv/relu6 fusion, we implement this fusion as a pass. Because int8 support for this fusion will only be available in MKLDNN v0.20, this PR focuses on the fp32 optimization.
      
      The table below shows the benchmark results (FPS) measured on skx-8180 (28 cores):

      Batch size | With fusion (FPS) | Without fusion (FPS)
      -- | -- | --
      1 | 214.7 | 53.4
      50 | 1219.727 | 137.280
      
      test=develop
      
      * Fix the format issue
      
      test=develop
      
      * Add the missing nolint comments.
      
      test=develop
      
      * Fix the typos.
      
      test=develop
      
      * Register the conv_brelu_mkldnn_fuse_pass for the MKLDNN engine.
      
      test=develop
      
      * Adjust the indentation.
      
      test=develop
      
      * Add the test_conv_brelu_mkldnn_fuse_pass case.
      
      test=develop
      
      * Slightly update the code per Baidu comments.
      Embed the parameter definitions in the code.
      That makes the code easier to understand.
      
      test=develop
      2281ebf0
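
The conv/relu6 fusion described in the commit above can be illustrated with a toy sketch. This is not the Paddle or MKLDNN implementation: the 1-D convolution and every function name below are hypothetical stand-ins, chosen only to show that applying the bounded ReLU while each convolution output is produced gives the same result as running the two ops separately, without the intermediate buffer.

```python
def relu6(x):
    """Bounded ReLU with upper bound 6: min(max(x, 0), 6)."""
    return min(max(x, 0.0), 6.0)


def conv1d(signal, kernel):
    """Valid-mode 1-D cross-correlation (toy stand-in for a conv op)."""
    n = len(signal) - len(kernel) + 1
    return [
        sum(signal[i + j] * kernel[j] for j in range(len(kernel)))
        for i in range(n)
    ]


def conv_then_relu6(signal, kernel):
    """Unfused: two separate ops with an intermediate buffer between them."""
    intermediate = conv1d(signal, kernel)
    return [relu6(v) for v in intermediate]


def fused_conv_relu6(signal, kernel):
    """Fused: the clamp is applied as each output value is produced."""
    n = len(signal) - len(kernel) + 1
    out = []
    for i in range(n):
        acc = 0.0
        for j in range(len(kernel)):
            acc += signal[i + j] * kernel[j]
        out.append(relu6(acc))  # activation applied in the same loop
    return out


# Hypothetical input values, chosen to hit both clamp bounds.
signal = [1.0, -2.0, 3.0, 8.0, -1.0]
kernel = [1.0, 0.5]
unfused = conv_then_relu6(signal, kernel)
fused = fused_conv_relu6(signal, kernel)
```

Both paths produce identical outputs here. In a real kernel the benefit would come from skipping the extra pass over memory, which this sketch does not measure.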
  6. 20 May, 2019 1 commit
  7. 15 May, 2019 1 commit
  8. 10 May, 2019 1 commit
    • Q
      Double backward of conv2d. (#17211) · e32c9888
      qingqing01 committed
      * Add conv2d_grad_grad_op
      * Extract the cuDNN conv algo searching code into conv_cudnn_helper.h.
          - Now use it in conv2d_grad_grad.
          - Will simplify the searching code in conv2d and conv2d_grad in the next PR.
      * Enhance the unit testing of gradient_checker and fix a bug in it.
      * Support fetching empty variables; return None in Python.
      e32c9888
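
As a rough illustration of what checking a double-backward (grad-of-grad) op involves, the sketch below compares an analytic second derivative with a central finite difference of the analytic first derivative. It is a scalar toy example under stated assumptions, not Paddle's gradient_checker: the function `f(x) = x**3` and all names are hypothetical.

```python
def numeric_grad(func, x, eps=1e-5):
    """Central finite-difference estimate of d(func)/dx at x."""
    return (func(x + eps) - func(x - eps)) / (2 * eps)


def f(x):
    """Toy forward function (hypothetical stand-in for an op)."""
    return x ** 3


def grad_f(x):
    """Analytic first derivative, playing the role of the backward op."""
    return 3 * x ** 2


def grad_grad_f(x):
    """Analytic second derivative, playing the role of the double-backward op."""
    return 6 * x


x = 1.5
# First-order check: backward op against finite differences of the forward op.
first_error = abs(numeric_grad(f, x) - grad_f(x))
# Second-order check: double-backward op against finite differences of the backward op.
second_error = abs(numeric_grad(grad_f, x) - grad_grad_f(x))
```

The second-order check reuses the same finite-difference routine, applied to the first derivative instead of the forward function.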
  9. 08 May, 2019 3 commits
  10. 07 May, 2019 1 commit
  11. 30 April, 2019 1 commit
  12. 28 April, 2019 1 commit
  13. 23 April, 2019 1 commit
  14. 21 April, 2019 1 commit
    • Z
      Refine model gpu memory (#16993) · 1202d3fc
      Zeng Jinle committed
      * speedup gc and inplace softmax_with_cross_entropy_grad
      test=develop
      
      * refine models gpu mem
      Merge skip vars and warning messages of mem opt
      remove relu mem opt
      test=develop
      
      * follow comments
      test=develop
      1202d3fc
  15. 18 April, 2019 1 commit
  16. 16 April, 2019 2 commits
    • X
      fix infershape bug · 5663fbfb
      xuezhong committed
      test=develop
      5663fbfb
    • J
      [MKL-DNN] Added reusing of primitive descriptors (fp32) (#16667) · 87a44b11
      Jacek Czaja committed
      * - Reuse of conv PD
      
      - conv transpose pd reused
      
      - Added PD reusing of softmax and Batch Norm
      
      - Refactoring and removal of not needed routines of mkl-dnn ops
      
      test=develop
      
      - Fix to reusing conv
      
      test=develop
      
      - Lint fixes
      
      test=develop
      
      - Further lint fixes
      
      test=develop
      
      - Lint  fixes
      
      test=develop
      
      - lint fixes
      
      test=develop
      
      - Lint workaround
      
      test=develop
      
      * - Fix after review on including boost as third party header
      
      test=develop
      
      * - Fix after review. Name change to something more descriptive
      
      test=develop
      87a44b11
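
The idea of reusing primitive descriptors (PDs) can be sketched as a cache keyed by the configuration that determines a descriptor. The commit message does not say how the reuse is implemented, so the following is only an assumed pattern: `DescriptorCache`, `make_conv_key`, and the dictionary used as a "descriptor" are hypothetical and are not part of the MKL-DNN or Paddle APIs.

```python
class DescriptorCache:
    """Stores descriptors by key so each one is built only once."""

    def __init__(self):
        self._cache = {}
        self.builds = 0  # counts how many times a descriptor was built

    def get_or_create(self, key, build):
        """Return the cached descriptor for key, building it on first use."""
        if key not in self._cache:
            self._cache[key] = build()
            self.builds += 1
        return self._cache[key]


def make_conv_key(input_shape, filter_shape, strides, paddings):
    """Hashable key from the parameters assumed to determine a conv descriptor."""
    return (
        "conv2d",
        tuple(input_shape),
        tuple(filter_shape),
        tuple(strides),
        tuple(paddings),
    )


def build_descriptor():
    """Hypothetical stand-in for an expensive descriptor construction."""
    return {"kind": "conv2d_pd"}


cache = DescriptorCache()
key = make_conv_key([1, 3, 224, 224], [32, 3, 3, 3], [2, 2], [1, 1])
first = cache.get_or_create(key, build_descriptor)
second = cache.get_or_create(key, build_descriptor)  # served from the cache
```

With identical parameters the second lookup returns the same object and does not trigger another build.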
  17. 11 April, 2019 1 commit
  18. 03 April, 2019 1 commit
  19. 02 April, 2019 1 commit
    • C
      Model data cryption link all lib (#16555) · c38c7c56
      Chen Weihang committed
      * link the libwbaes.so into paddle
      
      * polish detail, test=develop
      
      * try fix mac_pr_ci error, test=develop
      
      * add compile option, test=develop
      
      * fix ci error, test=develop
      
      * ignore failed to find mac lib, test=develop
      
      * change cdn to bj, cdn can't get the latest version
      
      * trigger ci, test=develop
      
      * temporary delete win32 lib linking, test=develop
      
      * change https to http, test=develop
      
      * turn compile option on to off
      
      * turn compile option off to on, test=develop
      
      * try lib compiled by gcc4.8, test=develop
      
      * update lib version, test=develop
      
      * link other lib, test=develop
      
      * add setup config
      
      * delete false, test=develop
      
      * delete no_soname, test=develop
      
      * recover so name set
      
      * fix, test=develop
      
      * adjust make config, test=develop
      
      * remove link to wbaes, test=develop
      
      * remove useless define, test=develop
      c38c7c56
  20. 30 March, 2019 1 commit
  21. 29 March, 2019 12 commits
  22. 28 March, 2019 3 commits
    • J
      [MKL-DNN] Tensor modifications revert (#16462) · 26323274
      Jacek Czaja committed
      * Revert "[MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233)"
      
      This reverts commit 13816dd4.
      Apart from enabling transformer for MKL-DNN
      
      * Revert "- MKL-DNN pooling updated to set_prim_desc"
      
      This reverts commit c63f6b20.
      
      Conflicts:
      	paddle/fluid/operators/mkldnn/concat_mkldnn_op.cc
      
      * Revert "[MKL-DNN] MKL-DNN specific Tensor modification (#15429)"
      
      test=develop
      
      This reverts commit dec9cf53.
      
      * - concat compilation fix
      
      - lint
      
      test=develop
      
      - Lint fixes
      
      test=develop
      
      - Lint fixes
      
      test=develop
      
      - Fix Transpose MKLDNN op
      
      test=develop
      26323274
    • S
      fix travis ci · 5656fa9f
      sneaxiy committed
      test=develop
      5656fa9f
    • Z
      Revert "Fix allocator bug" · 174d0d0b
      Zeng Jinle committed
      add include headers to fix travis-ci
      test=develop
      174d0d0b