1. 16 10月, 2020 1 次提交
    • L
      [oneDNN] Conv dilation support (#27914) · 7cb4a8b8
      lidanqing 提交于
      * conv dilated mkldnn support: forward and backward pass
      
      * add mkldnn conv_transpose dilation UT
      test=develop
      
      * remove unnecessary PADDLE_ENFORCE
      
      * add int8 and bf16 dilated conv UT
      
      * update according to reviews
      7cb4a8b8
  2. 02 9月, 2020 1 次提交
  3. 26 5月, 2020 1 次提交
    • L
      Update PADDLE_ENFORCE in DNNL related ops (#24333) · c3c61d34
      lidanqing 提交于
      * Update PADDLE_ENFORCE in DNNL related ops
      test=develop
      
      * Abstract macro of OP_GET_PLACE_CHECK
      test=develop
      
      * update according to reviews
      
      * update GET_PLACE_CPU_CHECK
      
      * fix typo
      test=develop
      
      * revert macro
      test=develop
      c3c61d34
  4. 14 5月, 2020 1 次提交
    • F
      update conv error info (#24430) · 526a2117
      FDInSky 提交于
      * test=develop update conv error info
      
      * test=develop update iou_similarity error info
      
      * test=develop update some error info based review
      526a2117
  5. 10 12月, 2019 1 次提交
    • A
      MKL-DNN 1.0 Update (#20162) · e81f0228
      Adam 提交于
      * MKLDNN v1.0 rebase to Paddle 1.6
      test=develop
      
      * Add hacky paddle::string::to_string() implementation
      
      * vectorize<int64-t>() -> vectorize() cleanup
      test=develop
      
      * PADDLE_ENFORCE and void_cast fixes
      test=develop
      
      * Rebase changes
      test=develop
      
      * Cosmetics
      test=develop
      
      * Delete MKL from mkldnn.cmake
      test=develop
      
      * CMake debug commands
      test=develop
      
      * Delete MKLDNN_VERBOSE and rebase fixes
      test=develop
      
      * Rebase fixes
      test=develop
      
      * Temporarily disable int8 resnet101 vgg16 and vgg19 tests
      test=develop
      
      * Add libmkldnn.so.1 to python setup
      test=develop
      
      * Add libmkldnn.so.1 to inference_lib cmake after rebase
      test=develop
      
      * Post rebase fixes + FC int8 changes
      test=develop
      
      * Fix LRN NHWC
      test=develop
      
      * Fix NHWC conv3d
      test=develop
      
      * Windows build fix + next conv3d fix
      test=develop
      
      * Fix conv2d on AVX2 machines
      test=develop
      e81f0228
  6. 03 12月, 2019 1 次提交
  7. 29 11月, 2019 1 次提交
    • H
      Add dygraph execution context (#20157) · ac854670
      hong 提交于
      * add_dygraph_execution_context
      
      * add dygraph infershape context and execution context; test=develop
      
      * fix imperative bug; test=develop
      
      * remove inputs outputs interface from execution context,
      because it have same function with inputNames;
      test=develop
      
      * remove tracer_test ctest; test=develop
      
      * fix split op bug; test=develop
      
      * fix unitests bug; test=develop
      
      * fix distribute test bug; test=develop
      
      * fix ngraph compile bug; test=develop
      
      * fix grad maker bug; test=develop
      
      * fix load op bugs; test=develop
      
      * fix operator.cc construct bug; test=develop
      
      * remove useless name find in operator; test=develop
      
      * add tracer_test; test=develop
      
      * fix concat, split bug; test=develop
      
      * remove tracer_test unitest; test=develop
      
      * fix attribute check bug; test=develop
      
      * add test code to fix converage; test=develop
      
      * remove useless code, change check backward input in engin; test=develop
      
      * unlock var type infer shape;test=develop
      
      * add ShareAllLoD api; test=develop
      
      * add dygraph infershape context unitest; test=develop
      
      * remove increase and decrease lod in dygraph; test=develop
      
      * addd override; test=develop
      
      * fix increase descrease lod; test=develop
      
      * fix paddle_enforce; test=develop
      
      * disable lod op dygraph check; test=develop
      
      * fix paddle enforce error; test=develop
      
      * add comment for op_registry and OperatorBase; test=develop
      
      * optimize the comment of op_registry; test=develop
      
      * fix format of comment; test=develop
      
      * fix format of comment; test=develop
      
      * optimize the format of comment; test=develop
      
      * optimize the format of the comment; test=develop
      
      * optimize comment of op_registry; test=develop
      ac854670
  8. 07 11月, 2019 1 次提交
  9. 17 10月, 2019 1 次提交
    • J
      [MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241) · a1cd27f1
      Jacek Czaja 提交于
      * - Flushing mkl-dnn cache
      
      test=develop
      
      - Disabled clearing cache for LoadModel
      
      - Added clearing of mkl-dnn cache when Executor is created
      
      test=develop
      
      - Do not clear for GPU places
      
      test=develop
      
      - compilation fix
      
      test=develop
      
      * - Moved clearing of mkl-dnn cache in destructor of executor
      
      test=develop
      
      * - Compilation fix
      
      test=develop
      
      - Reverted conditional clearing of mkl-dnn cache in Executors's
        destructor
      
      test=develop
      
      - compilation fix
      a1cd27f1
  10. 14 9月, 2019 1 次提交
  11. 04 9月, 2019 1 次提交
  12. 03 9月, 2019 1 次提交
  13. 29 8月, 2019 1 次提交
  14. 15 8月, 2019 1 次提交
  15. 10 6月, 2019 1 次提交
  16. 22 5月, 2019 1 次提交
    • G
      Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130) · 2281ebf0
      guomingz 提交于
      * Relu6 is the bottleneck op for Mobilenet-v2. As the mkldnn supports the conv/relu6 fusion, we implement it fusion via cpass way. Due to the int8 enabling for this fusion will be supported in MKLDNN v0.20, so this PR is focused on the fp32 optimization.
      
      Below table shows the benchmark(FPS) which measured on skx-8180(28 cores)
      Batch size | with fusion | without fusion
      -- | -- | --
      1 | 214.7 | 53.4
      50 | 1219.727 | 137.280
      
      test=develop
      
      * Fix the format issue
      
      test=develop
      
      * Add the missing nolint comments.
      
      test=develop
      
      * Fix the typos.
      
      test=develop
      
      * Register the conv_brelu_mkldnn_fuse_pass for the MKLDNN engine.
      
      test=develop
      
      * Adjust the indentation.
      
      test=develop
      
      * Add the test_conv_brelu_mkldnn_fuse_pass case.
      
      test=develop
      
      * Slightly update the code per Baidu comments.
      Let the parameter definition embedded into the code.
      That's will make the code easy to understand.
      
      test=develop
      2281ebf0
  17. 16 4月, 2019 1 次提交
    • J
      [MKL-DNN] Added reusing of primitive descriptors (fp32) (#16667) · 87a44b11
      Jacek Czaja 提交于
      * - Reuse of conv PD
      
      - conv transpose pd reused
      
      - Added PD reusing of softmax and Batch Norm
      
      - Refactoring and removal of not needed routines of mkl-dnn ops
      
      test=develop
      
      - Fix to reusing conv
      
      test=develop
      
      - Lint fixes
      
      test=develop
      
      - Further lint fixes
      
      test=develop
      
      - Lint  fixes
      
      test=develop
      
      - lint fixes
      
      test=develop
      
      - Lint workaround
      
      test=develop
      
      * - Fix after review on including boost as third party header
      
      test=develop
      
      * - Fix after review. Name change to something more descriptive
      
      test=develop
      87a44b11
  18. 28 3月, 2019 1 次提交
    • J
      [MKL-DNN] Tensor modifications revert (#16462) · 26323274
      Jacek Czaja 提交于
      * Revert "[MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233)"
      
      This reverts commit 13816dd4.
      Apart from enabling transformer for MKL-DNN
      
      * Revert "- MKL-DNN pooling updated to set_prim_desc"
      
      This reverts commit c63f6b20.
      
      Conflicts:
      	paddle/fluid/operators/mkldnn/concat_mkldnn_op.cc
      
      * Revert "[MKL-DNN] MKL-DNN specific Tensor modification (#15429)"
      
      test=develop
      
      This reverts commit dec9cf53.
      
      * - concat compilation fix
      
      - lint
      
      test=develop
      
      - Lint fixes
      
      test=develop
      
      - Lint fixes
      
      test=develop
      
      - Fix Transpose MKLDNN op
      
      test=develop
      26323274
  19. 26 2月, 2019 1 次提交
    • J
      - MKL-DNN pooling updated to set_prim_desc · c63f6b20
      Jacek Czaja 提交于
      - MKLDNN ops revisited
      
      - disabled softmax modifications
      
      - disabled elementwise_add
      
      - reverted LRN modifications
      
      - reverted SUM primitive
      
      - Partial reviing of softmax
      
      - Enable softmax
      
      - Softmax changes
      
      - LRN is back
      
      - LRN partially disabled
      
      - LRN is back
      
      - LRN fix
      
      - compilation fixes
      
      - Sum fixed(hopefully)
      
      - Enabling (partially) elementwise_add
      
      - Fixes to elemenwise_add
      
      - Lint fixes
      
      quantize fix
      
      - compilation fix
      
      test=develop
      
      Disabling pooling
      
      - Disabled quantize op
      
      test=develop
      c63f6b20
  20. 29 1月, 2019 1 次提交
  21. 27 11月, 2018 1 次提交
    • J
      - conv2d transpose MKL-DNN · fb24690a
      Jacek Czaja 提交于
      test=develop
      
      - Added new header for MKLDNN reuse functionality
      
      - Extended conv2d_transpose GetExpectedKernelType for MKL-DNN supporrt
      
      - Buildable conv transpose mkldnn and conv mkldnn using conv template
      
      - Conv2d transpose roughlt implemented and buildable
      
      - Added modifications conv2d transpose MKLDNN unit tests
      
      - Fix to UT of conv2d transpose mkldnn op
      
      - Wrong type of MKLDNN primitive was chosen for conv2d transpose
      
      - HAcks for conv2d transpose
      
      - UT enalbed
      
      - Replaced copying loop with memcpy
      
      - Draft of passing lambda into AcquireMemory
      
      - Made reorder (IOHW->OIHW) to be called only once
      fb24690a