1. 12 Dec, 2018 1 commit
  2. 15 Nov, 2018 1 commit
    • add mkldnn prop_kind phase for inference-only case to pooling and activations (#14278) · 8a1eeec5
      Sylwester Fraczek committed
      * add is_test to pooling and activations
      
      add prop_kind support for activation, conv and pooling layers
      
      add a pass that sets is_test to true
      
      add transpiler version of is_test pass
      
      test=develop
      
      * patch test and pass
      
      test=develop
      
      * add pass to analyzer.h
      
      test=develop
      
      * add is_test attr description & pass only on mkldnn
      
      in:
      activation_op.cc
      batch_norm_op.cc
      conv_op.cc
      dropout_op.cc
      lrn_op.cc
      pool_op.cc
      sequence_pool_op.cc
      softmax_op.cc
      
      * fix is_test handling for activation pool and conv
      
      * change description of is_test for all layers again
      
      * remove GetAttr(use_mkldnn) from pass
      
      * rename correct_mkldnn_test_phase to is_test
      
      and remove dependency on MKLDNN
      test=develop
      
      * review fix magic number
      
      * two if(..)s into one
      
      * Check is_test once and pass mkldnn forward prop kind
      
      * dereference shared_ptr with * (without get())
      
      test=develop
      
      * add is_test_pass back
      
      test=develop
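As a rough illustration of the "check is_test once and pass the forward prop kind" item above, the selection could be sketched as follows. `PropKind` and `SelectForwardPropKind` are hypothetical names invented for this sketch; they only mirror the training/inference distinction of MKL-DNN's `prop_kind` and are not taken from the commit itself.

```cpp
#include <cassert>

// Hypothetical stand-in for the MKL-DNN propagation kind (an assumption
// for illustration, not the library's own type).
enum class PropKind { kForwardTraining, kForwardInference };

// Decide the propagation kind once from the op's is_test attribute, so
// every primitive created afterwards can be handed the same value.
inline PropKind SelectForwardPropKind(bool is_test) {
  return is_test ? PropKind::kForwardInference : PropKind::kForwardTraining;
}
```

A kernel would call `SelectForwardPropKind(is_test)` a single time and reuse the result, rather than re-reading the attribute at each primitive.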
  3. 09 Nov, 2018 1 commit
    • Add InferVarType for some ops (#14201) · 6c6e6385
      chengduo committed
      * add_infer_var_type
      test=develop
      
      * InferVarTypeHelper -> VarTypeInferenceHelper
      test=develop
      
      * PassInputTypeAndDTypeOnOutput
      test=develop
      
      * follow comment
      test=develop
  4. 31 Oct, 2018 1 commit
  5. 29 Oct, 2018 1 commit
  6. 22 Oct, 2018 1 commit
  7. 15 Jun, 2018 1 commit
  8. 14 Jun, 2018 1 commit
  9. 11 Jun, 2018 1 commit
  10. 07 Jun, 2018 1 commit
    • Mkldnn layout (#11040) · 3ff9ba0e
      mozga-intel committed
      * Add MKLDNN layout support in Paddle
      
      Add MKLDNN layout in Paddle so that an MKLDNN-friendly memory layout
      can be used in MKLDNN-enabled OP kernels. Before this commit, NCHW
      was hardcoded in all MKLDNN op kernels. As a result, a
      non-optimized execution path was selected in the MKLDNN primitive,
      which hurt performance.
      Besides the framework change, three MKLDNN OP kernels were updated
      to use the new MKLDNN layout: conv, pool2d and batch_norm.
      Other MKLDNN OP kernels also need to be updated in a similar way to
      achieve the best performance.
      
      * Add MKLDNN layout support in activation OP
      
      * Don't populate layout from input to output when kMKLDNN in
      
      * Refine pool mkldnn op kernel
      
      * MKLDNN layout
      
      * Remove the inheritance from tensor file
      
      * MKLDNN layout: refactoring
      
      * Remove additional #define to register new operator
      
      * Prepare mkldnn tests to work with layout
  11. 08 May, 2018 1 commit
    • Clean OpProtoAndCheckerMaker · 0e78cb69
      Yu Yang committed
      Do not use ctor
      
      * Reduce lines of code.
      * We can use a virtual function for Maker now.
      * The implementation does not care what the maker holds, so it is easier to
      refactor later.
  12. 19 Apr, 2018 1 commit
  13. 17 Apr, 2018 1 commit
  14. 17 Mar, 2018 1 commit
  15. 10 Mar, 2018 1 commit
  16. 28 Feb, 2018 3 commits
  17. 16 Feb, 2018 1 commit
  18. 12 Feb, 2018 1 commit
  19. 10 Feb, 2018 2 commits
  20. 17 Jan, 2018 3 commits
  21. 15 Jan, 2018 2 commits
  22. 14 Jan, 2018 1 commit
    • "cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0
      dzhwinter committed
      * "unified operators"
      
      * "add CUDNN register"
      
      * "add use cudnn attribute"
      
      * "add attribute"
      
      * "test conv transpose op"
      
      * "remove duplicated attr"
      
      * "fix op test"
      
      * "add attribute to set cudnn"
      
      * "add more log"
      
      * "need layout op register support"
      
      * "add more log"
      
      * "change GetExpectedKernelType "
      
      * "fix Get attr in conv_op"
      
      * "fix CI"
      
      * "fix tests"
      
      * "removed kernel priority fallback"
      
      * "fix CI"
      
      * "fix stack pointer bug"
      
      * "refine buggy interface"
      
      * "add const cast to save life"
      
      * "fix get_output_with_grad"
      
      * "fix op test with dataformat"
      
      * "fix pooling"
      
      * "fix pooling test"
      
      * "fix CI"
      
      * "fix with_gpu error"
      
      * "add transform needed functional check"
      
      * "fix unpack list error"
      
      * "comment out parallel.do temporarily"
      
      * "fix CI"
      
      * "fix compile doc error"
      
      * "make threshold larger"
  23. 04 Jan, 2018 1 commit
  24. 20 Dec, 2017 1 commit
  25. 12 Dec, 2017 1 commit
    • Refine device context (#6433) · 61ec0b95
      QI JUN committed
      The main changes are:
      
      - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
      - remove `eigen_device` interface in base class  `DeviceContext`
      - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
      - remove unused `platform::EigenDeviceConverter`
      - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
      - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
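The first item in the list above (templating functors on `DeviceContext` rather than `Place`) can be pictured with a minimal sketch. The `CPUDeviceContext` struct and `ScaleFunctor` below are simplified, hypothetical stand-ins written for this illustration; they are not Paddle's actual classes.

```cpp
#include <cassert>
#include <vector>

// Hypothetical, simplified device context (assumption for illustration).
struct CPUDeviceContext {};

// The functor is parameterized on the device context type, so the context
// object itself is what gets passed in at call time.
template <typename DeviceContext, typename T>
struct ScaleFunctor {
  void operator()(const DeviceContext& ctx, T factor,
                  std::vector<T>* data) const {
    (void)ctx;  // a real context would supply streams, handles, etc.
    for (auto& v : *data) v *= factor;
  }
};
```

A caller would instantiate it as `ScaleFunctor<CPUDeviceContext, float>()(ctx, 2.0f, &values)`; a device-specific specialization could be added for another context type without changing call sites.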
  26. 27 Nov, 2017 1 commit
  27. 17 Nov, 2017 1 commit
  28. 08 Nov, 2017 1 commit
  29. 05 Nov, 2017 1 commit
  30. 28 Oct, 2017 1 commit
  31. 27 Oct, 2017 2 commits
  32. 25 Oct, 2017 1 commit
  33. 17 Oct, 2017 1 commit