1. 05 12月, 2018 1 次提交
  2. 16 11月, 2018 1 次提交
  3. 26 6月, 2018 1 次提交
  4. 07 6月, 2018 1 次提交
    • M
      Mkldnn layout (#11040) · 3ff9ba0e
      mozga-intel 提交于
      * Add MKLDNN layout support in Paddle
      
      Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout
      can be used in MKLDNN enabled OP kernel. Before this commit, NCHW
      is hardcode to be used in all MKLDNN op kernels. As a result,
      non-optimized execution path is selected in MKLDNN primitive which
      bring worse performance.
      Besides framework change, three MKLDNN OP kernels were updated
      for using new MKLDNN layout. They are conv/pool2d/batch_norm.
      Other MKLDNN OP kernels need be also updated in similar way to
      achieve best performance.
      
      * Add MKLDNN layout support in activation OP
      
      * Don't populate layout from input to output when kMKLDNN in
      
      * Refine pool mkldnn op kernel
      
      * MKLDNN layout
      
      * Remove the inferitance from tensor file
      
      * MKLDNN layout: refactoring
      
      * Remove additional #define to register new operator
      
      * Prepare mkldnn tests to work with layout
      3ff9ba0e
  5. 19 4月, 2018 1 次提交
  6. 16 2月, 2018 1 次提交
    • A
      [WIP] Move DataType enum inside VarType (#8447) · c7ad26d6
      Abhinav Arora 提交于
      * Move Pod Types from DataType enum to Type enum
      
      * Fixed data_type.h
      
      * Fix type in TensorDesc
      
      * Add comment to framework.proto
      
      * Fixed type in data_type.h
      
      * Updated format of type in data_type.h
      
      * Fix var_desc.h
      
      * Fix op_kernel_type.h
      
      * Fixed data_type_transform_test.cc
      
      * Fix operator.h
      
      * Fixed data_type_transform.cc
      
      * Fixed op_kernel_type_test.cc
      
      * Fix operator.cc
      
      * Fixed data_layout_transform_test.cc
      
      * Fix var_desc.cc
      
      * Fixed assign_value_op.cc
      
      * Fixed assign_value_op.h
      
      * fixed protobuf.cc
      
      * Fix data_layout_transform_test.cc and op_kernel_type_test.cc
      
      * Fixed rnn_memory_helper_op.cc
      
      * Fix progrma_desc_test.cc
      
      * Fixed fill_constant_batch_size_like_op.cc
      
      * Fix operator_test.cc
      
      * Fixed fill_constant_op.cc
      
      * Fixed gaussian_random_op.cc
      
      * Fixed uniform_random_op.cc
      
      * Fixed edit_distance_op.cc
      
      * Fixed fill_constant_batch_size_like_op.cc
      
      * Fixed rnn_memory_helper_op.cc
      
      * Fixed chunk_eval_op.cc
      
      * Fixed assign_value_op.cc
      
      * Fixed assign_value_op.h
      
      * Fixed cast_op.h
      
      * Fixed cast_op.h
      
      * Fix fill constant op
      
      * Fixed clang for assign_value_op.cc
      
      * Fix one_hot_op.h
      
      * Fix one_hot_op.cc
      
      * Fix fill_op.cc
      
      * Fixed sum_op.cc
      
      * Fixed sum_op clang
      
      * Fix uniform_random_op.cc
      
      * Fix gaussian_random_op.cc
      
      * Fix backward.cc
      
      * Fix protobuf.cc
      
      * Fixed prune_test.cc
      
      * Fixed op_registry_test.cc
      
      * Fix data_device_transform_test.cu
      
      * Fix travis error
      
      * Fixed one_hot_op.cu
      
      * Fixed op_registry_test.cc
      
      * Fixed nccl_op.cc
      
      * Fixing python tests
      
      * Revert "Fixing python tests"
      
      This reverts commit fccaa4c5.
      
      * Fixing Pybind to remove data type
      
      * Fixing tensor.py
      
      * Updated the new files:
      
      * Resolve error in merge conflict of fill_constant_batch_size_like_op.cc
      c7ad26d6
  7. 12 2月, 2018 1 次提交
  8. 10 2月, 2018 2 次提交
  9. 19 1月, 2018 1 次提交
  10. 14 1月, 2018 1 次提交
    • D
      "cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0
      dzhwinter 提交于
      * "unified operators"
      
      * "add CUDNN register"
      
      * "add use cudnn attribute"
      
      * "add attribute"
      
      * "test conv tranpose op"
      
      * "remove duplicated attr"
      
      * "fix op test"
      
      * "add attribute to set cudnn"
      
      * "add more log"
      
      * "need layout op register support"
      
      * "add more log"
      
      * "change GetExpectedKernelType "
      
      * "fix Get attr in conv_op"
      
      * "fix CI"
      
      * "fix tests"
      
      * "removed kernel priority fallback"
      
      * "fix CI"
      
      * "fix stack pointer bug"
      
      * "refine buggy interface"
      
      * "add const cast to save life"
      
      * "fix get_output_with_grad"
      
      * "fix op test with dataformat"
      
      * ""fix pooling
      
      * "fix pooling test"
      
      * "fix CI"
      
      * "fix with_gpu error"
      
      * "add transform needed functional check"
      
      * "fix unpack list error"
      
      * "comment out parallel.do temporary"
      
      * "fix CI"
      
      * "fix compile doc error"
      
      * "make threshold larger"
      5ad1aef0
  11. 04 1月, 2018 1 次提交
  12. 27 12月, 2017 1 次提交
  13. 25 12月, 2017 2 次提交
  14. 24 12月, 2017 2 次提交