1. 26 Jun, 2018 2 commits
    • Refactor Operator.cc, and clean code · 9faf5a39
      yuyang18 committed
    • MKLDNN elementwise_add with default broadcast operations (#11544) · e26f51ce
      Tomasz Patejko committed
      * elementwise_add with bcast: Brian's implementation added, with default bcasts
      
      * elementwise_add with bcast: GetExpectedKernelType added to elementwise_op
      
      * elementwise_add with bcast: use_mkldnn attribute added
      
      * elementwise_add with bcast: changes after review and some formatting
      
      * elementwise_add with bcast: changes after style check
      
      * elementwise_add with bcast: changes after style check cont.
      
      * elementwise_add with bcast: MKLDNN unittests added
      
      * elementwise_add with bcast: original unittests with use_mkldnn flag
      
      * elementwise_add with bcast: handling of MKLDNN format corrected
      
      * elementwise_add with bcast: setting MKLDNN format turned into lambda
      
      * elementwise_add with bcast: MKLDNN format setting turned into separate function
      
      * elementwise_add with bcast: condition for choosing MKLDNN simplified
      
      * elementwise_add with bcast: fix for MKLDNN format set incorrectly in bcasts
      
      * elementwise_add with bcast: changes in unittests for broadcasts
      
      * elementwise_add with bcast: fixes in unittests regarding dimensions
      
      * elementwise_add with bcast: bring back correct format setting in mklml grad path
      
      * elementwise_add with bcast: fixed compilation error
  2. 07 Jun, 2018 1 commit
    • Mkldnn layout (#11040) · 3ff9ba0e
      mozga-intel committed
      * Add MKLDNN layout support in Paddle
      
      Add MKLDNN layout in Paddle so that an MKLDNN-friendly memory layout
      can be used in MKLDNN-enabled OP kernels. Before this commit, NCHW
      was hardcoded in all MKLDNN op kernels. As a result, a
      non-optimized execution path was selected in the MKLDNN primitives,
      which hurt performance.
      Besides the framework change, three MKLDNN OP kernels were updated
      to use the new MKLDNN layout: conv, pool2d, and batch_norm.
      Other MKLDNN OP kernels also need to be updated in a similar way to
      achieve the best performance.
      
      * Add MKLDNN layout support in activation OP
      
      * Don't populate layout from input to output when kMKLDNN in
      
      * Refine pool mkldnn op kernel
      
      * MKLDNN layout
      
      * Remove the inheritance from tensor file
      
      * MKLDNN layout: refactoring
      
      * Remove additional #define to register new operator
      
      * Prepare mkldnn tests to work with layout
  3. 25 Apr, 2018 1 commit
  4. 07 Mar, 2018 1 commit
    • Integrate float16 into data_type_transform (#8619) · 266ccaa8
      kexinzhao committed
      * test cpu float16 data transform
      
      * add isnan etc
      
      * small fix
      
      * fix containsNAN test error
      
      * add data_type transform GPU test
      
      * add float16 GPU example
      
      * fix error
      
      * fix GPU test error
      
      * add context wait
  5. 12 Feb, 2018 1 commit
  6. 10 Feb, 2018 2 commits
  7. 21 Jan, 2018 1 commit
    • Data type transform (#7653) · 85671b8a
      Qiao Longfei committed
      * init complete data layout transform
      
      * can compile
      
      * test passed
      
      * optimize code
      
      * fix while_grad_op first step loss lod problem
      
      * optimize in out ptr for transform
      
      * add check
      
      * update copyright
      
      * clean code
      
      * add NeedTransformLayout
      
      * add comment
      
      * change the interface of data_type_transform
      
      * init data_type_transform_test
      
      * complete data_type_transform_test
      
      * add TransDataType to data_transform
  8. 19 Jan, 2018 1 commit
  9. 14 Jan, 2018 1 commit
    • "cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0
      dzhwinter committed
      * "unified operators"
      
      * "add CUDNN register"
      
      * "add use cudnn attribute"
      
      * "add attribute"
      
      * "test conv transpose op"
      
      * "remove duplicated attr"
      
      * "fix op test"
      
      * "add attribute to set cudnn"
      
      * "add more log"
      
      * "need layout op register support"
      
      * "add more log"
      
      * "change GetExpectedKernelType "
      
      * "fix Get attr in conv_op"
      
      * "fix CI"
      
      * "fix tests"
      
      * "removed kernel priority fallback"
      
      * "fix CI"
      
      * "fix stack pointer bug"
      
      * "refine buggy interface"
      
      * "add const cast to save life"
      
      * "fix get_output_with_grad"
      
      * "fix op test with dataformat"
      
      * "fix pooling"
      
      * "fix pooling test"
      
      * "fix CI"
      
      * "fix with_gpu error"
      
      * "add transform needed functional check"
      
      * "fix unpack list error"
      
      * "comment out parallel.do temporarily"
      
      * "fix CI"
      
      * "fix compile doc error"
      
      * "make threshold larger"
  10. 10 Jan, 2018 1 commit
  11. 08 Jan, 2018 2 commits
    • Feature/add shared layout (#7233) · e94db381
      dzhwinter committed
      * "reuse ShareLoD with no regret"
      
      * "removed base class shareLayout"
      
      * "fix CI"
    • cpu gpu transform function (#7191) · 0f353ab4
      Qiao Longfei committed
      * add rename guard
      
      * add device_data_transform
      
      * add device_data_transform_test
      
      * modify GetExpectedKernelType
      
      * update operator.run
      
      * support test test_label_semantic_roles
      
      * optimize code
      
      * optimize code
      
      * rename GetActualKernelType to GetExpectedKernelType
      
      * fix chunk_eval_op and device_data_transform_test
      
      * add is_same_place to place
      
      * optimize code, refine rename_guard
      
      * refine rename guard, add GetKernelTypeForVar
      
      * optimize code
      
      * add some log
      
      * rename guard
      
      * use sub scope to create var
      
      * fix compile
      
      * add IsInitialized for Tensor
      
      * add VarIsTensor
      
      * fix op_registry_test
      
      * test
      
      * tmp disable priority
      
      * restore switch_kernel.md
      
      * code clean
  12. 05 Jan, 2018 1 commit
    • Feature/use cudnn (#7141) · 5593858d
      dzhwinter committed
      * "add c++ side kernel selection"
      
      * "add multiple kernel op test"
      
      * "kernel selection only support cudnn"
      
      * "better formatter"
      
      * "small fix with UseCPU"
      
      * "depends on change interface Get(Place, Library)"
      
      * "fix CI"
      
      * "fix python cudnn test"
      
      * "leave the register cudnn op to another PR"
      
      * "fix CI"
      
      * "use all kernel by default"
      
      * "fix CI"
  13. 04 Jan, 2018 1 commit
  14. 02 Jan, 2018 2 commits
  15. 27 Dec, 2017 1 commit
  16. 26 Dec, 2017 3 commits
    • Add data transform fn (#6953) · f97f69fe
      Qiao Longfei committed
      * init data_transform
      
      * complete DataTransform
      
      * fix build error
      
      * add data_transform_test
      
      * add a register test for data_transform_fn
      
      * use function to simulate registration macro
      
      * add register macro
      
      * update test
      
      * clean code
      
      * restore unrelated code
      
      * update data transform test
      
      * generate unique name for REGISTER_DATA_TRANSFORM_FN
      
      * add const
      
      * follow comment
      
      * update KernelTypePair hash function
    • unify the indentation of license · 761b3297
      Luo Tao committed
    • "fix threadpool style" (#7017) · 80dafdf5
      dzhwinter committed
      * "fix threadpool style"
      
      * "remove header"
  17. 17 Oct, 2017 1 commit
    • Rewrite feed/fetch op (#4815) · 4df6cf4d
      Yu Yang committed
      * Feed/Fetch op is just a plain operator, not an OpWithKernel
      * Do not register OpInfoMaker since Feed/Fetch will never be
        configured by users
      * Feed/Fetch op has an empty gradient
      * Feed/Fetch op does not hard-code `feed_variable`, `fetch_variable` as
        its input and output; they are plain Operator inputs/outputs
  18. 28 Sep, 2017 1 commit
  19. 27 Sep, 2017 1 commit
  20. 25 Jul, 2017 1 commit
  21. 17 Jul, 2017 2 commits
  22. 11 Jul, 2017 2 commits
  23. 06 Jul, 2017 2 commits
  24. 05 Jul, 2017 1 commit
  25. 04 Jul, 2017 4 commits
  26. 03 Jul, 2017 1 commit
  27. 28 Jun, 2017 2 commits