1. 14 6月, 2018 1 次提交
  2. 13 6月, 2018 1 次提交
  3. 07 6月, 2018 3 次提交
    • D
      Big data op_test benchmark, for checking output consistent in different runs. (#10646) · f7c96f07
      dzhwinter 提交于
      * "init benchmark ops"
      
      * "untrack outputs"
      
      * "delete some usused code"
      
      * "benchmark"
      
      * "fix ci"
      
      * "fix op test"
      
      * "fix uint16 missing"
      
      * "fix ci"
      
      * "follow comments"
      
      * "fix ci"
      
      * "follow comments"
      
      * "conficts. merge develop branch"
      
      * repick
      
      * "merge develop branch"
      f7c96f07
    • F
      dc8e0b49
    • M
      Mkldnn layout (#11040) · 3ff9ba0e
      mozga-intel 提交于
      * Add MKLDNN layout support in Paddle
      
      Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout
      can be used in MKLDNN enabled OP kernel. Before this commit, NCHW
      is hardcode to be used in all MKLDNN op kernels. As a result,
      non-optimized execution path is selected in MKLDNN primitive which
      bring worse performance.
      Besides framework change, three MKLDNN OP kernels were updated
      for using new MKLDNN layout. They are conv/pool2d/batch_norm.
      Other MKLDNN OP kernels need be also updated in similar way to
      achieve best performance.
      
      * Add MKLDNN layout support in activation OP
      
      * Don't populate layout from input to output when kMKLDNN in
      
      * Refine pool mkldnn op kernel
      
      * MKLDNN layout
      
      * Remove the inferitance from tensor file
      
      * MKLDNN layout: refactoring
      
      * Remove additional #define to register new operator
      
      * Prepare mkldnn tests to work with layout
      3ff9ba0e
  4. 30 5月, 2018 1 次提交
  5. 29 5月, 2018 1 次提交
  6. 03 5月, 2018 1 次提交
  7. 25 4月, 2018 2 次提交
  8. 12 4月, 2018 1 次提交
    • Q
      Dist transpiler support prefetch (#9714) · 4c55a602
      Qiao Longfei 提交于
      * init
      
      * add some check
      
      * add dist transpile logic
      
      * add insert op for block
      
      * init change get_pserver_program
      
      * optimize code
      
      * fix a bug
      
      * can run now
      
      * start to do table split
      
      * start to process table gradient
      
      * complete pserver part
      
      * can send_vars now
      
      * revert cpplint
      
      * fix a bug
      
      * optimize code
      
      * move dist test to models
      
      * revert the interface of distribute_transpiler.transpile
      
      * fix prefetch_block
      
      * optimize trainspiler code
      
      * add comment to sum_op
      
      * add warning log
      
      * fix comment
      
      * fix test_send_recv
      
      * fix test_send_recv
      
      * fix train with no distributed table
      
      * optimize GetDims
      4c55a602
  9. 04 4月, 2018 1 次提交
  10. 30 3月, 2018 1 次提交
  11. 14 3月, 2018 1 次提交
  12. 12 3月, 2018 1 次提交
  13. 09 3月, 2018 1 次提交
  14. 27 2月, 2018 1 次提交
  15. 16 2月, 2018 1 次提交
    • A
      [WIP] Move DataType enum inside VarType (#8447) · c7ad26d6
      Abhinav Arora 提交于
      * Move Pod Types from DataType enum to Type enum
      
      * Fixed data_type.h
      
      * Fix type in TensorDesc
      
      * Add comment to framework.proto
      
      * Fixed type in data_type.h
      
      * Updated format of type in data_type.h
      
      * Fix var_desc.h
      
      * Fix op_kernel_type.h
      
      * Fixed data_type_transform_test.cc
      
      * Fix operator.h
      
      * Fixed data_type_transform.cc
      
      * Fixed op_kernel_type_test.cc
      
      * Fix operator.cc
      
      * Fixed data_layout_transform_test.cc
      
      * Fix var_desc.cc
      
      * Fixed assign_value_op.cc
      
      * Fixed assign_value_op.h
      
      * fixed protobuf.cc
      
      * Fix data_layout_transform_test.cc and op_kernel_type_test.cc
      
      * Fixed rnn_memory_helper_op.cc
      
      * Fix progrma_desc_test.cc
      
      * Fixed fill_constant_batch_size_like_op.cc
      
      * Fix operator_test.cc
      
      * Fixed fill_constant_op.cc
      
      * Fixed gaussian_random_op.cc
      
      * Fixed uniform_random_op.cc
      
      * Fixed edit_distance_op.cc
      
      * Fixed fill_constant_batch_size_like_op.cc
      
      * Fixed rnn_memory_helper_op.cc
      
      * Fixed chunk_eval_op.cc
      
      * Fixed assign_value_op.cc
      
      * Fixed assign_value_op.h
      
      * Fixed cast_op.h
      
      * Fixed cast_op.h
      
      * Fix fill constant op
      
      * Fixed clang for assign_value_op.cc
      
      * Fix one_hot_op.h
      
      * Fix one_hot_op.cc
      
      * Fix fill_op.cc
      
      * Fixed sum_op.cc
      
      * Fixed sum_op clang
      
      * Fix uniform_random_op.cc
      
      * Fix gaussian_random_op.cc
      
      * Fix backward.cc
      
      * Fix protobuf.cc
      
      * Fixed prune_test.cc
      
      * Fixed op_registry_test.cc
      
      * Fix data_device_transform_test.cu
      
      * Fix travis error
      
      * Fixed one_hot_op.cu
      
      * Fixed op_registry_test.cc
      
      * Fixed nccl_op.cc
      
      * Fixing python tests
      
      * Revert "Fixing python tests"
      
      This reverts commit fccaa4c5.
      
      * Fixing Pybind to remove data type
      
      * Fixing tensor.py
      
      * Updated the new files:
      
      * Resolve error in merge conflict of fill_constant_batch_size_like_op.cc
      c7ad26d6
  16. 13 2月, 2018 1 次提交
    • A
      Separate VarType from VarDesc in framework.proto and fix all related compiler errors (#8414) · fcadb452
      Abhinav Arora 提交于
      * Refine Type system
      
      * Fixing type inference
      
      * Fixed create_reader_op.cc
      
      * Fix var_desc.h
      
      * Fixed executor.cc
      
      * Fix shape_inference.h
      
      * Fixed create_reader_op.cc
      
      * Fix tensor_util.h
      
      * Fixed var_type_inference_test.cc
      
      * Fix shape_inference.cc
      
      * Fixed sum_op.c
      
      * Fixed read_op.cc
      
      * Fix var_type.h
      
      * Fixed beam_search_decode_op.cc
      
      * sendrecvop_utils.cc
      
      * Fix operator.cc
      
      * Fixed lookup_table_op.cc
      
      * Fixed op_desc.cc
      
      * Fixed get_places_op.cc
      
      * Fixed lod_rank_table_op.cc
      
      * Fixed beam_search_op.cc
      
      * Fix var_desc.cc
      
      * Fixed lod_tensor_to_array_op.cc
      
      * Fixed while_op.cc
      
      * Fix program_desc_test.cc
      
      * tensor_array_read_write_op.cc
      
      * Fix assign_op.cc
      
      * Fix executor.cc
      
      * Fix protobuf.cc
      
      * Fix protobuf.cc
      fcadb452
  17. 12 2月, 2018 1 次提交
  18. 10 2月, 2018 2 次提交
  19. 09 2月, 2018 1 次提交
  20. 06 2月, 2018 2 次提交
  21. 02 2月, 2018 1 次提交
  22. 31 1月, 2018 1 次提交
  23. 19 1月, 2018 1 次提交
  24. 14 1月, 2018 1 次提交
    • D
      "cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0
      dzhwinter 提交于
      * "unified operators"
      
      * "add CUDNN register"
      
      * "add use cudnn attribute"
      
      * "add attribute"
      
      * "test conv tranpose op"
      
      * "remove duplicated attr"
      
      * "fix op test"
      
      * "add attribute to set cudnn"
      
      * "add more log"
      
      * "need layout op register support"
      
      * "add more log"
      
      * "change GetExpectedKernelType "
      
      * "fix Get attr in conv_op"
      
      * "fix CI"
      
      * "fix tests"
      
      * "removed kernel priority fallback"
      
      * "fix CI"
      
      * "fix stack pointer bug"
      
      * "refine buggy interface"
      
      * "add const cast to save life"
      
      * "fix get_output_with_grad"
      
      * "fix op test with dataformat"
      
      * ""fix pooling
      
      * "fix pooling test"
      
      * "fix CI"
      
      * "fix with_gpu error"
      
      * "add transform needed functional check"
      
      * "fix unpack list error"
      
      * "comment out parallel.do temporary"
      
      * "fix CI"
      
      * "fix compile doc error"
      
      * "make threshold larger"
      5ad1aef0
  25. 12 1月, 2018 1 次提交
  26. 10 1月, 2018 3 次提交
  27. 09 1月, 2018 1 次提交
  28. 08 1月, 2018 5 次提交
    • Q
      fix priority · 0b52cc88
      qiaolongfei 提交于
      0b52cc88
    • Q
      add back priority · ca90356b
      qiaolongfei 提交于
      ca90356b
    • D
      Feature/add shared layout (#7233) · e94db381
      dzhwinter 提交于
      * "reuse ShareLoD with no regret"
      
      * "removed base class shareLayout"
      
      * "fix CI"
      e94db381
    • Q
      cpu gpu transform function (#7191) · 0f353ab4
      Qiao Longfei 提交于
      * add rename guard
      
      * add device_data_transform
      
      * add device_data_transform_test
      
      * modify GetExpectedKernelType
      
      * update operator.run
      
      * support test test_label_semantic_roles
      
      * optimize code
      
      * optimize code
      
      * rename GetActualKernelType to GetExpectedKernelType
      
      * fix chunk_eval_op and device_data_transform_test
      
      * add is_same_place to place
      
      * optimize code, refine rename_guard
      
      * refine rename guard, add GetKernelTypeForVar
      
      * optimize code
      
      * add some log
      
      * rename guard
      
      * use sub scope to create var
      
      * fix compile
      
      * add IsInitialized for Tensor
      
      * add VarIsTensor
      
      * fix op_registry_test
      
      * test
      
      * tmp disable priority
      
      * restore switch_kernel.md
      
      * code clean
      0f353ab4
    • E
      Show argument dimensions with operator::DebugStringEx (#7268) · 8814bec0
      emailweixu 提交于
      This can make it easier to locate error.
      8814bec0
  29. 05 1月, 2018 1 次提交
    • D
      Feature/use cudnn (#7141) · 5593858d
      dzhwinter 提交于
      * "add c++ side kernel selection"
      
      * "add multiple kernel op test"
      
      * "kernel selection only support cudnn"
      
      * "better formatter"
      
      * "small fix with UseCPU"
      
      * "depends on change interface Get(Place, Library)"
      
      * "fix CI"
      
      * "fix python cudnn test"
      
      * "leave the register cudnn op to another PR"
      
      * "fix CI"
      
      * "use all kernel by default"
      
      * "fix CI"
      5593858d