1. 08 1月, 2018 2 次提交
    • Q
      cpu gpu transform function (#7191) · 0f353ab4
      Qiao Longfei 提交于
      * add rename guard
      
      * add device_data_transform
      
      * add device_data_transform_test
      
      * modify GetExpectedKernelType
      
      * update operator.run
      
      * support test test_label_semantic_roles
      
      * optimize code
      
      * optimize code
      
      * rename GetActualKernelType to GetExpectedKernelType
      
      * fix chunk_eval_op and device_data_transform_test
      
      * add is_same_place to place
      
      * optimize code, refine rename_guard
      
      * refine rename guard, add GetKernelTypeForVar
      
      * optimize code
      
      * add some log
      
      * rename guard
      
      * use sub scope to create var
      
      * fix compile
      
      * add IsInitialized for Tensor
      
      * add VarIsTensor
      
      * fix op_registry_test
      
      * test
      
      * tmp disable priority
      
      * restore switch_kernel.md
      
      * code clean
      0f353ab4
    • E
      Show argument dimensions with operator::DebugStringEx (#7268) · 8814bec0
      emailweixu 提交于
      This can make it easier to locate error.
      8814bec0
  2. 05 1月, 2018 4 次提交
    • Y
      Add COWPtr and its unittest · 0cfb5465
      Yang Yu 提交于
      It will be used for LoD information in LoDTensor since LoD is a copy
      on write field.
      
      It is pretty slow for copying LoD information between operators. For
      resnet it will cost roughly 10% time of whole time, including reading
      data.
      0cfb5465
    • D
      Feature/use cudnn (#7141) · 5593858d
      dzhwinter 提交于
      * "add c++ side kernel selection"
      
      * "add multiple kernel op test"
      
      * "kernel selection only support cudnn"
      
      * "better formatter"
      
      * "small fix with UseCPU"
      
      * "depends on change interface Get(Place, Library)"
      
      * "fix CI"
      
      * "fix python cudnn test"
      
      * "leave the register cudnn op to another PR"
      
      * "fix CI"
      
      * "use all kernel by default"
      
      * "fix CI"
      5593858d
    • Y
      Refine parallel_do · 8496b2e4
      Yang Yu 提交于
      8496b2e4
    • Y
      send_recv variables (#7161) · e5fe8935
      Yancey 提交于
      * send_recv variable
      
      * delete unused logs
      
      * fix ci failed
      
      * update
      
      * resize tensor before tensor copy
      
      * add selectedrows unit test
      
      * check rows
      e5fe8935
  3. 04 1月, 2018 6 次提交
  4. 03 1月, 2018 7 次提交
  5. 02 1月, 2018 3 次提交
  6. 01 1月, 2018 2 次提交
  7. 29 12月, 2017 4 次提交
  8. 28 12月, 2017 9 次提交
  9. 27 12月, 2017 3 次提交