1. 29 7月, 2020 1 次提交
    • C
      Simplify BufferedReader to improve DataLoader performance (#25648) · 1b3081b1
      Chen Weihang 提交于
      * simplify buffered reader to improve DataLoader performance
      
      * fix 22 failed unittests
      
      * fix cuda pinned context condition
      
      * fix test_reader_reset failed
      
      * fix two failed unittests
      
      * change unittest place
      
      * polish error messaage
      
      * polish cast op GetExpecctedKernelType
      
      * remove debug info in unittest
      1b3081b1
  2. 21 7月, 2020 1 次提交
  3. 26 11月, 2018 1 次提交
  4. 08 11月, 2018 1 次提交
  5. 18 9月, 2018 1 次提交
  6. 29 5月, 2018 1 次提交
  7. 26 5月, 2018 1 次提交
  8. 24 5月, 2018 1 次提交
  9. 18 5月, 2018 1 次提交
  10. 15 2月, 2018 1 次提交
    • Y
      Update tensor_util.h (#8422) · cfffb1a3
      Yi Wang 提交于
      * Update tensor_util.h
      
      * Update with moved TensorDesc
      
      * Fix tensur_utils.cu
      
      * Update
      
      * Update
      
      * Update
      
      * Update
      
      * Make tensor_util.cu a symbolic link
      cfffb1a3
  11. 12 2月, 2018 1 次提交
  12. 10 2月, 2018 2 次提交
  13. 21 1月, 2018 1 次提交
    • Q
      Data type transform (#7653) · 85671b8a
      Qiao Longfei 提交于
      * init complete data layout transform
      
      * can compile
      
      * test passed
      
      * optimize code
      
      * fix while_grad_op first step loss lod problem
      
      * optimize in out ptr for transform
      
      * add check
      
      * update copyright
      
      * clean code
      
      * add NeedTransformLayout
      
      * add comment
      
      * change the interface of data_type_transform
      
      * init data_type_transform_test
      
      * complete data_type_transform_test
      
      * add TransDataType to data_transform
      85671b8a
  14. 14 1月, 2018 1 次提交
    • D
      "cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0
      dzhwinter 提交于
      * "unified operators"
      
      * "add CUDNN register"
      
      * "add use cudnn attribute"
      
      * "add attribute"
      
      * "test conv tranpose op"
      
      * "remove duplicated attr"
      
      * "fix op test"
      
      * "add attribute to set cudnn"
      
      * "add more log"
      
      * "need layout op register support"
      
      * "add more log"
      
      * "change GetExpectedKernelType "
      
      * "fix Get attr in conv_op"
      
      * "fix CI"
      
      * "fix tests"
      
      * "removed kernel priority fallback"
      
      * "fix CI"
      
      * "fix stack pointer bug"
      
      * "refine buggy interface"
      
      * "add const cast to save life"
      
      * "fix get_output_with_grad"
      
      * "fix op test with dataformat"
      
      * ""fix pooling
      
      * "fix pooling test"
      
      * "fix CI"
      
      * "fix with_gpu error"
      
      * "add transform needed functional check"
      
      * "fix unpack list error"
      
      * "comment out parallel.do temporary"
      
      * "fix CI"
      
      * "fix compile doc error"
      
      * "make threshold larger"
      5ad1aef0
  15. 10 1月, 2018 1 次提交
  16. 09 1月, 2018 1 次提交
  17. 08 1月, 2018 1 次提交
    • Q
      cpu gpu transform function (#7191) · 0f353ab4
      Qiao Longfei 提交于
      * add rename guard
      
      * add device_data_transform
      
      * add device_data_transform_test
      
      * modify GetExpectedKernelType
      
      * update operator.run
      
      * support test test_label_semantic_roles
      
      * optimize code
      
      * optimize code
      
      * rename GetActualKernelType to GetExpectedKernelType
      
      * fix chunk_eval_op and device_data_transform_test
      
      * add is_same_place to place
      
      * optimize code, refine rename_guard
      
      * refine rename guard, add GetKernelTypeForVar
      
      * optimize code
      
      * add some log
      
      * rename guard
      
      * use sub scope to create var
      
      * fix compile
      
      * add IsInitialized for Tensor
      
      * add VarIsTensor
      
      * fix op_registry_test
      
      * test
      
      * tmp disable priority
      
      * restore switch_kernel.md
      
      * code clean
      0f353ab4