1. 14 Jan, 2018 1 commit
    • "cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0
      dzhwinter committed
      * "unified operators"
      
      * "add CUDNN register"
      
      * "add use cudnn attribute"
      
      * "add attribute"
      
      * "test conv transpose op"
      
      * "remove duplicated attr"
      
      * "fix op test"
      
      * "add attribute to set cudnn"
      
      * "add more log"
      
      * "need layout op register support"
      
      * "add more log"
      
      * "change GetExpectedKernelType "
      
      * "fix Get attr in conv_op"
      
      * "fix CI"
      
      * "fix tests"
      
      * "removed kernel priority fallback"
      
      * "fix CI"
      
      * "fix stack pointer bug"
      
      * "refine buggy interface"
      
      * "add const cast to save life"
      
      * "fix get_output_with_grad"
      
      * "fix op test with dataformat"
      
      * "fix pooling"
      
      * "fix pooling test"
      
      * "fix CI"
      
      * "fix with_gpu error"
      
      * "add transform needed functional check"
      
      * "fix unpack list error"
      
      * "comment out parallel.do temporarily"
      
      * "fix CI"
      
      * "fix compile doc error"
      
      * "make threshold larger"
  2. 09 Jan, 2018 2 commits
    • Port WarpCTC Operator (#5107) · b5fda272
      Yiqun Liu committed
      * Add Seq2BatchFunctor, which will be used in WarpCTCOp.
      
      * Implement WarpCTCFunctor and WarpCTCKernel.
      
      * Add unittest of warpctc_op.
      
      * Modify the check_output interface in the Python unittest framework to allow checking a subset of outputs.
      
      * Use absolute offset lod in warpctc_op and related functors.
      
      * Refine the comments of warpctc_op.
      
      * The new Python unittest supports checking a subset of the outputs, so revert the previous change.
      
      * Rename the transform from LoDTensor to Tensor with shape [max_sequence_length, num_sequences, sequence_width] to PaddingSequenceFunctor.
      
      * Update to the newest code.
      
      * Rename the PaddingSequenceFunctor to PaddingLoDTensorFunctor and move the computation of dimensions out of the functors.
    • Rename CopyFrom to Copy for tensors (#7292) · ce6dad3b
      Yu Yang committed
      * Rename Tensor::CopyFrom to Tensor::Copy
      
      * Fix CI
      
      * Fix compile
  3. 02 Jan, 2018 2 commits
    • Feature/transform (#7111) · 899a79cc
      dzhwinter committed
      * "fix data transform"
      
      * "data transformer"
      
      * "add device pool"
      
      * "add test"
      
      * "fix ci"
      
      * "fix datalayout implementation "
      
      * "fix based on comment"
    • fix compile (#7125) · 105ee86d
      QI JUN committed
  4. 29 Dec, 2017 3 commits
  5. 28 Dec, 2017 5 commits
  6. 27 Dec, 2017 3 commits
  7. 26 Dec, 2017 3 commits
  8. 25 Dec, 2017 4 commits
  9. 24 Dec, 2017 1 commit
  10. 21 Dec, 2017 1 commit
  11. 20 Dec, 2017 1 commit
  12. 19 Dec, 2017 3 commits
  13. 18 Dec, 2017 1 commit
  14. 15 Dec, 2017 1 commit
  15. 14 Dec, 2017 1 commit
  16. 12 Dec, 2017 1 commit
    • Refine device context (#6433) · 61ec0b95
      QI JUN committed
      The main changes are as follows:
      
      - take `DeviceContext` instead of `Place` as the template parameter of math functors and OpKernel
      - remove the `eigen_device` interface from the base class `DeviceContext`
      - remove the `GetEigenDevice` interface from `ExecutionContext` and the base class `DeviceContext`
      - remove unused `platform::EigenDeviceConverter`
      - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
      - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
  17. 11 Dec, 2017 1 commit
  18. 12 Dec, 2017 1 commit
  19. 09 Dec, 2017 1 commit
  20. 08 Dec, 2017 1 commit
  21. 03 Dec, 2017 1 commit
  22. 29 Nov, 2017 2 commits