1. 11 4月, 2020 1 次提交
  2. 10 4月, 2020 1 次提交
  3. 10 10月, 2019 1 次提交
  4. 11 9月, 2019 1 次提交
    • H
      Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320
      Huihuang Zheng 提交于
      TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory.
      
      We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton.
      
      Also added data_feed_proto to operator to fix CI in CPU compilation
      12542320
  5. 23 1月, 2019 1 次提交
  6. 18 10月, 2018 1 次提交
  7. 16 10月, 2018 2 次提交
  8. 15 8月, 2018 1 次提交
  9. 01 6月, 2018 1 次提交
  10. 12 2月, 2018 1 次提交
  11. 10 2月, 2018 2 次提交
  12. 22 12月, 2017 1 次提交
  13. 12 12月, 2017 1 次提交
    • Q
      Refine device context (#6433) · 61ec0b95
      QI JUN 提交于
      There are mainly following fixes:
      
      - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
      - remove `eigen_device` interface in base class  `DeviceContext`
      - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
      - remove unused `platform::EigenDeviceConverter`
      - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
      - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
      61ec0b95
  14. 25 10月, 2017 1 次提交
    • Q
      CPU Batch Norm Op (#4964) · ee998a9c
      Qiao Longfei 提交于
      * init batch norm op
      
      * prepare input output
      
      * compute mean_out var_out save_mean save_var on CPU
      
      * active is test
      
      * use eigen to do computation
      
      * complete batch norm forward
      
      * set default momentum to 0.9
      
      * add batch norm grad op in CPU
      
      * add tensor_format and NHWC support, add python test
      
      * add test training
      
      * add batch norm gradient test
      
      * improve comment, fix foward Python UnitTest
      
      * add gradient test
      
      * fix eigen warning
      
      * follow name style
      
      * fix a bug
      
      * change float to T
      
      * add simple forward test
      
      * test with different place
      
      * add backward test
      
      * refine python test
      
      * remove old python test code
      
      * code clean
      
      * follow code style
      
      * update comment
      ee998a9c
  15. 10 10月, 2017 1 次提交
  16. 28 9月, 2017 1 次提交
  17. 20 9月, 2017 1 次提交
  18. 23 8月, 2017 1 次提交
  19. 11 8月, 2017 1 次提交
  20. 08 8月, 2017 1 次提交
  21. 07 8月, 2017 1 次提交
  22. 05 8月, 2017 1 次提交
  23. 02 8月, 2017 1 次提交
  24. 01 8月, 2017 1 次提交
  25. 26 7月, 2017 1 次提交
  26. 25 7月, 2017 1 次提交
  27. 19 7月, 2017 2 次提交
  28. 17 7月, 2017 3 次提交
  29. 14 7月, 2017 1 次提交
  30. 13 7月, 2017 2 次提交
    • Y
      Follow comments · 79b70c2d
      Yu Yang 提交于
      * Convert `op` --> `operators`
      * Remove AddType in OpProtoMaker, because type is part of registry.
      * Rename CPU_OR_GPU --> DEVICE_TYPE in registry macro.
      79b70c2d
    • Y
      Add a sample op, `add_op` · a0aaafe9
      Yu Yang 提交于
      * Refine register methods, make Op can get rid of whole-archieve
      * `USE_OP` before a op is used.
      * Add unittest for add_op.
      a0aaafe9