1. 12 3月, 2022 1 次提交
  2. 20 2月, 2022 1 次提交
  3. 11 2月, 2022 1 次提交
  4. 17 1月, 2022 1 次提交
    • W
      [Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5
      Wilber 提交于
      * add pten::Place data structure.
      
      * update ci problem
      
      * fix ci problem
      
      * update
      
      * using platform::Place=pten::Place
      
      * remove BOOST_GET_CONST for CPUPlace and GPUPlace
      
      * compile pass 25%.
      
      * compile pass 45%
      
      * compile pass 60%
      
      * remove boost_get for xpu npu mlu and ipu
      
      * compile pass on cpu and gpu.
      
      * fix compile problem
      
      * fix compile error.
      
      * update
      
      * fix ci problem
      
      * update
      
      * ci approve
      
      * fix ci problem
      
      * fix ci eager test problem
      
      * remove BOOST_GET_CONST
      
      * fix npu compile
      c48a9ad5
  5. 16 12月, 2021 1 次提交
    • T
      Faster implementation of CPU kernel for ROI Align operator (#37848) · 023ff4f5
      Tomasz Socha 提交于
      * Faster implementation of CPU kernel for ROI_ALIGN Operator
      
      * Add missing variable to CUDA roi_align_op
      
      * Style
      
      * Fix boundaries
      
      * Rename variables for indexes calculation
      
      * Remove unnecessary emplace
      
      * Revert "Remove unnecessary emplace"
      
      This reverts commit c10e87f7fb812f1a672fde32f2690a97d47e2f5a.
      
      * Style
      023ff4f5
  6. 03 12月, 2021 1 次提交
  7. 08 9月, 2021 1 次提交
    • F
      merge CMakeList.txt manual (#35378) · c4a3e8b4
      feng_shuai 提交于
      * merge CMakeList.txt manual
      
      * add platform for changethreadnum
      
      * repair some bugs according to make error
      
      * do nothing just flush CI
      
      * forget change thread num
      
      * add inplace_atol param for check_output_with_place
      
      * Windows
      
      * std:min and std::max should be change because of windows
      c4a3e8b4
  8. 10 6月, 2021 1 次提交
  9. 09 3月, 2021 1 次提交
  10. 19 2月, 2021 1 次提交
  11. 08 9月, 2020 1 次提交
  12. 11 7月, 2020 1 次提交
  13. 12 5月, 2020 1 次提交
  14. 11 5月, 2020 1 次提交
    • C
      Add macro BOOST_GET to enrich the error information of boost :: get (#24175) · aa0f254f
      Chen Weihang 提交于
      * add new macro BOOST_GET_SAFELY & unittests, test=develop
      
      * add different macro type, test=develop
      
      * fix get macro type in executor, test=develop
      
      * four macro part change backup
      
      * using one macro for all case, test=develop
      
      * revert attribute change, test=develop
      
      * change to three func to solve gcc4.8 bug, test=develop
      
      * polish some details, test=develop
      aa0f254f
  15. 11 4月, 2020 1 次提交
  16. 10 4月, 2020 1 次提交
  17. 10 10月, 2019 1 次提交
  18. 11 9月, 2019 1 次提交
    • H
      Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320
      Huihuang Zheng 提交于
      TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory.
      
      We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton.
      
      Also added data_feed_proto to operator to fix CI in CPU compilation
      12542320
  19. 23 1月, 2019 1 次提交
  20. 18 10月, 2018 1 次提交
  21. 16 10月, 2018 2 次提交
  22. 15 8月, 2018 1 次提交
  23. 01 6月, 2018 1 次提交
  24. 12 2月, 2018 1 次提交
  25. 10 2月, 2018 2 次提交
  26. 22 12月, 2017 1 次提交
  27. 12 12月, 2017 1 次提交
    • Q
      Refine device context (#6433) · 61ec0b95
      QI JUN 提交于
      There are mainly following fixes:
      
      - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
      - remove `eigen_device` interface in base class  `DeviceContext`
      - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
      - remove unused `platform::EigenDeviceConverter`
      - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
      - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
      61ec0b95
  28. 25 10月, 2017 1 次提交
    • Q
      CPU Batch Norm Op (#4964) · ee998a9c
      Qiao Longfei 提交于
      * init batch norm op
      
      * prepare input output
      
      * compute mean_out var_out save_mean save_var on CPU
      
      * active is test
      
      * use eigen to do computation
      
      * complete batch norm forward
      
      * set default momentum to 0.9
      
      * add batch norm grad op in CPU
      
      * add tensor_format and NHWC support, add python test
      
      * add test training
      
      * add batch norm gradient test
      
      * improve comment, fix foward Python UnitTest
      
      * add gradient test
      
      * fix eigen warning
      
      * follow name style
      
      * fix a bug
      
      * change float to T
      
      * add simple forward test
      
      * test with different place
      
      * add backward test
      
      * refine python test
      
      * remove old python test code
      
      * code clean
      
      * follow code style
      
      * update comment
      ee998a9c
  29. 10 10月, 2017 1 次提交
  30. 28 9月, 2017 1 次提交
  31. 20 9月, 2017 1 次提交
  32. 23 8月, 2017 1 次提交
  33. 11 8月, 2017 1 次提交
  34. 08 8月, 2017 1 次提交
  35. 07 8月, 2017 1 次提交
  36. 05 8月, 2017 1 次提交
  37. 02 8月, 2017 1 次提交
  38. 01 8月, 2017 1 次提交