1. 14 9月, 2018 1 次提交
  2. 12 9月, 2018 1 次提交
  3. 29 8月, 2018 1 次提交
  4. 23 8月, 2018 1 次提交
  5. 27 7月, 2018 2 次提交
  6. 05 7月, 2018 1 次提交
  7. 27 6月, 2018 1 次提交
  8. 14 6月, 2018 1 次提交
  9. 25 4月, 2018 1 次提交
  10. 24 4月, 2018 6 次提交
  11. 07 4月, 2018 1 次提交
  12. 27 3月, 2018 2 次提交
  13. 19 3月, 2018 2 次提交
    • X
      fix · ce55975b
      Xin Pan 提交于
      ce55975b
    • X
      Enable P2P memory copy · 18ac6947
      Xin Pan 提交于
      On k40 with 4 devices, time reduces from ~4.0 to ~3.8+, should be
      more obvious on better hardware
      18ac6947
  14. 12 2月, 2018 1 次提交
  15. 11 2月, 2018 1 次提交
  16. 10 2月, 2018 2 次提交
  17. 31 1月, 2018 1 次提交
    • D
      "fix gpu init" (#7528) · 6f7eb0d5
      dzhwinter 提交于
      * "fix gpu init"
      
      * "set env variable default value for share gpu"
      
      * "fix ci"
      
      * "removed CUDA_VISIBLE_DEVICES default"
      
      * "removed"
      6f7eb0d5
  18. 10 1月, 2018 2 次提交
  19. 08 1月, 2018 1 次提交
  20. 05 1月, 2018 1 次提交
    • D
      Feature/use cudnn (#7141) · 5593858d
      dzhwinter 提交于
      * "add c++ side kernel selection"
      
      * "add multiple kernel op test"
      
      * "kernel selection only support cudnn"
      
      * "better formatter"
      
      * "small fix with UseCPU"
      
      * "depends on change interface Get(Place, Library)"
      
      * "fix CI"
      
      * "fix python cudnn test"
      
      * "leave the register cudnn op to another PR"
      
      * "fix CI"
      
      * "use all kernel by default"
      
      * "fix CI"
      5593858d
  21. 03 1月, 2018 2 次提交
  22. 27 12月, 2017 4 次提交
  23. 26 12月, 2017 1 次提交
  24. 25 12月, 2017 1 次提交
  25. 24 12月, 2017 1 次提交
    • D
      Feature/operator run place (#6783) · 735eba29
      dzhwinter 提交于
      * "change operator interface"
      
      * "move devicepool to device_context"
      
      * "fix operator test"
      
      * "fix op_registry Run interface"
      
      * "net op passed. Need to fix nccl multi-Context"
      
      * "add nccl group function"
      
      * "add nccl group function"
      
      * "fix gpu count exceed 32 error"
      
      * "fix recurrent op, nccl op"
      
      * "change the other operators interface with Place"
      
      * "fix typo"
      
      * "fix pybind"
      
      * "fix device in python side"
      
      * "fix pybind failed"
      
      * "add init for test"
      
      * "fix CI"
      735eba29
  26. 18 12月, 2017 1 次提交
    • D
      Feature/global context (#6537) · 24fda392
      dzhwinter 提交于
      * "add DeviceContextPool"
      
      * "add devicecontextpool in pybind"
      
      * "add comments in python side "
      
      * "fix static link error"
      
      * "fix CI error"
      
      * "add executor.py"
      
      * "fix CI error"
      
      * "add with gpu macro"
      
      * "remove comment out codes"
      
      * "add TODO items"
      
      * "update init devices"
      24fda392