1. 23 Nov, 2018 1 commit
    • Fix cmake for AMDGPU platform (#13801) · 61c5f13f
      sabreshao committed
      * HIP cmake.
      Enable whole archive build for pybind library.

      Disable two warnings.

      Rollback to C++11.

      Link RCCL to work around the GPU kernel loading issue.
      
      Update eigen to fix build failure.
      
      Add more include directories.
      
      Fix O3 build failure.
      
      Update eigen.
      
      Fix tensor_util_test segmentation fault issue.

      Add more macro checks in hip.cmake.
      In the future, we may consider refining hip.cmake to inherit all add_definitions() from the parent scope.
      
      Fix rocRAND load.
      
      Update eigen to fix gru_unit_op and reduce_op.
      
      Add HIP support to testing.
      
      Update eigen to support int16 and int8 in arg min and arg max.
      
      * Add rocprim as the counterpart to the cub library used by the NVIDIA implementation
      
      * Reduce build time in rocprim.
      
      * Add rocprim introduction, remove useless cmake code.
      
      * Remove useless flags and format cmake file.
  2. 22 Nov, 2018 1 commit
    • Windows/online (#14474) · d9a1f3e5
      wopeizl committed
      * add recordio support
      
      * disable OpenBLAS multi-threading on Windows since it is not supported
      adjust the python script
      
      * code style
      
      * code style
      test=develop
      
      * add create_recordio_file_reader back
      
      * fix code style
      test=develop
      
      * fix the gtest.cmake on windows
      
      * fix cc_test on windows
      
      * fix the win build
      test=develop
      
      * remove fused compile support on windows
      test=develop
      
      * add the jit support
      test=develop
      
      * add the jit support, test=develop
      
      * add the jit support, test=develop
      
      * add the jit back
      fix compile error on windows
      
      * rollback test=develop
      
      * test case fix
      
      * disable DSO by default on windows
      
      * exclude warpctc_op on windows
      
      * exclude dynload_warpctc on windows
      test=develop
      
      * fix the scripts error
      test=develop
      
      * disable avx on windows by default
      test=develop
      
      * re-organize the cmake file
      
      * disable mkl on windows by default
      
      * add warp_ctc back
      
      * fix the dependency
      
      * fix the dependency
      
      * fix the build issue on windows
      
      * remove unsupported flag on windows
      
      * code style
      
      * code style
      test=develop
      
      * fix issue
      
      * add profiler, parallel_executor back
      
      * clean up the pre-definitions on windows
      
      * fix build issue
      
      * test=develop
  3. 16 Nov, 2018 1 commit
    • Refine operator cmake (#14413) · a2d9b344
      Wu Yi committed
      * wip simplify operator framework
      
      * wip
      
      * wip
      
      * done test=develop
      
      * clean test=develop
      
      * fix test=develop
      
      * fix deps test=develop
      
      * fix cpu build test=develop
      
      * fix tensorrt build test=develop
      
      * fix tests test=develop
      
      * fix test=develop
      
      * fix cpu build test=develop
  4. 06 Nov, 2018 1 commit
    • gpu support, fix build issue: · 1f12ba61
      peizhilin committed
      1. Non-UTF-8 characters in the comments of OPs may cause protobuf to fail in parse_from_string
      2. Comment out some ops that are not supported on Windows
      3. CUDA libs may not be correctly linked to the target on Windows
  5. 05 Nov, 2018 2 commits
  6. 19 Oct, 2018 1 commit
  7. 17 Oct, 2018 1 commit
  8. 25 Sep, 2018 1 commit
  9. 25 Aug, 2018 1 commit
  10. 24 Aug, 2018 2 commits
  11. 21 Aug, 2018 1 commit
  12. 17 Aug, 2018 1 commit
  13. 05 Jul, 2018 1 commit
  14. 12 Apr, 2018 1 commit
  15. 08 Apr, 2018 1 commit
  16. 07 Apr, 2018 1 commit
  17. 22 Mar, 2018 1 commit
  18. 16 Mar, 2018 1 commit
    • Demonstration of cmake refinement for HIP support. · 45c988d8
      sabreshao committed
      1. Add option WITH_AMD_GPU.
      2. Add cmake/hip.cmake for HIP toolchain.
      3. Some external modules, such as eigen, may need a HIP port.
      4. Add macros hip_library/hip_binary/hip_test to cmake/generic.cmake.
      5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
  19. 14 Mar, 2018 1 commit
  20. 07 Mar, 2018 1 commit
  21. 10 Feb, 2018 2 commits
  22. 26 Jan, 2018 1 commit
    • New Run() method for framework::Executor (#7807) · 788f5c6d
      kexinzhao committed
      * initial commit
      
      * add new executor run function
      
      * fix bug
      
      * fix multiple definition of feed_fetch_method issue
      
      * fix cmake
      
      * fix tensor copy error
      
      * refine executor code
      
      * add comments
      
      * temporary modification
      
      * address comments
      
      * fix bug
  23. 16 Jan, 2018 1 commit
  24. 01 Jan, 2018 1 commit
  25. 27 Dec, 2017 1 commit
  26. 19 Dec, 2017 1 commit
  27. 18 Dec, 2017 1 commit
    • Feature/global context (#6537) · 24fda392
      dzhwinter committed
      * "add DeviceContextPool"
      
      * "add devicecontextpool in pybind"
      
      * "add comments on the python side"
      
      * "fix static link error"
      
      * "fix CI error"
      
      * "add executor.py"
      
      * "fix CI error"
      
      * "add with gpu macro"
      
      * "remove commented-out code"
      
      * "add TODO items"
      
      * "update init devices"
  28. 30 Nov, 2017 1 commit
  29. 26 Nov, 2017 1 commit
    • Feature/copytensor (#5455) · 45062fe5
      dzhwinter committed
      * "make global tensor function independently"
      
      * "replace functor"
      
      * "fix inline template error"
      
      * "fix tensor array with CopyFrom"
      
      * "fix other cases that use CopyFrom"
      
      * "move the op interface hardly"
      
      * "fix operators"
      
      * "fix typo"
      
      * "delete dynamic recurrent rnn and fix gru_unit in debug mode"
      
      * "fix unique_ptr copy"
      
      * "fix cuda copy"
      
      * "fix namespace error"
      
      * "removed nccl python test"
      
      * "fix include error"
      
      * "fix typo"
      
      * fix copy util test
  30. 28 Oct, 2017 1 commit
    • Python API for inference model saving/load (#5020) · 6783dcee
      fengjiayi committed
      * Add `dump_to_file()` for ProgrameDescBind in pybind
      
      * Update
      
      * Add utility.py
      
      * typo
      
      * Fix bugs
      
      * Move add_feed/fetch_components to utility.py

      * Complete dump
      
      * Follow comments
      
      * Change output of Prune() from inference to pointer
      
      * Expose Prune() to Python
      
      * Complete save/load API of inference model
      
      * Fix errors
      
      * Debugging

      * Complete unit tests
      
      * follow comments
  31. 24 Oct, 2017 1 commit
  32. 16 Oct, 2017 1 commit
  33. 12 Oct, 2017 1 commit
    • Cudnn conv op (#4195) · a3ccbdb3
      武毅 committed
      * add cudnn_conv_op
      
      * WIP
      
      * update
      
      * update
      
      * fix grad check
      
      * use platform::memory
      
      * add group support for cudnn
      
      * update
      
      * follow comments
      
      * fix CPU-only build
      
      * update cuda define
      
      * follow comments
      
      * follow comments
      
      * merge with updates
      
      * fix compile error
      
      * follow comments
      
      * follow comments
  34. 07 Oct, 2017 1 commit
  35. 28 Sep, 2017 1 commit
  36. 27 Sep, 2017 2 commits