1. 23 Nov, 2018 1 commit
    • S
      Fix cmake for AMDGPU platform (#13801) · 61c5f13f
      sabreshao committed
      * HIP cmake.
      Enable whole archive build for pybind library.
      
      Disable two warnings.
      
      Roll back to C++11.
      
      Link RCCL to work around GPU kernel loading issue.
      
      Update eigen to fix build failure.
      
      Add more include directories.
      
      Fix O3 build failure.
      
      Update eigen.
      
      Fix tensor_util_test segmentation fault issue.
      
      Add more macro checks in hip.cmake.
      In the future, we may consider refining hip.cmake to inherit all add_definitions() from the parent scope.
      
      Fix rocRAND load.
      
      Update eigen to fix gru_unit_op and reduce_op.
      
      Add HIP support to testing.
      
      Update eigen to support int16 and int8 in arg min and arg max.
      
      * Add rocprim as the counterpart of the cub library used by the NVIDIA implementation
      
      * Reduce build time in rocprim.
      
      * Add rocprim introduction, remove useless cmake code.
      
      * Remove useless flags and format cmake file.
  2. 22 Nov, 2018 4 commits
    • K
      Bumped MKL-DNN version to 0.17 · 6cc6bf40
      Krzysztof Binias committed
      test=develop
    • C
      fix CAPI binary build issue. · ce7add88
      chuanqiw committed
    • W
      Windows/online (#14474) · d9a1f3e5
      wopeizl committed
      * add recordio support
      
      * disable OpenBLAS multi-threading on windows since it is not supported
      adjust the python script
      
      * code style
      
      * code style
      test=develop
      
      * add create_recordio_file_reader back
      
      * fix code style
      test=develop
      
      * fix the gtest.cmake on windows
      
      * fix cc_test on windows
      
      * fix the win build
      test=develop
      
      * remove fused compile support on windows
      test=develop
      
      * add the jit support
      test=develop
      
      * add the jit support, test=develop
      
      * add the jit support, test=develop
      
      * add the jit back
      fix compile error on windows
      
      * rollback test=develop
      
      * test case fix
      
      * disable DSO by default on windows
      
      * exclude warpctc_op on windows
      
      * exclude dynload_warpctc on windows
      test=develop
      
      * fix the scripts error
      test=develop
      
      * disable avx on windows by default
      test=develop
      
      * re-organize the cmake file
      
      * disable mkl on windows by default
      
      * add warp_ctc back
      
      * fix the dependency
      
      * fix the dependency
      
      * fix the build issue on windows
      
      * remove unsupported flag on windows
      
      * code style
      
      * code style
      test=develop
      
      * fix issue
      
      * add profiler, parallel_executor back
      
      * clean up the pre-definitions on windows
      
      * fix build issue
      
      * test=develop
    • S
      add dlpack support · 3912545f
      sneaxiy committed
      test=develop
  3. 19 Nov, 2018 3 commits
  4. 16 Nov, 2018 2 commits
    • W
      Refine operator cmake (#14413) · a2d9b344
      Wu Yi committed
      * wip simplify operator framework
      
      * wip
      
      * wip
      
      * done test=develop
      
      * clean test=develop
      
      * fix test=develop
      
      * fix deps test=develop
      
      * fix cpu build test=develop
      
      * fix tensorrt build test=develop
      
      * fix tests test=develop
      
      * fix test=develop
      
      * fix cpu build test=develop
    • Y
      fix inference on gpu out of mem (#14414) · 7796f65f
      Yan Chunwei committed
      * fix GPU out-of-memory error during inference
      
      The transfer logic in operator.cc keeps creating new scopes.
  5. 14 Nov, 2018 4 commits
  6. 12 Nov, 2018 3 commits
  7. 10 Nov, 2018 1 commit
  8. 09 Nov, 2018 1 commit
  9. 08 Nov, 2018 7 commits
  10. 07 Nov, 2018 5 commits
  11. 06 Nov, 2018 2 commits
    • P
      gpu support, fix build issue: · 1f12ba61
      peizhilin committed
      1. Non-UTF-8 characters in OP comments may cause protobuf to fail in parse_from_string
      2. Comment out some ops that are not supported on Windows
      3. CUDA libs may not be linked correctly to the target on Windows
    • T
      fix avx512f flag (#14041) · d277a2e6
      tensor-tang committed
  12. 05 Nov, 2018 2 commits
  13. 02 Nov, 2018 1 commit
  14. 01 Nov, 2018 1 commit
  15. 31 Oct, 2018 1 commit
  16. 30 Oct, 2018 1 commit
  17. 29 Oct, 2018 1 commit