- 23 11月, 2018 1 次提交
-
-
由 sabreshao 提交于
* HIP cmake. Enable whole archieve build for pybind library. Disable two warning. Rollback to C++11. Link RCCL to WA gpu kernel loading issue. Update eigen to fix build failure. Add more include directories. Fix O3 build failure. Update eigen. fix tensor_util_test segment fault issue add more macro check in hip.cmake. we may consider refine hip.cmake to inherit all add_definitions() in parrent scope, in the future. Fix rocRAND load. Update eigen to fix gru_unit_op and reduce_op. Add HIP support to testing. Update eigen to support int16 and int8 in arg min and arg max. * add rocprim as cub library used by nv implementation * Reduce build time in rocprim. * Add rocprim introduction, remove useless cmake code. * Remove useless flags and format cmake file.
-
- 22 11月, 2018 1 次提交
-
-
由 wopeizl 提交于
* add recordio support * disable the openblas multi-thread on windows since no support adjust the python script * code style * code style test=develop * add create_recordio_file_reader back * fix code style test=develop * fix the gtest.cmake on windows * fix cc_test on windows * fix the win build test=develop * remove fused compile support on windows test=develop * add the jit support test=develop * add the jit support, test=develop * add the jit support, test=develop * add the jit back fix compile error on windows * rollback test=develop * test case fix * disable DSO by default on windows * exclude warpctc_op on windows * exclude the dynload_warpctc out on windows test=develop * fix the scripts error test=develop * disable avx on windows by default test=develop * re-organize the cmake file * disable mkl on windows by default * add warp_ctc back * fix the dependency * fix the dependency * fix the build issue on windows * remove unsupported flag on windows * code style * code style test=develop * fix issue * add profiler, parallel_executor back * clean up the pre-definitions on windows * fix build issue * test=develop
-
- 21 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
-
- 16 11月, 2018 1 次提交
-
-
由 Wu Yi 提交于
* wip simplify operator framework * wip * wip * done test=develop * clean test=develop * fix test=develop * fix deps test=develop * fix cpu build test=develop * fix tensorrt build test=develop * fix tests test=develop * fix test=develop * fix cpu build test=develop
-
- 15 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
-
- 06 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
1. Non utf-8 characters within comments of OPs may lead to protobuf fail to parse_from_string 2. comment out some ops which not supported on windows 3. cuda libs may not be correctly linked to target on windows
-
- 05 11月, 2018 2 次提交
- 19 10月, 2018 1 次提交
-
-
由 typhoonzero 提交于
-
- 17 10月, 2018 1 次提交
-
-
由 typhoonzero 提交于
-
- 25 9月, 2018 1 次提交
-
-
由 Xin Pan 提交于
-
- 25 8月, 2018 1 次提交
-
-
由 dzhwinter 提交于
-
- 24 8月, 2018 2 次提交
- 21 8月, 2018 1 次提交
-
-
由 dzhwinter 提交于
-
- 17 8月, 2018 1 次提交
-
-
由 dzhwinter 提交于
-
- 05 7月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* move to platform * "move init from framework to platform" * "remove used init" * "fix ci" * "fix ci" * "fix generic" * "fix ci" * "fix ci" * "fix ci" * "disable fragile test"
-
- 12 4月, 2018 1 次提交
-
-
由 Yang Yang(Tony) 提交于
-
- 08 4月, 2018 1 次提交
-
-
由 Yi Wang 提交于
-
- 07 4月, 2018 1 次提交
-
-
由 Yi Wang 提交于
* cpplint test and add tesnor_py_test.cc * Update * Update
-
- 22 3月, 2018 1 次提交
-
-
由 sabreshao 提交于
Fix CI.
-
- 16 3月, 2018 1 次提交
-
-
由 sabreshao 提交于
1. Add option WITH_AMD_GPU. 2. Add cmake/hip.cmake for HIP toolchain. 3. Some external module such as eigen may need HIP port. 4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake. 5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
-
- 14 3月, 2018 1 次提交
-
-
由 Yu Yang 提交于
-
- 07 3月, 2018 1 次提交
-
-
由 Yu Yang 提交于
-
- 10 2月, 2018 2 次提交
- 26 1月, 2018 1 次提交
-
-
由 kexinzhao 提交于
* initial commit * add new executor run function * fix bug * fix multiple definition of feed_fetch_method issue * fix cmake * fix tensor copy error * refine executor code * add comments * temporary modification * address comments * fix bug
-
- 16 1月, 2018 1 次提交
-
-
由 dangqingqing 提交于
-
- 01 1月, 2018 1 次提交
-
-
由 Tao Luo 提交于
-
- 27 12月, 2017 1 次提交
-
-
由 typhoonzero 提交于
-
- 19 12月, 2017 1 次提交
-
-
由 qiaolongfei 提交于
-
- 18 12月, 2017 1 次提交
-
-
由 dzhwinter 提交于
* "add DeviceContextPool" * "add devicecontextpool in pybind" * "add comments in python side " * "fix static link error" * "fix CI error" * "add executor.py" * "fix CI error" * "add with gpu macro" * "remove comment out codes" * "add TODO items" * "update init devices"
-
- 30 11月, 2017 1 次提交
-
-
由 Luo Tao 提交于
-
- 26 11月, 2017 1 次提交
-
-
由 dzhwinter 提交于
* "make global tensor function independently" * "replace functor" * "fix inline template error" * "fix tensor array with CopyFrom" * "fix other case use CopyFrom" * "move the op interface hardly" * "fix operators" * "fix typo" * "delete dynamic recurrent rnn and fix gru_unit in debugmode" * "fix unique_ptr copy" * "fix cuda copy" * "fix namespace error" * "removed nccl python test" * "fix include error" * "fix typo" * fix copy util test
-
- 28 10月, 2017 1 次提交
-
-
由 fengjiayi 提交于
* Add `dump_to_file()` for ProgrameDescBind in pybind * Update * Add utility.py * typo * Fix bugs * Move add_feed/fetch_components to untility.py * Compelete dump * Follow comments * Change output of Prune() from inference to pointer * Expose Prune() to Python * Compelete save/load API of inference model * Fix errors * Debuging * Compelete unit tests * follow comments
-
- 24 10月, 2017 1 次提交
-
-
由 Yi Wang 提交于
* Add print_operators_doc.cc * Update Escape * Correct a bug * Remove OpInfoMap::Iterate * Update the print_operators_doc.cc * Escape tab * Use auto& * Use auto& * Remove trailing , * clang-format C++
-
- 16 10月, 2017 1 次提交
-
-
由 fengjiayi 提交于
* Expose Executor to Python * Follow comments
-
- 12 10月, 2017 1 次提交
-
-
由 武毅 提交于
* add cudnn_conv_op * WIP * update * update * fix grad check * use platform::memory * add support group for cudnn * update * follow comments * fix onlycpu build * update cuda define * follow comments * follow comments * merge with updates * fix compile error * follow comments * follow comments
-
- 07 10月, 2017 1 次提交
-
-
由 Yan Chunwei 提交于
-
- 28 9月, 2017 1 次提交
-
-
由 fengjiayi 提交于
-