- 16 3月, 2018 1 次提交
-
-
由 sabreshao 提交于
1. Add option WITH_AMD_GPU. 2. Add cmake/hip.cmake for HIP toolchain. 3. Some external module such as eigen may need HIP port. 4. Add macro hip_library/hip_binary/hip_test to cmake/generic.cmake. 5. Add one HIP source concat.hip.cu as an example. Each .cu may have its corresponding .hip.cu.
-
- 13 3月, 2018 1 次提交
-
-
由 QI JUN 提交于
* fix nccl op unit test * fix build error * format code * refine nccl related unit test * fix build error * add setGPUData * clean up * follow comments * rm test_nccl.cu * follow comment * rm wait
-
- 08 3月, 2018 1 次提交
-
-
由 Tao Luo 提交于
* compile and install the static library of fluid inference * fix dynload_cuda not in CPU mode * update shared library and adjust the deploy of openblas * adjust the deploy of openblas * * auto add all fluid modules for static library * use libprotobuf.a instead of libprotobuf-lite.a for profiler * use set_property to set the global varible instead of ENV * add gpu depends of fluid modules, auto add inference_lib_dist depends * change the condition of openblas_lib, and fix a typo
-
- 06 3月, 2018 2 次提交
-
-
由 qiaolongfei 提交于
-
由 Yu Yang 提交于
-
- 05 3月, 2018 1 次提交
-
-
由 dongzhihong 提交于
-
- 01 3月, 2018 2 次提交
- 28 2月, 2018 1 次提交
-
-
由 Yu Yang 提交于
* Make CUPTI_LIB_PATH not passing by macro. * Add missing header
-
- 26 2月, 2018 1 次提交
-
-
由 Xin Pan 提交于
-
- 15 2月, 2018 1 次提交
-
-
由 Yi Wang 提交于
-
- 14 2月, 2018 1 次提交
-
-
由 Yang Yang 提交于
-
- 12 2月, 2018 3 次提交
-
-
由 Jacek Czaja 提交于
-
由 qingqing01 提交于
-
由 Tao Luo 提交于
* refine inference_lib_dist after code move, and add it to docker/build.sh * remove is_directory in inference_lib.cmake
-
- 08 2月, 2018 2 次提交
- 07 2月, 2018 1 次提交
-
-
由 Yu Yang 提交于
* It will sightly faster compile and make warpctc compile well on CUDA 9 and GCC 5.5
-
- 06 2月, 2018 2 次提交
- 05 2月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 01 2月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 30 1月, 2018 2 次提交
- 27 1月, 2018 1 次提交
-
-
由 yi.wu 提交于
-
- 25 1月, 2018 1 次提交
-
-
由 yi.wu 提交于
-
- 23 1月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 22 1月, 2018 2 次提交
-
-
由 dangqingqing 提交于
-
由 dangqingqing 提交于
-
- 21 1月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "fix decode bug" * "follow commnet" * "fix error" * "fix hook bug" * fix based comment * fix copyright * fix based on comment
-
- 20 1月, 2018 1 次提交
-
-
由 dangqingqing 提交于
-
- 19 1月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 16 1月, 2018 2 次提交
-
-
由 dangqingqing 提交于
-
由 Luo Tao 提交于
-
- 15 1月, 2018 2 次提交
-
-
由 gongweibao 提交于
Fix grpc bugs
-
由 dzhwinter 提交于
* add copyright hook * add copyright hook * refine copyright hook * "test copyright hook" * fix check style * fix ci
-
- 09 1月, 2018 1 次提交
-
-
由 Yiqun Liu 提交于
* Add Seq2BatchFunctor, which will be used in WarpCTCOp. * Implement WrapCTCFunctor and WrapCTCKernel. * Add unittest of warpctc_op. * Modify the check_output inferface in python unittest framework to allow check a subset of outputs. * Use absolute offset lod in warpctc_op and related functors. * Refine the comments of warpctc_op. * The new python unittest supports checking a subset of the outputs, so revoke the previous change. * Rename the transform from LoDTensor to Tensor with shape [max_sequence_length, num_sequences, sequence_width] to PaddingSequenceFunctor. * Update to the newest codes. * Rename the PaddingSequenceFunctor to PaddingLoDTensorFunctor and remove the computation of dimensions out of the functos.
-
- 06 1月, 2018 1 次提交
-
-
由 tensor-tang 提交于
-
- 05 1月, 2018 1 次提交
-
-
由 tensor-tang 提交于
-
- 04 1月, 2018 1 次提交
-
-
由 tensor-tang 提交于
-