- 23 2月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* move unbind to phi * revert infer shape * add header file * move concat_and_split to phi
-
- 20 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 21 1月, 2022 1 次提交
-
-
由 chentianyu03 提交于
-
- 17 1月, 2022 1 次提交
-
-
由 Wilber 提交于
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
- 13 1月, 2022 1 次提交
-
-
由 石晓伟 提交于
-
- 08 12月, 2021 1 次提交
-
-
由 sneaxiy 提交于
* fix CUDA Graph H2D bug again * fix no return bug
-
- 03 12月, 2021 1 次提交
-
-
由 ronnywang 提交于
* refine structure for cuda and rocm * update * update * update * update
-
- 09 11月, 2021 1 次提交
-
-
由 Zeng Jinle 提交于
* try to fix CUDA Graph H2D copy bug * remove useless code * fix ci * fix ROCM CI * fix CUDA_VERSION * improve CI coverage
-
- 09 8月, 2021 1 次提交
-
-
由 Leo Chen 提交于
-
- 22 7月, 2021 1 次提交
-
-
由 wuhuachaocoding 提交于
-
- 07 7月, 2021 1 次提交
-
-
由 xiayanming 提交于
-
- 31 3月, 2021 1 次提交
-
-
由 Thunderbrook 提交于
* fix split core * format
-
- 11 5月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add new macro BOOST_GET_SAFELY & unittests, test=develop * add different macro type, test=develop * fix get macro type in executor, test=develop * four macro part change backup * using one macro for all case, test=develop * revert attribute change, test=develop * change to three func to solve gcc4.8 bug, test=develop * polish some details, test=develop
-
- 11 9月, 2019 1 次提交
-
-
由 Huihuang Zheng 提交于
TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory. We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton. Also added data_feed_proto to operator to fix CI in CPU compilation
-
- 12 6月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
Optimize the concat and split cuda implementation for cases when the number of inputs/outputs is less than 5. (#17979) test=develop
-
- 29 5月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
Optimize the concat and split kernel for specical cases when the number of inputs/outputs is 2 (#17415) * Optimize the concat and split kernel for special cases that the number of inputs/outputs is 2. test=develop * Refine codes. test=develop * Correct the condition. test=develop * Move the define of tmp_data outside the if statement. * Print the cudnn minor version. test=develop * Fix the case when in_num/o_num is 1 in concat/split op. test=develop * Remove const_cast. test=develop
-
- 25 12月, 2018 1 次提交
-
-
由 chengduo 提交于
* refine tensor test=develop * refine tensor test=develop * fix device_context log test=develop
-
- 21 12月, 2018 2 次提交
-
-
由 chengduo 提交于
* Add Temporal Allocator * add Temporay Allocator to DeviceContext test=develop * code refine test=develop * fix mean_iou test=develop * Add DeviceTemporaryAllocator test=develop * fix conv_op bug test=develop * small fix test=develop * code refine test=develop * log refine test=develop * fix unit test test=develop * move double check * refine concat_and_split test=develop * add limit_of_temporary_allocation test=develop * fix name test=develop
-
由 minqiyang 提交于
test=develop
-
- 20 12月, 2018 1 次提交
-
-
由 minqiyang 提交于
-
- 23 10月, 2018 1 次提交
-
-
由 chengduo 提交于
* speedup split_op test=develop * speedup split_op test=develop * rename ConcatGrad to Split * refine concat and split test=develop * fix compile error
-
- 18 9月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 17 9月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 27 8月, 2018 1 次提交
-
-
由 dzhwinter 提交于
-
- 22 8月, 2018 1 次提交
-
-
由 Yu Yang 提交于
-
- 19 6月, 2018 2 次提交
-
-
由 qiaolongfei 提交于
-
由 qingqing01 提交于
* Make the CUDA kernel of concat correct and fix unit tests.
-
- 17 6月, 2018 1 次提交
-
-
由 qiaolongfei 提交于
-
- 02 5月, 2018 1 次提交
-
-
由 Abhinav Arora 提交于
* Fix CPPLint issues in lstm_cpu_kernel.h * Fix CPPLint issues in math/math_function_test * Fix CPPLint issues in math/math_function_test * Fix CPPLint issues in math/concat.cc * Fix CPPLint issues in math/concat.cc * Fix CPPLint issues in math/concat.cc * Fix CPPLint issues in math/gru_cpu_kernel * Fix CPPLint issues in math/selected_rows_functor_test.cu * Fix compile error * Fix compile error
-
- 30 4月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "re-commit " * "picked up" * "fix ci" * "fix pdb hang up issue in cuda 9"
-
- 23 3月, 2018 2 次提交
-
-
由 chengduoZH 提交于
-
由 chengduoZH 提交于
-
- 05 3月, 2018 2 次提交
-
-
由 chengduoZH 提交于
-
由 chengduoZH 提交于
-
- 03 3月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 02 3月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-