- 07 Nov, 2018 6 commits
-
-
Committed by Qiao Longfei
Optimize thread pool
-
Committed by chengduo
* add fp16 backward support test=develop
* add sum_op fp16 test
* disable test_dist_save_load test=develop
* add check_grad for sum
* add unit test for softmax_grad fp16 test=develop
* add scale_op unit test
* add mul_grad_op unit test for fp16
* add cross_entropy_grad and eman_grad unit test for fp16 test=develop
* fix cross_entropy unit test
* add pool2d fp16 unit test
* refine conv2d fp16 unit test test=develop
* refine activation unit test test=develop
* fix ci test=develop
* follow zhihong's comment, copy from https://github.com/PaddlePaddle/Paddle/pull/12796 test=develop
-
Committed by Qiao Longfei
test=develop
-
Committed by Xin Pan
Revert " Exhaustive search for cuDNN conv."
-
Committed by qingqing01
This reverts commit ce7d9b07.
-
Committed by qingqing01
* exhaustive search for cuDNN conv.
* Refine code and add unit testing.
* Clean code
* Fix model load in fluid/inference and unit testing in conv2d
* Follow comments.
-
- 06 Nov, 2018 22 commits
-
-
Committed by Zeng Jinle
Fix rmsprop_op enforce bug
-
Committed by chengduo
test=develop
-
Committed by tensor-tang
fix jit on mac
-
Committed by Zeng Jinle
Remove some locks in ParallelExecutor
-
Committed by Zeng Jinle
Stream Callback Support in CUDA 10
-
Committed by Wu Yi
run dist tests in serial
-
Committed by tensor-tang
test=develop
-
Committed by sneaxiy
test=develop
-
Committed by sneaxiy
test=develop
-
Committed by Wu Yi
-
Committed by Zhen Wang
* add dam test
* update fuse_statis
* use separated dam model.
* Revert "use separated dam model." This reverts commit 13e775c86f909b164b7cc1d35a8a24b964ec622e.
* test=develop
* modify the cmake file about infer test, test=develop.
* remove one comment, test=develop.
-
Committed by typhoonzero
-
Committed by sneaxiy
test=develop
-
Committed by Yu Yang
Warning only at first when CUDA CC not matched. test=develop
-
-
Committed by whs
* Fix build error of affine grid op in mac os. test=develop
* Make function return reference. test=develop
-
Committed by Qiao Longfei
-
Committed by Qiao Longfei
-
Committed by tensor-tang
Refine/jit/vmulcode
-
Committed by tensor-tang
-
Committed by Zeng Jinle
Fix lod_level share bug in read_op
-
Committed by Wu Yi
* wip
* add ref_by_trainer_id op
* ready to test
* fix ref inputs
* refine rpc_op_handle
* fix merge bug
-
- 05 Nov, 2018 12 commits
-
-
Committed by tensor-tang
Residual data reorder in MKLDNN convolution
-
Committed by tensor-tang
Fix avx illegal instructions
-
Committed by tensor-tang
throw error when mismatch cpu avx version
-
Committed by sneaxiy
test=develop
-
Committed by tensor-tang
test=develop
-
Committed by tensor-tang
-
Committed by tensor-tang
-
Committed by tensor-tang
Fea/jit/gen
-
Committed by Xin Pan
fix to only check block 0
-
Committed by tensor-tang
test=develop
-
Committed by tensor-tang
test=develop
-
Committed by tensor-tang
test=develop
-