- 28 4月, 2019 2 次提交
-
-
由 Zeng Jinle 提交于
* refine_dropout_mem,test=develop * # This is a combination of 14 commits. # The first commit's message is: remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066) # This is the 2nd commit message: Fleet unify distributed training (#16791) * implement distributed transpiler with fleet # This is the 3rd commit message: ParallelDyGraph with GPU collective mode (#16827) implement dygraph.parallel.DataParallel to hook reduce op. # This is the 4th commit message: Init mixed precision training interface (#16856) * Init mixed precision training interface * Add fp16 test script test=develop * All initializers support float16 test=develop * Code cleanup & add more code annotations test=develop * Update API spec test=develop * Add usage example in doc test=develop # This is the 5th commit message: fix reference_count_pass,test=develop (#17060) test=develop # This is the 6th commit message: Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090) * Cache the information of linear interpolation in forward and use it in backward. test=develop * Fix cuda kernel. test=develop # This is the 7th commit message: remove unnecessary prepare_data (#17080) test=develop # This is the 8th commit message: fix interpolate cu. test=develop (#17101) # This is the 9th commit message: test=develop, double backward leaky_relu (#17067) backward of backward: leaky_relu # This is the 10th commit message: fix fuse optimizer ops (#17102) test=develop # This is the 11th commit message: truncated_gaussian_random supported in distributed training, test=develop (#17091) # This is the 12th commit message: Detailed coordinate description for yolov3 loss (#17007) * Detailed coordinate description for yolov3 loss test=develop * modified api.spec test=develop * modified loss name * fix api.spec test=develop * polish description test=develop * modified api.spec test=develop # This is the 13th commit message: fix test_weight_decay (#17109) test=develop # This is the 14th commit message: Path flag (#17105) * fix python/paddle/fluid/__init__.py detecting problems
-
由 Huihuang Zheng 提交于
1. Use CudnnWorkspaceHandle in exhaustive search of conv_cudnn. 2. For Ops using CudnnWorkspaceHandle in exhaustive search, release their GPU memory after exhaustive search. test=develop
-
- 23 4月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* make_conv_cudnn_ws_size_configurable, test=develop * change std::max to std::min test=develop
-
- 21 2月, 2019 1 次提交
-
-
由 Xin Pan 提交于
test=develop
-
- 14 1月, 2019 1 次提交
-
- 11 1月, 2019 2 次提交
-
-
由 chengduozh 提交于
test=develop This reverts commit 064512aa.
-
由 chengduo 提交于
* remove workspace_handle in conv2d_cudnn test=develop * remove workspace_handle test=develop * fix bug test=develop * make test_conv2d_op SERIAL test=develop * save memory in conv_cudnn test=develop * enhance thread safety test=develop * enhance temporary allocator test=develop * Add excess fraction test=develop * follow comments test=develop * fix bug and code refine test=develop * fix memory size check test=develop * rename reuse_tmp_allocation_excess_fraction test=develop
-
- 10 1月, 2019 1 次提交
-
-
由 Wu Yi 提交于
* clip softmax for fp16 * updates * fuse xent support fp16 test=develop * wip * wip * add simple row reduce * wip fp16 accurate softmax * add accurate softmax kernel for fp16 test=develop * update test=develop * fix cpu build test=develop * update api.spec test=develop * follow comments test=develop * fix build test=develop * fix trt build test=develop * fix inference build test=develop * fix merge test=develop * update test=develop * try fix build test=develop * fix build test=develop * rename real_exp test=develop * fortest * remove hacky kernels test=develop * clean up test=develop
-
- 26 11月, 2018 1 次提交
-
-
由 minqiyang 提交于
test=develop
-
- 19 11月, 2018 1 次提交
-
-
由 qingqing01 提交于
* Convolution fusion operator. * Clean code test=develop
-
- 13 11月, 2018 1 次提交
-
-
由 Dang Qingqing 提交于
test=develop
-
- 09 11月, 2018 1 次提交
-
-
由 qingqing01 提交于
* exhaustive search for cuDNN conv. * Refine code and add unit testing. * Fix model load in fluid/inference and unit testing in conv2d * Follow comments. * Fix compiling test=develop
-
- 08 11月, 2018 1 次提交
-
-
由 minqiyang 提交于
Fix code to support cpplint syntax check test=develop
-
- 07 11月, 2018 3 次提交
-
-
由 chengduo 提交于
* add fp16 backward support test=develop * add sum_op fp16 test * disable test_dist_save_load test=develop * add check_grad for sum * add unit test for softmax_grad fp16 test=develop * add scale_op unit test * add mul_grad_op unit test for fp16 * add cross_entropy_grad and eman_grad unit test for fp16 test=develop * fix cross_entropy unit test * add pool2d fp16 unit test * refine conv2d fp16 unit test test=develop * refine activation unit test test=develop * fix ci test=develop * follow zhihong's comment, copy from https://github.com/PaddlePaddle/Paddle/pull/12796 test=develop
-
由 qingqing01 提交于
This reverts commit ce7d9b07.
-
由 qingqing01 提交于
* exhaustive search for cuDNN conv. * Refine code and add unit testing. * Clean code * Fix model load in fluid/inference and unit testing in conv2d * Follow comments.
-
- 05 11月, 2018 1 次提交
-
-
由 dzhwinter 提交于
-
- 26 10月, 2018 1 次提交
-
-
由 dzhwinter 提交于
-
- 25 10月, 2018 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 16 10月, 2018 1 次提交
-
-
由 dzhwinter 提交于
-
- 04 9月, 2018 2 次提交
-
-
由 guochaorong 提交于
- 31 8月, 2018 1 次提交
-
-
由 fengjiayi 提交于
-
- 30 8月, 2018 1 次提交
-
-
由 fengjiayi 提交于
-
- 17 8月, 2018 1 次提交
-
- 16 8月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "cherry picked operators changes" * "remove duplicated code" * "add constant setter" * "add get expected kernel" * "fix ci" * "add fill constant"
-
- 31 7月, 2018 1 次提交
-
-
由 Yu Yang 提交于
* Introduced by #11205
-
- 26 7月, 2018 1 次提交
-
-
由 Wu Yi 提交于
* refine conv cudnn enforce * update * update all cudnn ops * fix
-
- 06 6月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "fix deterministic" * "fix ci" * "fix init"
-
- 07 5月, 2018 1 次提交
-
-
由 Kexin Zhao 提交于
-
- 28 4月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 04 4月, 2018 1 次提交
-
-
由 Kexin Zhao 提交于
-
- 17 3月, 2018 1 次提交
-
-
由 Kexin Zhao 提交于
-
- 16 3月, 2018 3 次提交
-
-
由 Kexin Zhao 提交于
-
由 Kexin Zhao 提交于
-
由 Kexin Zhao 提交于
-
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 14 1月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "unified operators" * "add CUDNN register" * "add use cudnn attribute" * "add attribute" * "test conv tranpose op" * "remove duplicated attr" * "fix op test" * "add attribute to set cudnn" * "add more log" * "need layout op register support" * "add more log" * "change GetExpectedKernelType " * "fix Get attr in conv_op" * "fix CI" * "fix tests" * "removed kernel priority fallback" * "fix CI" * "fix stack pointer bug" * "refine buggy interface" * "add const cast to save life" * "fix get_output_with_grad" * "fix op test with dataformat" * ""fix pooling * "fix pooling test" * "fix CI" * "fix with_gpu error" * "add transform needed functional check" * "fix unpack list error" * "comment out parallel.do temporary" * "fix CI" * "fix compile doc error" * "make threshold larger"
-