- 29 Apr, 2019 2 commits
-
-
Committed by wopeizl
* 1. move the API check into CPU process 2. adjust the check order
-
Committed by tianshuo78520a
-
- 28 Apr, 2019 2 commits
-
-
Committed by Zeng Jinle
* refine_dropout_mem,test=develop
* # This is a combination of 14 commits.
# The first commit's message is: remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066)
# This is the 2nd commit message: Fleet unify distributed training (#16791) * implement distributed transpiler with fleet
# This is the 3rd commit message: ParallelDyGraph with GPU collective mode (#16827) implement dygraph.parallel.DataParallel to hook reduce op.
# This is the 4th commit message: Init mixed precision training interface (#16856) * Init mixed precision training interface * Add fp16 test script test=develop * All initializers support float16 test=develop * Code cleanup & add more code annotations test=develop * Update API spec test=develop * Add usage example in doc test=develop
# This is the 5th commit message: fix reference_count_pass,test=develop (#17060) test=develop
# This is the 6th commit message: Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090) * Cache the information of linear interpolation in forward and use it in backward. test=develop * Fix cuda kernel. test=develop
# This is the 7th commit message: remove unnecessary prepare_data (#17080) test=develop
# This is the 8th commit message: fix interpolate cu. test=develop (#17101)
# This is the 9th commit message: test=develop, double backward leaky_relu (#17067) backward of backward: leaky_relu
# This is the 10th commit message: fix fuse optimizer ops (#17102) test=develop
# This is the 11th commit message: truncated_gaussian_random supported in distributed training, test=develop (#17091)
# This is the 12th commit message: Detailed coordinate description for yolov3 loss (#17007) * Detailed coordinate description for yolov3 loss test=develop * modified api.spec test=develop * modified loss name * fix api.spec test=develop * polish description test=develop * modified api.spec test=develop
# This is the 13th commit message: fix test_weight_decay (#17109) test=develop
# This is the 14th commit message: Path flag (#17105) * fix python/paddle/fluid/__init__.py detecting problems
-
Committed by Huihuang Zheng
1. Use CudnnWorkspaceHandle in exhaustive search of conv_cudnn. 2. For Ops using CudnnWorkspaceHandle in exhaustive search, release their GPU memory after exhaustive search. test=develop
-
- 27 Apr, 2019 2 commits
-
-
Committed by tianshuo78520a
* fix python/paddle/fluid/__init__.py detecting problems
-
Committed by chengduo
test=develop
-
- 26 Apr, 2019 6 commits
-
-
Committed by xiaoting
* Detailed coordinate description for yolov3 loss test=develop * modified api.spec test=develop * modified loss name * fix api.spec test=develop * polish description test=develop * modified api.spec test=develop
-
Committed by tangwei12
-
Committed by chengduo
test=develop
-
Committed by ceci3
backward of backward: leaky_relu
-
Committed by Kaipeng Deng
-
Committed by Tao Luo
test=develop
-
- 25 Apr, 2019 6 commits
-
-
Committed by whs
Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090) * Cache the information of linear interpolation in forward and use it in backward. test=develop * Fix cuda kernel. test=develop
-
Committed by Zeng Jinle
test=develop
-
Committed by Yibing Liu
* Init mixed precision training interface * Add fp16 test script test=develop * All initializers support float16 test=develop * Code cleanup & add more code annotations test=develop * Update API spec test=develop * Add usage example in doc test=develop
-
Committed by Yan Xu
implement dygraph.parallel.DataParallel to hook reduce op.
-
Committed by tangwei12
* implement distributed transpiler with fleet
-
Committed by tangwei12
-
- 24 Apr, 2019 8 commits
-
-
Committed by Zhen Wang
add checkpoint functions for graph. test=develop
-
Committed by guomingz
* resolve #17057 Fixed the bug that fuse_relu/fuse_residual option couldn't be passed to class TestConv2dInt8Op. test=develop * Fix the bug in the test_conv2d_int8_mkldnn case, which was caused by improper parameter passing. test=develop
-
Committed by Tao Luo
fix runtime_context_cache bug when a gpu model has an op that runs only on cpu
-
Committed by wopeizl
1. specify the cuda arch name and bin to decrease the compile time for inference test=develop 2. simplify the script and add comments 3. remove the fluid process from cicheck
-
Committed by XiaoguangHu
update pip version in Readme to 1.4.1
-
Committed by Cheerego
test=develop
-
Committed by chengduo
test=develop
-
Committed by XiaoguangHu
update Readme and releasenote for 1.4.1
-
- 23 Apr, 2019 14 commits
-
-
Committed by chengduo
* Add fuse momentum ops
-
Committed by guru4elephant
fix nccl wrapper on windows
-
Committed by tangwei12
-
Committed by luotao1
test=develop
-
Committed by Zeng Jinle
* make_conv_cudnn_ws_size_configurable, test=develop * change std::max to std::min test=develop
-
Committed by jerrywgz
fix potential hang in generate proposals, test=develop
-
Committed by Kaipeng Deng
fix split for dimension judgment
-
Committed by Tao Luo
update ngraph version
-
Committed by tink2123
test=develop
-
Committed by shanyi15
-
Committed by chengduo
test=develop
-
Committed by Tao Luo
reduce unittest time by renaming testcuda to has_cuda
-
Committed by qingqing01
Support backward of backward for Relu and add a new gradient checker by comparing theoretical and numerical Jacobian. (#16862) * Support backward of backward and a new gradient checker * Rename decorators.py to decorator_helper.py, since Python on Windows CI has decorators package. 1. Add ReluDoubleGradMaker when register relu_grad. 2. Add a new gradient checker by comparing theoretical and numerical Jacobian. Check double gradients by double_grad_check.
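The idea of checking gradients by comparing a theoretical Jacobian against a numerical one can be sketched in plain Python. This is an illustration only, not the checker added in this commit: the function names (`relu`, `relu_jacobian`, `numerical_jacobian`, `jacobians_close`), the central-difference step `eps`, and the tolerance `atol` are all assumptions, and the sketch covers only the first-order Jacobian of ReLU rather than double gradients.

```python
def relu(x):
    """Element-wise ReLU on a list of floats."""
    return [max(v, 0.0) for v in x]


def relu_jacobian(x):
    """Theoretical Jacobian of element-wise ReLU: diagonal, 1 where x > 0."""
    n = len(x)
    return [[1.0 if (i == j and x[j] > 0) else 0.0 for j in range(n)]
            for i in range(n)]


def numerical_jacobian(f, x, eps=1e-6):
    """Approximate d f_i / d x_j with central differences."""
    n = len(x)
    m = len(f(x))
    jac = [[0.0] * n for _ in range(m)]
    for j in range(n):
        x_plus = list(x)
        x_minus = list(x)
        x_plus[j] += eps
        x_minus[j] -= eps
        y_plus, y_minus = f(x_plus), f(x_minus)
        for i in range(m):
            jac[i][j] = (y_plus[i] - y_minus[i]) / (2 * eps)
    return jac


def jacobians_close(a, b, atol=1e-4):
    """True if every pair of entries differs by at most atol."""
    return all(abs(p - q) <= atol
               for row_a, row_b in zip(a, b)
               for p, q in zip(row_a, row_b))


# Inputs are kept away from 0, where ReLU is not differentiable.
x = [-1.5, 0.3, 2.0]
ok = jacobians_close(relu_jacobian(x), numerical_jacobian(relu, x))
```

A double-gradient check would follow the same pattern, applied to the gradient function instead of the forward function.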
-
Committed by lujun
fix bug in save, test=develop
-