- 06 Dec, 2019 2 commits
-
-
Committed by bingyanghuang
-
Committed by Aurelius84
-
- 05 Dec, 2019 1 commit
-
-
Committed by lilong12
* fix the computation for dx (grad for x) for prelu operation. (#20949)
* set the default value of alpha for prelu to 0.25, test=develop
* add the call to __syncthreads(), test=develop
* fix the implementation of cpu prelu, test=develop
* repair the implementation of element mode prelu, test=develop
* modify test_prelu_op.py, test=develop
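For readers unfamiliar with the operation this commit touches, the following is a minimal pure-Python sketch of the textbook PReLU forward pass and its gradients, assuming a single shared alpha. The function names `prelu_forward` and `prelu_backward` are hypothetical and are not taken from the Paddle sources; only the default alpha of 0.25 comes from the commit message.

```python
def prelu_forward(x, alpha=0.25):
    """PReLU: pass positive inputs through, scale the rest by alpha."""
    return [v if v > 0 else alpha * v for v in x]


def prelu_backward(x, dout, alpha=0.25):
    """Return (dx, dalpha) for a single shared alpha (assumed mode).

    dx is the gradient with respect to the input x,
    dalpha is the gradient with respect to the shared slope.
    """
    # dx: upstream gradient is passed through for x > 0, scaled by alpha otherwise.
    dx = [g if v > 0 else alpha * g for v, g in zip(x, dout)]
    # dalpha: only non-positive inputs depend on alpha.
    dalpha = sum(g * v for v, g in zip(x, dout) if v <= 0)
    return dx, dalpha


if __name__ == "__main__":
    x = [-2.0, 3.0]
    print(prelu_forward(x))
    print(prelu_backward(x, [1.0, 1.0]))
```

The element mode mentioned in the commit would presumably keep one alpha per input element rather than a shared scalar; that variant is not shown here.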
-
- 04 Dec, 2019 2 commits
-
-
Committed by WangXi
-
Committed by bingyanghuang
-
- 03 Dec, 2019 10 commits
-
-
Committed by lilong12
* set dim[0] to -1 if dim[0] < 0 and remove assertion to runtime, test=develop
-
Committed by Lv Mengsi
* fix transpose conv, test=develop
* fix comments, test=develop
-
Committed by zhaoyuchen2018
* Improve argsort performance.
  - Running argsort on 200000 elements on a V100 is ~190x faster: 0.53s before optimization, 0.0027s after.
  - Add fp16 support
* Refine error message
* Refine code
* Add descending sort
test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
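As a reminder of what an argsort with a descending option computes, here is a small reference sketch in plain Python. The function name `argsort` and the `descending` parameter are illustrative assumptions and do not reproduce the signature or the GPU implementation touched by this commit.

```python
def argsort(values, descending=False):
    """Return (sorted_values, indices) such that sorted_values[j] == values[indices[j]]."""
    # Sort positions by the value they point to; Python's sort is stable,
    # so equal values keep their original relative order.
    indices = sorted(range(len(values)), key=lambda i: values[i], reverse=descending)
    return [values[i] for i in indices], indices


if __name__ == "__main__":
    print(argsort([3, 1, 2]))
    print(argsort([3, 1, 2], descending=True))
```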
-
Committed by Kaipeng Deng
* add Adam beta1/beta2 support Variable. test=develop
-
Committed by zhaoyuchen2018
* Add asymmetric padding for conv fusion. test=develop reference: pr/20042
* Fix eigen build link error
* Change back file mode
* Use math function & add more checks.
-
Committed by lilong12
* add the framework support for distfc and ut, test=develop
* fix the implementation of shard_index_op, test=develop
-
Committed by Kaipeng Deng
* batch_norm momentum support variable. test=develop
-
Committed by Pei Yang
-
Committed by bingyanghuang
-
Committed by wangguanzhong
-
- 02 Dec, 2019 3 commits
-
-
Committed by zhaoyuchen2018
* Improve topk performance. Running topk on 200000 elements costs 1s before optimization and 0.0028s after.
* Refine return value.
* Add cuda util functions.
* Fix ComputeBlockSize bug & refine comments.
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
Committed by zhaoyuchen2018
The op should handle k=1024.
Fix seq_len < warpsize error.
test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
Committed by zhaoyuchen2018
seems shuffle_sync cannot handle small size
test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
- 30 Nov, 2019 1 commit
-
-
Committed by Yihua Xu
* Fix the crash issue when scale or bias was a null pointer.
* Add the error message for passing CI. test=release/1.6
-
- 29 Nov, 2019 1 commit
-
-
Committed by WangXi
-
- 26 Nov, 2019 4 commits
-
-
Committed by Lv Mengsi
* Fix gradients (#20857)
* fix_gradients
* fix_gradients, test=develop
* fix instance norm (#21042)
* fix instance norm
* update unittest, test=develop
* fix_bn
* revert unittest, test=develop
-
Committed by bingyanghuang
-
Committed by WangXi
-
Committed by WangXi
-
- 25 Nov, 2019 2 commits
-
-
Committed by Zhang Ting
* [cherry-pick] All elements in attr(shape) of crop_tensor can be -1 and int32/64 kernel registered (#20756)
* All elements in attr(shape) of crop_tensor can be -1, test=develop, test=document_preview
* fix the bug that attr(offsets) should be initialized, test=develop
* [cherry-pick] maxout supports channel_last input (#20846)
* maxout supports channel_last input, test=develop
* modified details of Input(X) and Attr(groups, axis) in doc, test=develop
* [cherry-pick] lrn supports channel_last input, test=develop (#20954)
-
Committed by Chen Weihang
* add pre condition check for fuse optimizer op pass, test=develop
* add log & set init to zero, test=develop
* fix test_fuse_all_reduce_pass failed, test=develop
* polish details, test=develop
* refine PADDLE_ENFORCE & remove needless VLOG, test=develop
* refactor op check method, test=develop
-
- 24 Nov, 2019 1 commit
-
-
Committed by Adam
test=develop
-
- 23 Nov, 2019 1 commit
-
-
Committed by Kaipeng Deng
* fix elementwise_mod FP kernel. test=develop
* fix unittest. test=develop
-
- 21 Nov, 2019 2 commits
-
-
Committed by Chen Weihang
* delete paddle infershape enforce macro (#20832)
* Polish and arrange code in enforce.h (#20901)
* Enrich the type of error and declare the error type interfaces (#21024)
* Enrich the type of error and declare the error type interfaces, test=develop
* adjust tests to adapt new form, test=develop
* add inference deps with error_codes.pb.h, test=develop
* restore stack iter start pos, test=develop
* polish code based on review comments, test=develop
* Add dependency for error_codes.proto (#21084)
* fix activation_functions deps, test=develop, test=document_fix
* add error_codes_proto deps, test=develop, test=document_fix
* try delete enforce.h, test=develop, test=document_fix
* change cuda enforce & add example (#21142)
test=release/1.6
-
Committed by liym27
[cherry-pick] fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997) (#21225)
* fix bug in pool/conv/conv_transpose:
  1. It should be stride[i], not stride[0], in UpdatePaddingAndDilation;
  2. fix bug of func _get_padding_with_SAME in test_conv/conv_transpose_op.py;
  3. fix bug of the computation process in function conv2dtranspose_forward_naive.
test=release/1.6
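The stride[i] versus stride[0] distinction matters whenever the strides differ across spatial dimensions. The sketch below uses the commonly used "SAME" padding convention (output size is ceil(input / stride)); the helper name `same_padding` and its parameters are hypothetical, and the formula is an assumption about the convention rather than a copy of UpdatePaddingAndDilation or _get_padding_with_SAME.

```python
def same_padding(input_shape, kernel_size, strides):
    """Compute [before, after] padding per spatial dimension for SAME padding.

    Each dimension i uses its own strides[i]; reusing strides[0] for every
    dimension gives wrong padding when the strides are not all equal.
    """
    padding = []
    for in_size, k, s in zip(input_shape, kernel_size, strides):
        out_size = (in_size + s - 1) // s              # ceil(in_size / s)
        pad_sum = max((out_size - 1) * s + k - in_size, 0)
        pad_before = pad_sum // 2
        pad_after = pad_sum - pad_before               # extra pixel goes at the end
        padding.extend([pad_before, pad_after])
    return padding


if __name__ == "__main__":
    # Per-dimension strides (1, 2) on a 5x8 input with a 3x3 kernel.
    print(same_padding([5, 8], [3, 3], [1, 2]))
    # What the second dimension would get if stride[0] were reused by mistake.
    print(same_padding([5, 8], [3, 3], [1, 1]))
```

With strides (1, 2) the second dimension needs one padded element in total, whereas reusing the first stride would pad it by two.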
-
- 13 Nov, 2019 1 commit
-
-
Committed by bingyanghuang
-
- 07 Nov, 2019 1 commit
-
-
Committed by Adam
* Add asymmetric padding support for mkldnn pooling test=develop
* Add asymmetric padding support for mkldnn conv test=develop
* Add asymmetric padding support for mkldnn conv_transpose test=develop
-
- 04 Nov, 2019 1 commit
-
-
Committed by Chen Weihang
test=release/1.6
-
- 01 Nov, 2019 4 commits
-
-
Committed by Zhang Ting
fix the bug of conv_transpose cudnn kernel: cherry-pick #20958
-
Committed by liym27
* [cherry-pick] fix bug in reshape: (#20781)
  consider the situation that shape of input can contain more than one -1.
* [cherry-pick] support Tensor for split and concat, support -1 in num_or_sections, add check num_or_sections (#20780)
* improve split and concat op:
  1. support Tensor for argument 'dim' in split op.
  2. support Tensor for argument 'axis' in concat op.
* redefine function GetDataFromTensor and set unknown output shape to -1.
* add check: Attr(sections) match Input(X).
* support Tensor for attr(sections) and attr(sections) can contain -1.
* modify error message and fix bug for concat and call Resize only when necessary. test=release/1.6
* [cherry-pick] improve unsqueeze op to support int, Tensor for argument axes (#20824)
* improve unsqueeze op to support int, Tensor and Tensor list for argument axes.
* call Resize only when necessary. test=release/1.6
* [cherry-pick] Compatible int32 and int64 for attr in concat/split/unsqueeze. test=release/1.6 (#20912)
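To illustrate what supporting -1 in the sections of a split, together with a check that the sections match the input, might involve, here is a minimal sketch. The helper `resolve_sections` is hypothetical, and the rule that at most one entry may be -1 is an assumption made for this example; it is not confirmed by the commit message.

```python
def resolve_sections(dim_size, sections):
    """Replace a single -1 in `sections` with the remaining size along the split axis.

    Raises ValueError if the sections cannot match `dim_size`.
    """
    unknown = [i for i, s in enumerate(sections) if s == -1]
    if len(unknown) > 1:
        # Assumption for this sketch: only one section may be inferred.
        raise ValueError("at most one section may be -1")
    known = sum(s for s in sections if s != -1)
    resolved = list(sections)
    if unknown:
        if known > dim_size:
            raise ValueError("known sections exceed the dimension size")
        resolved[unknown[0]] = dim_size - known
    elif known != dim_size:
        raise ValueError("sections must sum to the dimension size")
    return resolved


if __name__ == "__main__":
    print(resolve_sections(10, [2, -1, 3]))
```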
-
Committed by 123malin
* update pserver decay blocks
* update distributed notify handler
-
Committed by Chengmo
* Fix Paddle Cloud role maker (#20860)
-
- 31 Oct, 2019 3 commits
-
-
Committed by Wilber
- fix jit::matmul bug: input x has shape (m, k), weight has shape (k, n)
-
Committed by Zhang Ting
[cherry-pick] fix the bug of conv_transpose: compatible with AnyLayout setting, test=release/1.6 (#20897) (#20918)
-
Committed by Pei Yang
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733) (#20902)
* fix pool2d trt converter, test=develop
* add fix for split op converter, test=develop
-