- 24 4月, 2020 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 21 4月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
* cherry-pick,Optimize the error messages of paddle CUDA API * fix the error messages of paddle CUDA API * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL * remove build_ex_string
-
- 10 4月, 2020 1 次提交
-
-
由 littletomatodonkey 提交于
add addmm op
-
- 26 3月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* add dynamic plugin support. test=develop * change emb eltwise layernorm to math function test=develop * add emb eltwise layernorm test=develop * can run dynamic shape ernie test=develop * fix ci test=develop * add ut for trt ernie dynamic test=develop * refine dynamic shape c++ interface. test=develop * fix comments test=develop * fix comments test=develop
-
- 11 3月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* 1. add embedding eltwise layernorm fuse 2. add embedding eltwise layernorm op 3. refine inplace_add_relu 4. refine fc_eltwise_layernorm test=develop * 1. refine fc test=develop * fix comments test=develop * fix comments test=develop
-
- 28 2月, 2020 1 次提交
-
-
由 tianshuo78520a 提交于
-
- 23 2月, 2020 1 次提交
-
-
由 tianshuo78520a 提交于
-
- 10 2月, 2020 1 次提交
-
-
由 Yiqun Liu 提交于
test=develop
-
- 07 2月, 2020 1 次提交
-
-
由 Zhong Hui 提交于
Fix the integer overflow problem in the op of sequence2batch, change the int32_t to size_t, In the /paddle/fluid/operators/math/sequence2batch.h#L122.
-
- 06 2月, 2020 1 次提交
-
-
由 Yiqun Liu 提交于
Correct the use of DeviceContext in unittest sequence_pooling_test and sequence_padding_test (#22456) * Add log in memory::Copy for debug purpose. * Change to use context in DeviceContextPool directly in sequence_pooling_test, instead to new one. * Change to use context in DeviceContextPool directly in sequence_padding_test, instead to new one. test=develop * Change the type of second_dim from size_t to int64_t. test=develop
-
- 19 1月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 07 1月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
-
- 04 1月, 2020 1 次提交
-
-
由 Kaipeng Deng 提交于
-
- 23 12月, 2019 1 次提交
-
-
由 GaoWei8 提交于
test=develop
-
- 11 12月, 2019 1 次提交
-
-
由 GaoWei8 提交于
test=develop
-
- 02 12月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* fix -Wno-error=sign-compare warning in gcc8 test=develop * fix warning in distributed codes test=develop
-
- 28 11月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* remove -Wno-error=sign-compare, make warning as error test=develop test=document_fix * fix exist compile warning test=develop
-
- 27 11月, 2019 1 次提交
-
-
由 GaoWei8 提交于
test=develop
-
- 26 11月, 2019 1 次提交
-
-
由 GaoWei8 提交于
* Add fc padding to solve mkl performance test=develop * fix gpu pass and error information test=develop * fix fc_fuse_pass_test test=develop * fix error information test=develop * fix error information test=develop * fix name and add fc op padding test test=develop * fix attributes test=develop * optimize fc padding test=develop * fix test test=develop
-
- 22 11月, 2019 1 次提交
-
-
由 Liufang Sang 提交于
* add int8 kernel to lookup_table op and add dequantize op test=develop * change paddle_enforce to paddle_enforce_eq test=develop * change copyright and change some not suitable code test=develop * remove debug log test=develop * replace GetInputType with IndicateVarDataType test=develop * fix EmptyGradMaker test=develop * fix diff between cpu and gpu test=develop * use memcopy when int8_t test=develop
-
- 14 11月, 2019 1 次提交
-
-
由 whs 提交于
-
- 12 11月, 2019 1 次提交
-
-
由 lilong12 提交于
* set the default value of alpha for prelu to 0.25, test=develop * add the call to __syncthreads(), test=develop * fix the implementation of cpu prelu, test=develop * repair the implementation of element mode prelu, test=develop * modify test_prelu_op.py, test=develop
-
- 08 11月, 2019 1 次提交
-
-
由 Chen Weihang 提交于
* fix activation_functions deps, test=develop, test=document_fix * add error_codes_proto deps, test=develop, test=document_fix * try delete enforce.h, test=develop, test=document_fix
-
- 05 11月, 2019 2 次提交
-
-
由 zhaoyuchen2018 提交于
ocr_recognition fails, so add a path to handle small frame_size. test=develop
-
由 Tao Luo 提交于
test=develop
-
- 01 11月, 2019 1 次提交
-
-
由 zhaoyuchen2018 提交于
seems shuffle_sync cannot handle small size test=develop Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
- 31 10月, 2019 2 次提交
-
-
由 Zhang Ting 提交于
* maxout support channel_last input, test=develop * modified details of Input(X) and Attr(groups, axis) in doc, test=develop
-
由 Zhang Ting 提交于
-
- 30 10月, 2019 1 次提交
-
-
由 zhang wenhui 提交于
-
- 28 10月, 2019 1 次提交
-
-
由 Aurelius84 提交于
-
- 23 10月, 2019 1 次提交
-
-
由 Pei Yang 提交于
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733) * fix pool2d trt converter, test=develop * add fix for split op converter, test=develop
-
- 16 10月, 2019 1 次提交
-
-
由 qingqing01 提交于
* Support fp16 in fused_elemwise_activation_op. * Fix unit testing in ONLY-CPU mode.
-
- 13 10月, 2019 1 次提交
-
-
由 Zhang Ting 提交于
-
- 09 10月, 2019 1 次提交
-
-
由 liym27 提交于
* Delete PadFuntion, include padding.h instead. test=develop * move function(IsSymmetricPadding) from conv_cudnn_op.cu/conv_transpose_cudnn_op.cu to padding.h, test=develop
-
- 07 10月, 2019 1 次提交
-
-
由 Zhang Ting 提交于
-
- 30 9月, 2019 1 次提交
-
-
由 danleifeng 提交于
Improve elementwise operators performance in same dimensions
-
- 29 9月, 2019 1 次提交
-
-
由 liym27 提交于
1.support asymmetric padding; 2.support padding algorithm:"SAME" and "VALID"; 3.support channel_last: data_format NHWC and NDHWC; 4.change doc of python API and c++; test=develop, test=document_preview
-
- 28 9月, 2019 1 次提交
-
-
由 liym27 提交于
* fix pool2d pool3d: 1. support asymmetric padding; 2. support padding algorithm:"SAME" and "VALID"; 3. support channel_last: data_format NHWC and NDHWC; 4. support inferring shape when input with negative dims in compile time; 5. change doc of python API and c++; 6. fix bug in cuda kernel when Attr(adaptive) is true. test=develop,test=document_preview * fix 'tensors' to 'Tensors'. test=develop,test=document_preview * add test for converage ValueError.test=develop,test=document_preview * resolve conflict in test_pool2d. test=develop
-
- 27 9月, 2019 1 次提交
-
-
由 chengduo 提交于
* make pad and split support fp16 test=develop
-
- 25 9月, 2019 1 次提交
-
-
由 Bob Zhu 提交于
* add support of matmul with multiple head even different width and height Original matmul with multiple head supports only the mat_a.width == mat_b.height, in that case, mat_b will be horizontally split. In this patch, we extend the support when mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height, in this case, mab_b will be vertically split. One example is A is [3, 8], B is [2, 16], head_number is 4. In this case, A will be split as [3, 2], B will be (vertically) split as [2, 4]. The final result will be 4 matrix of 4 matrix of [3,4], i.e. [3, 16] test=develop * add support of matmul with multiple head even different width and height Original matmul with multiple head supports only the mat_a.width == mat_b.height, in that case, mat_b will be horizontally split. In this patch, we extend the support when mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height, in this case, mab_b will be vertically split. One example is A is [3, 8], B is [2, 16], head_number is 4. In this case, A will be split as [3, 2], B will be (vertically) split as [2, 4]. The final result will be 4 matrix of 4 matrix of [3,4], i.e. [3, 16] test=develop * refactor the code of matmul with multiple head even different width and height test=develop
-