- 11 1月, 2021 2 次提交
-
-
由 wangchaochaohu 提交于
-
由 AshburnLee 提交于
-
- 10 1月, 2021 1 次提交
-
-
由 wangchaochaohu 提交于
reduce the occupied size of memory for the fused pattern of elementwise_add Op and activation Op(relu Op for example) (#29885)
-
- 06 1月, 2021 1 次提交
-
-
由 石晓伟 提交于
-
- 28 12月, 2020 1 次提交
-
-
由 Jack Zhou 提交于
* add gru op_register_version; test=op_version; * Update fc,mul version;test=op_version;
-
- 14 12月, 2020 1 次提交
-
-
由 Jacek Czaja 提交于
-
- 07 12月, 2020 1 次提交
-
-
由 LoveAn 提交于
* Compiling operator libraries with Unity Build on Windows CPU. * Compiling operator libraries with Unity Build on Windows GPU, no_test, test=windows_ci * Add option in windows ci script, no_test, test=windows_ci * Optimize parallel compiling, test=develop * remove limit of parallel compile and skip some ops in UB, test=develop * remove changes of header file, test=develop * remove changes of header file, test=develop * fix test_eye_op unittest failed, test=develop * Compiling operator libraries with Unity Build on Linux, test=develop * set default WITH_UNITY_BUILD=OFF, test=develop * Move unity build rules into a single file and add comment, test=develop * optimize parallel compilation, test=develop * fix undefined reference error on coverage ci, test=develop
-
- 27 11月, 2020 1 次提交
-
-
由 arlesniak 提交于
-
- 20 11月, 2020 1 次提交
-
-
由 gongweibao 提交于
-
- 19 11月, 2020 1 次提交
-
-
由 Wojciech Uss 提交于
* Add multi_gru op and tests * removed redundant disable_dygraph()
-
- 12 11月, 2020 1 次提交
-
-
由 Shang Zhizhou 提交于
* skip_layernorm_op done * add unittest * slice op convertor support trt < 6 * skip_layernorm only work in ernie
-
- 27 10月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
* add fuse_bn_add_act pass
-
- 14 10月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* polish some error message * add white list * revert shell script change
-
- 09 10月, 2020 2 次提交
-
-
由 Jacek Czaja 提交于
-
由 cc 提交于
-
- 04 10月, 2020 1 次提交
-
-
由 Yiqun Liu 提交于
Polish the error message of fc, fused_fc_elementwise_layernorm and fused_embedding_seq_pool. (#27692) * Polish the error message of fc_op. * Polish the error message of fused_fc_elementwise_layer_norm op. * Polish an error message in fused_embedding_seq_pool_op.
-
- 01 10月, 2020 1 次提交
-
-
由 Wojciech Uss 提交于
-
- 24 9月, 2020 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
- 23 9月, 2020 2 次提交
-
-
由 Shang Zhizhou 提交于
* [bug fix]:Memory increases after adapting the cudnn version to 8 * [bug fix]cudnnGetConvolutionForwardAlgorithm not defined
-
由 Zhang Ting 提交于
* add fused_bn_add_relu op
-
- 22 9月, 2020 1 次提交
-
-
由 石晓伟 提交于
-
- 21 9月, 2020 2 次提交
-
-
由 Shang Zhizhou 提交于
* fix bug sequececonv_eltadd_relu_fuse_pass, output error when sequence_conv's padding_start > 0 * fix seqconv_eltadd_relu_fuse_pass unitest error
-
由 huangxu96 提交于
* Finished ChannelWiseQuantDequantAbsMaxOp and Passed unittests. * Finished channel-wise quantize strategy in imperative quantization. * Added Cuda code of ChannelWiseQuantDequantMaxAbsOP Add Cuda code of ChannelWiseQuantDequantMaxAbsOp * Add quant_axis for channel_wise quant. * fixed a bug in unnitests, which will not trigger axis = 1 case and cannot meet the coverage rate requirement. * Added some assert infomation and fixed some coding style mistakes.
-
- 14 9月, 2020 1 次提交
-
-
由 Adam 提交于
* Add int8 GRU kernel with UTs * Lint fixes * More lint fixes
-
- 12 9月, 2020 1 次提交
-
-
由 lidanqing 提交于
* Fix the lookup_table_v2 failed on GRU mkldnn kernel issue test=develop * fix according to reviews, removed x_num_col_dims test=develop * update gru model. change according to reviews test=develop * change according to reviews test=develop
-
- 10 9月, 2020 1 次提交
-
-
由 Qi Li 提交于
-
- 12 8月, 2020 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 10 8月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
test=develop
-
- 06 8月, 2020 1 次提交
-
-
由 Adam 提交于
* Add oneDNN fusion_gru kernel and fix fc+gru pass test=develop * Formatting changes test=develop * Lint fixes test=develop * Add memory::format_tag::any to GRU weights test=develop * Fix build with CUDA * Fix build with CUDA v2
-
- 05 8月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* cunn8 support test=develop * fix ci error test=develop
-
- 24 7月, 2020 1 次提交
-
-
由 Adam 提交于
-
- 11 5月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add new macro BOOST_GET_SAFELY & unittests, test=develop * add different macro type, test=develop * fix get macro type in executor, test=develop * four macro part change backup * using one macro for all case, test=develop * revert attribute change, test=develop * change to three func to solve gcc4.8 bug, test=develop * polish some details, test=develop
-
- 26 4月, 2020 1 次提交
-
-
由 liuwei1031 提交于
* save InferVarType changes, test=develop * remove code comments, test=develop * tweak code, test=develop * fix compilation warning, update merge_ids_op split_ids_op to new interface, test=develop * modify fused_bn_activation_op, test=develop * fix error of fused_bn_activation_op, test=develop * fix PADDLE_ENFORCE and unittest coverage issue, test=develop * tweak PADDLE_ENFORCE messages, test=develop * improve unittest coverage, test=develop * add StaticGraphInferVarType class, test=develop * rebase develop branch, test=develop * fix unittest error, test=develop * remove comments, test=develop * improve unittest coverage, test=develop * imporve error message and imporve unittest coverage, test=develop * upgrade InferVarType API, test=develop * tweak pyfunc error message, test=develop * fix compilation conflict - save_combine_op, test=develop
-
- 23 4月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* support the head number == 1 test=develop * fix slice op error. test=develop
-
- 21 4月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
-
- 20 4月, 2020 3 次提交
-
-
由 zhaoyuchen2018 提交于
* OP(fusion_gru) error message enhancement. test=develop * refine code, test=develop * Refine inout log, test=develop * Refine description, test=develop
-
由 Zhou Wei 提交于
* Optimize the error messages of paddle CUDA API, test=develop * fix the error messages of paddle CUDA API, test=develop * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop * remove build_ex_string,test=develop * merge conflict,test=develop
-
由 Yiqun Liu 提交于
-
- 15 4月, 2020 2 次提交
-
-
由 yiicy 提交于
-
由 zhaoyuchen2018 提交于
* API(fused_embedding_fc_lstm) error message enhancement. test=develop C++ API enhancement. * Refine code, test=develop * Refine code. test=develop
-