- 24 2月, 2021 1 次提交
-
-
由 alncat 提交于
* added support for fake_quantize_dequantize_abs_max op in quantization inference pass * remove const_cast to pass ci * remove compare operator to pass ci-coverage * added detailed error message for unregistered tensorrt_subgrah_pass
-
- 02 2月, 2021 1 次提交
-
-
由 alncat 提交于
* fixed compilation error on gcc 4.8.x due to the usage of isfinite (#30733) * modified conv+bn fuse pass to fix wrong mask in mask rcnn (#30704)
-
- 14 1月, 2021 2 次提交
- 13 1月, 2021 2 次提交
-
-
由 JZ-LIANG 提交于
-
由 石晓伟 提交于
* Register op version for grid_sampler, test=op_version (#29916) * add op version for fake_quant and fake_dequant ops, test=op_version (#29923) * Register op version for print, test=op_version (#29945) * add gru op_register_version; test=op_version; (#29931) * Register op version for coalesce_tensor. (#29940) * register op version for conv2d_transpose, conv3d_transpose and depthwise_conv2d_transpose, test=op_version (#29937) * add op_register_version for allclose op; test=op_version (#29968) * register ModifyAttr for instance_norm, test=op_version (#29938) * add op_version for flip op [test=op_version] (#30019) * add the op version check for the elementwise ops, test=op_version (#30010) * add the support the op version check for matmul, test=op_version (#30011) * Revert "register ModifyAttr for instance_norm, test=op_version (#29938)" * add REGISTER_OP_VERSION for generate_proposals, roi_align, roi_pool test=op_version (#30034) * Fix rank_attention op_version, test=op_version (#30006) * fix rank_attention, test=op_version * Register op version for linspace,test=op_version (#30025) * fix op_register_version for compare ops, test=op_version (#30007) Co-authored-by: Nzhoushunjie <zhoushunjie@baidu.com> * register ModifyAttr for instance_norm, test=op_version (#30065) * register instance norm, test=op_version * add trace op_register_version and fix version bug; test=op_version (#30000) * fix a bug in op_version_registry, test=develop, test=op_version (#29994) * Add version checking, test=op_version (#30129) * fix a bug in gaussian_random_op version, test=release/2.0 Co-authored-by: NLielinJiang <50691816+LielinJiang@users.noreply.github.com> Co-authored-by: Ncc <52520497+juncaipeng@users.noreply.github.com> Co-authored-by: NQi Li <qili93@qq.com> Co-authored-by: NJack Zhou <zhoushunjie@baidu.com> Co-authored-by: NGuo Sheng <whucsgs@163.com> Co-authored-by: Nwangxinxin08 <69842442+wangxinxin08@users.noreply.github.com> Co-authored-by: Nwawltor <fangzeyang0904@hotmail.com> Co-authored-by: NFlyingQianMM <245467267@qq.com> Co-authored-by: Nceci3 <ceci3@users.noreply.github.com> Co-authored-by: Nhutuxian <hutuxian2011@sina.cn> Co-authored-by: Nchalsliu <45041955+chalsliu@users.noreply.github.com> Co-authored-by: Nwangguanzhong <jerrywgz@126.com> Co-authored-by: NShenLiang <shenliang03@baidu.com> Co-authored-by: Nyinhaofeng <66763551+yinhaofeng@users.noreply.github.com> Co-authored-by: Nchannings <chenlingchi@baidu.com> Co-authored-by: Nchentianyu03 <chentianyu03@baidu.com> Co-authored-by: Nruri <shipeng1108@163.com>
-
- 12 1月, 2021 1 次提交
-
-
由 wangchaochaohu 提交于
* reduce the occupied size of memory for the fused pattern of elementwise_add Op and activation Op(relu Op for example) (#29885) * register OPMaker and Infer Shape Check for fused_elementwise_add (#30259)
-
- 11 1月, 2021 1 次提交
-
-
由 WangXi 提交于
* Optimization grad merge performance (#29784) * [fleet] combine amp and gradient merge, test=develop (#30086) * fix assign_op_xpu concat_op_xpu warining (#30120) Co-authored-by: Nliuyuhui <liuyuhui@baidu.com>
-
- 07 1月, 2021 1 次提交
-
-
由 liuyuhui 提交于
-
- 05 1月, 2021 1 次提交
-
-
由 cc 提交于
-
- 04 1月, 2021 1 次提交
-
-
由 Shang Zhizhou 提交于
-
- 29 12月, 2020 3 次提交
-
-
由 liuyuhui 提交于
* [Kunlun] PR1:Support one Kunlun card training in parallel executor (#29337) * [Kunlun] PR2: Support MultiDevicePass and BKCL in parallel executor (#29574) * [Kunlun] bug fix of PR2: Support MultiDevicePass and BKCL in parallel executor (#29926) * add bkcl.so in whl for kunlun (#29947) * [Kunlun] bug fix of PR2: Support MultiDevicePass and BKCL in parallel executor (#29961) Co-authored-by: NQingshuChen <qingshu.chen714@gmail.com>
-
由 cc 提交于
-
由 Wilber 提交于
-
- 07 12月, 2020 1 次提交
-
-
由 cc 提交于
-
- 30 11月, 2020 1 次提交
-
-
由 Wojciech Uss 提交于
-
- 26 11月, 2020 2 次提交
-
-
由 joanna.wozna.intel 提交于
* Add bf16 pool2d and unify bf16 unit tests * Add change default ops test
-
由 joanna.wozna.intel 提交于
* Fix cpu_bfloat16_pass * Add output_format * Fix incorrect SetOutput * Change fromating
-
- 25 11月, 2020 1 次提交
-
-
由 Wojciech Uss 提交于
* Add multi_gru_fuse_pass and tests * fix date * cleaned up headers
-
- 24 11月, 2020 1 次提交
-
-
由 Wojciech Uss 提交于
* Add multi_gru_seq_fuse_pass and tests * fix date * removed unused functions
-
- 20 11月, 2020 2 次提交
-
-
由 joanna.wozna.intel 提交于
* Add bf16 matmul, fc, elementwise add and mul * Correct unit test
-
由 Wojciech Uss 提交于
-
- 17 11月, 2020 1 次提交
-
-
由 Jacek Czaja 提交于
-
- 16 11月, 2020 3 次提交
-
-
由 joanna.wozna.intel 提交于
* Add checkpoint to quantize * Change bfloat16 option
-
由 lidanqing 提交于
* add mkldnn inplace op version test * update mkldnn_inplace fuse pass * update the inplace test
-
由 Leo Chen 提交于
* register op_version for addto * upgrade pass capability * change eq to le * change eq to le * fix merge
-
- 12 11月, 2020 2 次提交
-
-
由 Shang Zhizhou 提交于
* skip_layernorm_op done * add unittest * slice op convertor support trt < 6 * skip_layernorm only work in ernie
-
由 lidanqing 提交于
-
- 11 11月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
* upgrade pass capability
-
- 10 11月, 2020 2 次提交
-
-
由 Pei Yang 提交于
* paddle-trt support mul channelwise quant * add support for depthwise_conv2d * add errmsg for unsupported op type
-
由 YUNSHEN XIE 提交于
-
- 09 11月, 2020 1 次提交
-
-
由 YUNSHEN XIE 提交于
-
- 08 11月, 2020 1 次提交
-
-
由 YUNSHEN XIE 提交于
* disable ut test_parallel_executor_fetch_isolated_var,test=document_fix * test for limiting ut exec time as 15S * fix an error caused by cannot find ut * fix some error * can not find test_transformer * fix error caused by ut not run in windows * fix error caused by Compiler Options * fix error caused by setting timeout value as 15 in python/paddle/tests/CMakeLists.txt * setting timeout value to 120s for old ut * add the timeout value setting * fix error caused by ut only run in coverage_ci * add analyzer_transformer_profile_tester * fix some error * fix some error * fix error with inference option * fix error with inference option setting as ON_INFER * add some ut to set timeout * modified some option * fix error * fix some timeout error * fix error * fix error * fix timeout for test_analyzer_bfloat16_resnet50 * fix error * setting timeout properity for some ut * first pr for new ut timeout as 15S
-
- 06 11月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
* Add bfloat16 softmax and gelu * Add pass attr bfloat16_enabled_op_types * Changes from review
-
- 05 11月, 2020 1 次提交
-
-
由 Jacek Czaja 提交于
* - Added sum bf16 oneDNN test=develop * - Fix to UT of sum bf16 test=develop
-
- 04 11月, 2020 2 次提交
-
-
由 lidanqing 提交于
* add conv_bias pass version test * update according to reviews
-
由 石晓伟 提交于
* enhance the op_version_registry, test=develop * add unittests, test=develop * enhance the op_version_registry, test=develop * fix bugs, test=develop * revert pybind_boost_headers.h, test=develop * fix a attribute bug, test=develop
-
- 29 10月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
-
- 27 10月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
* add fuse_bn_add_act pass
-
- 26 10月, 2020 1 次提交
-
-
由 Adam Osewski 提交于
-