- 02 Apr, 2021 2 commits
- 01 Apr, 2021 1 commit
-
-
Committed by zlsh80826
* add anchor generator op plugin
* add anchor generator unit_test
* remove dbg info
* remove redundant line
* replace assertion with paddle enforce
* dynamic plugin replaces assertion with paddle enforce
* anchor generator support dynamic shape on spatial axis
* anchor generator test with fp16, dynamic shape
* add anchor generator test all
* add back main
* reduce test input size to not exceed the time limit of ci
* change super to InferencePassTest for python2 compatibility
* reuse paddle operator anchor generator
* move creator construct to header with default
* add cuda ifdef
* reduce line
* change super to InferencePassTest for python2 compatibility
* fix anchor generator fp16 serialize setting
* split unittest from test_all
* restrict anchor generator input format before version 7234
* anchor generator only support greater than trt7.1
* change min_graph_size to 2
* min_graph size to 3 if dynamic shape
* reduce dynamic shape size to avoid trt search tactic too long to exceed time limit
* remove anchor from fetch list
* anchor generator support all trt version
* fix memory not allocated but if serialized
-
- 30 Mar, 2021 2 commits
-
-
Committed by Shang Zhizhou
* fix batchnorm when input dims < 3
* add unittest for batchnorm dims = 2
-
Committed by zlsh80826
* yolobox converter and plugin
* yolobox unittest
* add dynamic shape restriction
* fix git merge log
-
- 29 Mar, 2021 2 commits
-
-
Committed by zlsh80826
* add roi_align_plugin
* add roi align unit_test
* add roi align serialization
* remove roi align static plugin because of batch dim issue
* refine roi align unittest and add fp16/serialization
* add trt roi align condition to op_teller
* refine error message
* remove unnecessary reshape layer
-
Committed by zlsh80826
* trt affine channel converter
* add trt affine channel base test
* add trt affine channel NHWC
* remove asterisk for python2 compatibility
* trt affine channel converter
* add trt affine channel base test
* add trt affine channel NHWC
* remove asterisk for python2 compatibility
* fix rebase
* move LodTensor to Tensor
* add dbg info
* affine channel converter only support NCHW
* scale,bias are parameters, use create_parameters api
* reduce test input size to not exceed the time limit of ci
* refine affine channel unittest and add serialization/dynamic test
* change super to InferencePassTest for python2 compatibility
* change super to InferencePassTest for python2 compatibility
* fix affine channel fp16 serialize setting
-
- 26 Mar, 2021 1 commit
-
-
Committed by zlsh80826
* add multiclass_nms
* add multiclass_nms unittest
* add default enable_tensorrt_oss option
* refine multiclass nms unittest and add serialization/dynamic test
* change super to InferencePassTest for python2 compatibility
* refine multiclass nms unittest
* move out dynamic shape test due to ci time limit
-
- 23 Mar, 2021 2 commits
-
-
Committed by Wilber
-
Committed by Shang Zhizhou
* fix tensorrt output variable reshape
* move padding shape x 1 x 1 in ernie to qkv and fc
* update layer name
* fix softmax when input is dynamic, fc not padding any more
* fix varlen
* move fc x_dim assert to op_teller
-
- 22 Mar, 2021 1 commit
-
-
Committed by zlsh80826
* nearest_interp op converter w/ dynamic/static
* fix data_layout include
* add trt nearest unit_test
* add nearest_interp NHWC test
* update trt nearest interp nhwc testcase
* remove asterisk for python2 compatibility
* add empty line to prevent conflict
* nearest_interp op converter w/ dynamic/static
* fix data_layout include
* add trt nearest unit_test
* add nearest_interp NHWC test
* update trt nearest interp nhwc testcase
* remove asterisk for python2 compatibility
* add empty line to prevent conflict
* change the priority of out_h, out_w
-
- 18 Mar, 2021 2 commits
- 12 Mar, 2021 1 commit
-
-
Committed by Shang Zhizhou
* add serialize unittest
* fix element_op trt plugin serialize bug
-
- 10 Mar, 2021 1 commit
-
-
Committed by Shang Zhizhou
-
- 03 Mar, 2021 1 commit
-
-
Committed by Pei Yang
-
- 02 Mar, 2021 3 commits
-
-
Committed by Shang Zhizhou
-
Committed by Pei Yang
* add n-d input support for trt scale converter
* add flatten for ut
* fix dims
-
Committed by Qi Li
-
- 24 Feb, 2021 1 commit
-
-
Committed by Pei Yang
* add group norm plugin
* fix compile problems
* move concat axis check to trt op teller
* add nbDims for scale and bias nv dims
* add group norm unit test
* fix unittest
* add trt version restriction for group norm op teller
* fix unittest
-
- 22 Feb, 2021 1 commit
-
-
Committed by Shang Zhizhou
* update trt int8 calibrator to IEntropyCalibratorV2
* add delete opt_cache for trt_split_converter_test
-
- 19 Feb, 2021 1 commit
-
-
Committed by Wilber
-
- 18 Feb, 2021 1 commit
-
-
Committed by Pei Yang
-
- 04 Feb, 2021 2 commits
-
-
Committed by Shang Zhizhou
* fix split trt plugin initialize
* update
-
Committed by wanghuancoder
* use iwyu clean include second time, test=develop
-
- 02 Feb, 2021 1 commit
-
-
Committed by Shang Zhizhou
* fix trt plugin clone and initialize bugs
* fix unit test error
* enable trt in ci py3
* update unittest timeout
-
- 25 Jan, 2021 1 commit
-
-
Committed by Shang Zhizhou
* add dla
* add dla done
* add python api

Co-authored-by: shangzhizhou <root@szth-rp-fanyi-opera49.szth.baidu.com>
-
- 19 Jan, 2021 1 commit
-
-
Committed by Leo Chen
* unify calling cudaSetDevice
* fix compile
-
- 13 Jan, 2021 1 commit
-
-
Committed by alncat
* added support for inference using quantization aware trained dygraph
* added support for inference using quantization aware trained dygraph correct boost get usage
* Delete incorrect warning message (#30196)
* fix warning and no grad
* clean redundant API alias in 2.0 - part 2 (#30013)
* delete paddle.nn.functional.assign
* fix dynamic to static error
* just add the op error message for the matmul xpu (#30246) add the op error message for the matmul xpu
* Add Static Variable Clone (#30208) Add clone method for static Variable so that this interface will be same as dygraph. It fixed some bugs in dy2stat
* use wget to replace curl to download the lcov file (#30229)
* use wget to replace curl to download the lcov file
* add cache for lcov
* fix test_pool3d_op timeout issue (#30248)
* Fix unittests bugs. (#30250)
* modify error message based on comments (#30189)
* modify error message based on comments
* edit code according to review.
* Correct spelling according to review.
* Fix bug for 'save multiple method' (#30218)
* Fix bug for 'save multiple method'
* To pass coverage.
* edit code to pass coverage.
* edit code to pass coverage.
* add unittest for coverage.
* change for coverage.
* edit for coverage.
* added support for inference using quantization aware trained dygraph
* Alias from paddle.fluid.layers.auc to paddle.static.auc (#30206)
* add alias from fluid.layers.auc to static.auc
* Update __init__.py
* added support for inference using quantization aware trained dygraph correct boost get usage
* corrected boost get usage
* corrected naming issues and enforcing zero check
* correct paddle enforce message
* added more error checkings
* corrected error report message and optimized code
* corrected findvar usage
* corrected paddle_enforce in scope
* correct error messages
* correct error reporting format

Co-authored-by: LielinJiang <50691816+LielinJiang@users.noreply.github.com>
Co-authored-by: XiaoguangHu <46782768+XiaoguangHu01@users.noreply.github.com>
Co-authored-by: wawltor <fangzeyang0904@hotmail.com>
Co-authored-by: Huihuang Zheng <zhhsplendid@gmail.com>
Co-authored-by: YUNSHEN XIE <1084314248@qq.com>
Co-authored-by: Bai Yifan <me@ethanbai.com>
Co-authored-by: gongweibao <weibao.gong@gmail.com>
Co-authored-by: WeiXin <weixin10@baidu.com>
Co-authored-by: Jiaqi Liu <liujiaqi06@baidu.com>
-
- 11 Jan, 2021 1 commit
-
-
Committed by WeiXin
* modify error message based on comments
* edit code according to review.
* Correct spelling according to review.
-
- 08 Jan, 2021 1 commit
-
-
Committed by Wilber
-
- 08 Dec, 2020 1 commit
-
-
Committed by Pei Yang
* change hard_swish from plugin to layer
* add ut when threshold != scale
-
- 07 Dec, 2020 1 commit
-
-
Committed by Pei Yang
-
- 27 Nov, 2020 1 commit
-
-
Committed by Shang Zhizhou
* remove -DSUPPORTS_CUDA_FP16 in cuda.cmake
* compile with cuda9
* add some unittest
* notest;test=coverage
* add unittest for trt plugin swish && split
* update ernie unittest
* fix some error message
* remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter
* fix compile error when CUDA_ARCH_NAME < Pascal
* fix compile error
* update unittest timeout
* compile with cuda9
* update error msg
* fix code style
* add some comments
* add define IF_CUDA_ARCH_SUPPORT_FP16
* rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED
-
- 23 Nov, 2020 1 commit
-
-
Committed by Pei Yang
* change avg pooling and global pooling to trt layer
* add support for static shape global pooling
* modify trt errmsg
-
- 12 Nov, 2020 1 commit
-
-
Committed by Shang Zhizhou
* skip_layernorm_op done
* add unittest
* slice op converter support trt < 6
* skip_layernorm only work in ernie
-
- 03 Nov, 2020 1 commit
-
-
Committed by Shang Zhizhou
* fp16 result ok
* change -DWITH_NVINFER_PLUGIN to config.EnableTensorRtOSS
* auto detect special slice op converter for ernie with trt oss
* ernie oss only support fp16
* fix special_slice_plugin serialize bug
* matmul in tensorrt ok
* ernie unittest ok
* add matmul tensorrt unittest
* remove demo code
-
- 21 Oct, 2020 1 commit
-
-
Committed by Pei Yang
-
- 13 Oct, 2020 1 commit
-
-
Committed by Shang Zhizhou
* add info log for trt input dynamic shape check
* fix error msg error
-
- 28 Sep, 2020 1 commit
-
-
Committed by Pei Yang
* add unittests and op version register for tensorrt_subgraph_pass
* rename to test_trt_subgraph_pass.py
* fix softmax converter diff when padding dim=1
-