- 12 7月, 2021 1 次提交
-
-
由 zlsh80826 提交于
* add trt LT version helper * upgrade PluginTensorRT to IPluginV2Ext * trt plugin factory is not usable in IPluginV2 * upgrade add plugin api to use IPluginV2 * remove IPlugin register and adapt getSerializeSize(), serialize() * adapt IPluginV2Layer * downgrade to IPluginV2 * implement elementwise clone * add gelu plugin creator and fix gelu serialization bug * add swish plugin creator and fix swish serialization bug * format * fix typo * add elementwise plugin creator and fix serialization * add base creator class * add gelu plugin creator * add hard swish creator and fix serialization * add instance norm creator and fix serialization * add layer norm creator and fix serialization * add pool creator and fix serialization * add prelu creator and fix serialization * add slice creator and fix serialization * add swish creator and fix serialization * add instance norm op unittest * remove redundent api * fix wrong graph size to enable trt * instance norm function move to cc * add trt elementwise ut to trigger coverage * remove opt cahce to hit serialization coverage * remove opt cahce to hit serialization coverage * remove unused code * remove unused inputs_ * add dbg info * remove dbg info * add instance norm serialization * roll back * remove comment code * remove trt plugin registery * fix prelu dynamic serialization * add prelu ut and reduce the input size to reduce memory usage * fix pool dynamic plugin serialization and add ut * refine pool ut with subtest * add env for avoiding oom * reduce test input size & increase pool op ut to 45s * add the contributor * remove copyright (will add in contributor) * remove copyright (will add in contributor)
-
- 24 6月, 2021 1 次提交
-
-
由 zlsh80826 提交于
* add trt LT version helper * remove deprecated nvinfer1::DimsCHW and replace it to nvinfer1::Dims3 * remove deprecated nvinfer1::DimsNCHW and replace it to nvinfer1::Dims4 * update deserialize engine * update to createNetworkV2 * update to createNetworkV2 * update buildWithConfig and remove redundent config settings * replace createNetwork to createNetworkV2 * fix int8 * addMatrixMultiply * remove unnecessary const cast * IBuilder->setInt8Calibrator() is deprecated * auto enable fp16 when using int8 * remove the redundant line
-
- 23 4月, 2021 1 次提交
-
-
由 wenbin 提交于
* move semantic checks to op_teller * more ops * more ops * revert block related change * part1 * revert activation * remove if * remove const_cast * reslove conflict * remove const_cast * delete useless var * replace vlog(1) with vlog(3), replace assert with PADDLE_ENFORCE * down to 19 files
-
- 04 2月, 2021 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include second time, test=develop
-
- 24 9月, 2020 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
- 11 5月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add new macro BOOST_GET_SAFELY & unittests, test=develop * add different macro type, test=develop * fix get macro type in executor, test=develop * four macro part change backup * using one macro for all case, test=develop * revert attribute change, test=develop * change to three func to solve gcc4.8 bug, test=develop * polish some details, test=develop
-
- 12 4月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* add elementwise pool2d, prelu, shuffle channel test=develop * add scale and refine concat eltwise conveter test=develop * refine elementwise converter test=develop * refine ut test and enforce error. test=develop * modify const cast test=develop
-
- 10 4月, 2020 1 次提交
-
-
由 Pei Yang 提交于
-
- 24 7月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* update paddle-trt for: 1. fix bug: when batch > 2, core in split plugin. 2. add leaky_relu trt5.0 support (yolov3 from 65ms to 42ms.) 3. add new attr to dropout. 4. shuffle channel, swish, relu6 support test=develop * 1. fix ci test=develop
-
- 25 5月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* fluid int8 train and trt int8 predict align. trt int8 predict init op converter * 2. align fluid int8 train and trt int8 inference. enhance quant dequant fuse pass enhance op converter, trt engine, trt engine op, trt subgraph pass. * 3. add delete_quant_dequant_pass for trt test=develop * 4. add the missing file test=develop * 5. i modify the c++ interface, but forget to modify the pybind code fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter test=develop
-
- 13 11月, 2018 1 次提交
-
-
由 nhzlx 提交于
-
- 12 11月, 2018 1 次提交
-
-
由 nhzlx 提交于
-
- 08 11月, 2018 1 次提交
-
-
由 minqiyang 提交于
Fix code to support cpplint syntax check test=develop
-
- 13 9月, 2018 2 次提交
- 24 8月, 2018 1 次提交
-
-
由 nhzlx 提交于
-
- 25 7月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 24 7月, 2018 2 次提交
- 07 6月, 2018 2 次提交
-
-
由 Luo Tao 提交于
-
由 Yan Chunwei 提交于
-
- 06 6月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 01 6月, 2018 1 次提交
-
-
由 fengjiayi 提交于
-
- 14 5月, 2018 1 次提交
-
-
由 Yan Chunwei 提交于
-
- 03 5月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 27 4月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 25 4月, 2018 2 次提交
- 23 4月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 26 2月, 2018 2 次提交
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 09 1月, 2018 1 次提交
-
-
由 Yiqun Liu 提交于
* Add Seq2BatchFunctor, which will be used in WarpCTCOp. * Implement WrapCTCFunctor and WrapCTCKernel. * Add unittest of warpctc_op. * Modify the check_output inferface in python unittest framework to allow check a subset of outputs. * Use absolute offset lod in warpctc_op and related functors. * Refine the comments of warpctc_op. * The new python unittest supports checking a subset of the outputs, so revoke the previous change. * Rename the transform from LoDTensor to Tensor with shape [max_sequence_length, num_sequences, sequence_width] to PaddingSequenceFunctor. * Update to the newest codes. * Rename the PaddingSequenceFunctor to PaddingLoDTensorFunctor and remove the computation of dimensions out of the functos.
-
- 04 8月, 2017 1 次提交
-
-
由 liaogang 提交于
-
- 11 7月, 2017 1 次提交
-
-
由 Yu Yang 提交于
-