- 24 3月, 2021 1 次提交
-
-
由 Shang Zhizhou 提交于
* fix conflict * fix compile error * cherry-pick #31316 * Refine cudnn softmax (#25757) * refine cudnn softmax * Trt elementwise plugin serialize (#31587) * add serialize unittest * fix element_op trt plugin serialize bug * remove PassVersionChecker.IsCompatible * fix unittest Co-authored-by: NPei Yang <peiyang@baidu.com> Co-authored-by: NGaoWei8 <53294385+GaoWei8@users.noreply.github.com>
-
- 07 12月, 2020 1 次提交
-
-
由 Shang Zhizhou 提交于
* cherry-pick PR #27933 * fix: cuda version is in varibale CUDA_VERSION in 1.8 cuda.cmake * close unittest failed temporarily * cherry-pick PR #27544, fix layer_norm and softmax bug in tensorRT
-
- 10 2月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
[Refine Paddle-TRT INT8]: Support PaddleSlim's Resnet50, Mobilenetv1, Yolov3 models for Inference. (#22483) * add int8 op teller for trt. * refine trt int8 * add int8 op teller for trt. test=develop
-
- 25 5月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* fluid int8 train and trt int8 predict align. trt int8 predict init op converter * 2. align fluid int8 train and trt int8 inference. enhance quant dequant fuse pass enhance op converter, trt engine, trt engine op, trt subgraph pass. * 3. add delete_quant_dequant_pass for trt test=develop * 4. add the missing file test=develop * 5. i modify the c++ interface, but forget to modify the pybind code fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter test=develop
-
- 12 11月, 2018 1 次提交
-
-
由 nhzlx 提交于
-
- 08 11月, 2018 1 次提交
-
-
由 minqiyang 提交于
Fix code to support cpplint syntax check test=develop
-
- 09 8月, 2018 1 次提交
-
-
由 nhzlx 提交于
-
- 25 7月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 24 7月, 2018 2 次提交
- 07 6月, 2018 2 次提交
-
-
由 Luo Tao 提交于
-
由 Yan Chunwei 提交于
-
- 06 6月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 01 6月, 2018 1 次提交
-
-
由 fengjiayi 提交于
-
- 14 5月, 2018 1 次提交
-
-
由 Yan Chunwei 提交于
-
- 03 5月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 27 4月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 25 4月, 2018 2 次提交
- 23 4月, 2018 1 次提交
-
-
由 Luo Tao 提交于
-
- 26 2月, 2018 2 次提交
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 09 1月, 2018 1 次提交
-
-
由 Yiqun Liu 提交于
* Add Seq2BatchFunctor, which will be used in WarpCTCOp. * Implement WrapCTCFunctor and WrapCTCKernel. * Add unittest of warpctc_op. * Modify the check_output inferface in python unittest framework to allow check a subset of outputs. * Use absolute offset lod in warpctc_op and related functors. * Refine the comments of warpctc_op. * The new python unittest supports checking a subset of the outputs, so revoke the previous change. * Rename the transform from LoDTensor to Tensor with shape [max_sequence_length, num_sequences, sequence_width] to PaddingSequenceFunctor. * Update to the newest codes. * Rename the PaddingSequenceFunctor to PaddingLoDTensorFunctor and remove the computation of dimensions out of the functos.
-
- 04 8月, 2017 1 次提交
-
-
由 liaogang 提交于
-
- 11 7月, 2017 1 次提交
-
-
由 Yu Yang 提交于
-