- 09 11月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 05 11月, 2020 1 次提交
-
-
由 Shang Zhizhou 提交于
* Fix TRT plugin registry without TRT lib (#25982) * fix trt plugin registry without trt lib * support trt4 * refine code style * pick ea851796 from develop * cherry-pick develop PR #26273 && #27796 * fix unittest error * fix unittest error * remove const_cast Co-authored-by: NPei Yang <peiyang@baidu.com>
-
- 18 9月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* [Paddle-TRT] Stack op plugin (#25605) * add stack_op to CMakeLists * add dim=3 support for scale op * add trt stack op, test=develop * remove debug message * add stack plugin serialize * remove slice, scale op, will add later * enhence error message * revise trt ernie test to conver the stack op CI testi, test=develop * add stack op serialization * fix test shape after adding stack op * remove slice op, will add after implementing serialization * roll back to min_graph=5 to avoid using slice op * fix scale op output layer * implement stack op createPlugin * use workspace and move the defination to .cu * move stack plugin creator definition to .cu, test=develop * sync ut with develop Co-authored-by: Nzlsh80826 <zlsh80826@gmail.com>
-
- 30 7月, 2020 1 次提交
-
-
由 石晓伟 提交于
* ignore warnings of external libraries, test=develop (#24193) * fix repeat definitions in liengine.cc, test=develop (#25020) * remove paddle_use_kernel and paddle_use_op. test=develop (#25189) * fix compile for lite subgraph. test=develop (#25285) * [CI] [Lite-Subgraph] CI add lite subgraph check. (#25346) * supports xpu runtime, test=develop (#25554) * fix cmake of lite, test=develop (#25680) * change commit files, test=release/1.8 Co-authored-by: NWilber <jiweibo@baidu.com>
-
- 23 4月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* Chery_pick:[Eernie TRT]: add slice op and add emb eltwise layernorm fp16 support (#23723) * refine ernie trt dynamic shape support 1. add slice op converter 2. add emb eltwise layernorm fp16 support test=develop * fix dynamic shape test ut test=develop * fix comments. test=develop * fix comments test=develop * cherry-pick [BUG]: Head number can only be > 1 on multihead op (#23974) * support the head number == 1 test=develop * fix slice op error. test=develop * cherry-pick :disable trt test, test=develop (#23984) test=release/2.0-beta
-
- 20 4月, 2020 1 次提交
-
-
由 guofei 提交于
cherry-pick #23645
-
- 17 4月, 2020 1 次提交
-
-
由 Pei Yang 提交于
-
- 12 4月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* add elementwise pool2d, prelu, shuffle channel test=develop * add scale and refine concat eltwise conveter test=develop * refine elementwise converter test=develop * refine ut test and enforce error. test=develop * modify const cast test=develop
-
- 10 4月, 2020 1 次提交
-
-
由 Tao Luo 提交于
-
- 08 4月, 2020 2 次提交
-
-
由 Pei Yang 提交于
* add hard_sigmoid trt op converter * add hard_swish op converter and plugin. test=develop
- 26 3月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* add dynamic plugin support. test=develop * change emb eltwise layernorm to math function test=develop * add emb eltwise layernorm test=develop * can run dynamic shape ernie test=develop * fix ci test=develop * add ut for trt ernie dynamic test=develop * refine dynamic shape c++ interface. test=develop * fix comments test=develop * fix comments test=develop
-
- 09 3月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* change the ci trt from version 5. to 6.0 * paddle-trt dynamic shape support init * conv+bias or conv+bn dynamic shape support test=develop * modity trt engine opconvert test=develop * fix ci error test=develop
-
- 04 3月, 2020 1 次提交
-
-
由 石晓伟 提交于
* encapsulate the PaddleTensorToLoDTensor, test=develop * serialize the pd_tensor, test=develop * serialize tensors to file, test=develop
-
- 24 2月, 2020 1 次提交
-
-
由 GaoWei8 提交于
* Add an interface of disabling FC padding * fix bert regression * polish fc padding interface * recover pass function * fix argument error * fix mkldnn error
-
- 04 2月, 2020 1 次提交
-
-
由 石晓伟 提交于
-
- 09 1月, 2020 1 次提交
-
-
由 石晓伟 提交于
-
- 07 1月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* add TRT support for instance_norm op
-
- 06 1月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* add gelu plugin * align trt bert with gpu * add support for fused fc with relu, * add unittest for bert trt
-
- 16 12月, 2019 1 次提交
-
-
由 石晓伟 提交于
-
- 11 12月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
test=develop
-
- 05 12月, 2019 1 次提交
-
-
由 Pei Yang 提交于
-
- 04 12月, 2019 1 次提交
-
-
由 Pei Yang 提交于
* make DisableGlogInfo able to mute all logs in inference.
-
- 03 12月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
test=develop
-
- 02 12月, 2019 1 次提交
-
-
由 Lv Mengsi 提交于
* fix transpose conv,test=develop * fix comments test=develop
-
- 27 11月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
test=develop
-
- 23 10月, 2019 2 次提交
- 14 10月, 2019 1 次提交
-
-
由 Pei Yang 提交于
-
- 27 9月, 2019 1 次提交
-
-
由 石晓伟 提交于
* update operator compatible info, test=develop * revert cmake/version.cmake, test=develop * add unit_tests and fix bugs, test=develop * update ../paddle/fluid/framework/framework.proto, test=develop * fix bug of paddle/fluid/inference/api/analysis_predictor.cc, test=develop * update paddle/fluid/framework/version_test.cc, test=develop * add comments and rename interfaces, test=develop
-
- 25 9月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969) * fix memory optimization type test=develop * 1. fix BUG: open trt and memory optim will trigger bug. 2. Clean memory optim bug. test=develop
-
- 17 9月, 2019 1 次提交
-
-
由 Pei Yang 提交于
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
-
- 19 8月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* fix mask rcnn bug: 1. affine channel fuse (diff) 2. condition block op (memory leak) 3. merge lod tensor op (diff) 4. memroy optim (diff) test=develop * fix ci aboud PADDLE_ENFOCE fix merge lod infer op ut test=develop
-
- 24 7月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* update paddle-trt for: 1. fix bug: when batch > 2, core in split plugin. 2. add leaky_relu trt5.0 support (yolov3 from 65ms to 42ms.) 3. add new attr to dropout. 4. shuffle channel, swish, relu6 support test=develop * 1. fix ci test=develop
-
- 11 7月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy test=develop * enhance MkldnnPostReset test=develop * add comments for mkldnn_cache_capacity field test=develop
-
- 08 7月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* Fix Mask rcnn predictor 1. refine memory optim algorithm to support the model with the block op. 2. output diff : modify the affine channel fuse 3. add condition_block_infer op add interface for setting trt calib table dir test=develop * add the missing files. test=develop
-
- 02 7月, 2019 1 次提交
-
-
由 Tao Luo 提交于
test=develop
-
- 06 6月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
test=develop
-
- 03 6月, 2019 1 次提交
-
-
由 Tao Luo 提交于
test=develop
-
- 27 5月, 2019 1 次提交
-
-
由 Zhen Wang 提交于
* fix the bug that sub_scope_ may be null in AnalysisPredictor::Run. * add more directions about io APIs' docs. * update the API.spec. test=develop test=document_preview
-