- 22 6月, 2021 1 次提交
-
-
由 Pei Yang 提交于
-
- 25 4月, 2021 2 次提交
-
-
由 Wilber 提交于
-
由 denglin-github 提交于
* Add dlnne engine runtime * Fix log * Remove <const_cast> and remove unrelated modify with dlnne, +clang-format * Fix CMakeList format error * Add copyright message * Fix dlnne CMakeList.txt * Add some paddlepaddle_pass to support more networks * Fix some format bug
-
- 20 2月, 2021 1 次提交
-
-
由 Qi Li 提交于
-
- 04 2月, 2021 1 次提交
-
-
由 joanna.wozna.intel 提交于
* Update Xbyak and add bf16 fast performance verification * Fix formating * Change LOG message * Trigger an update of a new tag
-
- 03 2月, 2021 1 次提交
-
-
由 石晓伟 提交于
* support xpu inference with analysis predictor, test=develop * merge the cmake of the xpu toolchain, test=develop * add c-apis, test=develop * fix a bug in extern_xpu, test=develop
-
- 25 1月, 2021 1 次提交
-
-
由 Shang Zhizhou 提交于
* add dla * add dla done * add python api Co-authored-by: Nshangzhizhou <root@szth-rp-fanyi-opera49.szth.baidu.com>
-
- 13 1月, 2021 1 次提交
-
-
由 alncat 提交于
* added support for inference using qunatization aware trained dygraph * added support for inference using qunatization aware trained dygraph correct boost get usage * Delete incorrect warning message (#30196) * fix warning and no grad * clean redundant API alias in 2.0 - part 2 (#30013) * delete paddle.nn.functional.assign * fix dynamic to static error * just add the op error message for the matmul xpu (#30246) add the op error message for the matmul xpu * Add Static Variable Clone (#30208) Add clone method for static Variable so that this interface will be same as dygraph. It fixed some bugs in dy2stat * use wget to replace curl to download the lcov file (#30229) * use wget to replace curl to download the lcov file * add cache for lcov * fix test_pool3d_op timeout issue (#30248) * Fix unittests bugs. (#30250) * modify error message based on comments (#30189) * modify error message based on comments * edit code according to review. * Correct spelling according to review. * Fix bug for 'save mutiple method' (#30218) * Fix bug for 'save mutiple method' * To pass coverage. * edit code to pass coverage. * edit code to pass coverage. * add unittest for coverage. * change for coverage. * edit for coverage. * added support for inference using qunatization aware trained dygraph * Alias from paddle.fluid.layers.auc to paddle.static.auc (#30206) * add alias from fluid.layers.auc to static.auc * Update __init__.py * added support for inference using qunatization aware trained dygraph correct boost get usage * corrected boost get usage * corrected naming issues and enforcing zero check * correct paddle enforce message * added more error checkings * corrected error report message and optimized code * corrected findvar usage * corrected paddle_enforce in scope * correct error messages * correct error reporting format Co-authored-by: NLielinJiang <50691816+LielinJiang@users.noreply.github.com> Co-authored-by: NXiaoguangHu <46782768+XiaoguangHu01@users.noreply.github.com> Co-authored-by: Nwawltor <fangzeyang0904@hotmail.com> Co-authored-by: NHuihuang Zheng <zhhsplendid@gmail.com> Co-authored-by: NYUNSHEN XIE <1084314248@qq.com> Co-authored-by: NBai Yifan <me@ethanbai.com> Co-authored-by: Ngongweibao <weibao.gong@gmail.com> Co-authored-by: NWeiXin <weixin10@baidu.com> Co-authored-by: NJiaqi Liu <liujiaqi06@baidu.com>
-
- 06 1月, 2021 1 次提交
-
-
由 Shang Zhizhou 提交于
* snap * add inference api: DisableTensorRtOPs * fix code style * update api to experimental * update variable name
-
- 17 12月, 2020 1 次提交
-
-
由 Wilber 提交于
* enable_use_gpu has higher priority than FLAGS * update.
-
- 02 12月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 21 11月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 12 11月, 2020 1 次提交
-
-
由 Shang Zhizhou 提交于
* skip_layernorm_op done * add unittest * slice op convertor support trt < 6 * skip_layernorm only work in ernie
-
- 03 11月, 2020 1 次提交
-
-
由 Shang Zhizhou 提交于
* fp16 result ok * change -DWITH_NVINFER_PLUGIN toconfig.EnableTensorRtOSS * auto detect special slice op converter for ernie with trt oss * ernie oss only support fp16 * fix special_slice_plugin serialize bug * matmul in tensorrt ok * ernie unittest ok * add matmul tensorrt unittest * remove demo code
-
- 21 10月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 20 10月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
* Add AVX512 instruction check for C-API * Fix formatting
-
- 13 10月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
-
- 24 9月, 2020 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
- 16 9月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 02 9月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
* Add mkldnn bfloat16 option to C-API * Add test for bfloat16 gpu * Change coverage test * Repair capi_gpu test
-
- 01 9月, 2020 1 次提交
-
- 28 8月, 2020 2 次提交
-
-
由 Wilber 提交于
* refine paddle inference api Co-authored-by: Nnhzlx <nhzlx.dragon@gmail.com>
-
由 joanna.wozna.intel 提交于
* Add mkldnn bfloat16 option to C-API * Add test for bfloat16 gpu * Change coverage test
-
- 22 7月, 2020 1 次提交
-
-
由 石晓伟 提交于
* update ResetHolder, test=develop * add TensorShare for lite engine, test=develop * tensor data changed from copying to sharing, test=develop * supports xpu runtime, test=develop * fix code styles, test=develop
-
- 23 6月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* Paddle-TensorRT support slim QAT. test=develop * add comments. test=develop * use RenameInput instead of ResetInputs. test=develop
-
- 29 4月, 2020 1 次提交
-
-
由 石晓伟 提交于
* update the analysis predictor, test=develop * update the unit test, test=develop * no priority set before the inferface determined, test=develop * interface name generalization, test=develop
-
- 23 4月, 2020 1 次提交
-
-
由 Pei Yang 提交于
-
- 09 4月, 2020 1 次提交
-
-
由 mozga-intel 提交于
* Remove the NGraph engine from PDPD repository 1. Each operator was removed from the operator's directory 2. Each test was removed from the unittest directory 3. The parallel executor support was removed from the PDPD 4. The CMake file was removed from the PDPD 5. The NG flags were removed from the repository test=develop * Remove ngraph from: 1. Cmake file 2. Python file test=develop
-
- 26 3月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* add dynamic plugin support. test=develop * change emb eltwise layernorm to math function test=develop * add emb eltwise layernorm test=develop * can run dynamic shape ernie test=develop * fix ci test=develop * add ut for trt ernie dynamic test=develop * refine dynamic shape c++ interface. test=develop * fix comments test=develop * fix comments test=develop
-
- 09 3月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* change the ci trt from version 5. to 6.0 * paddle-trt dynamic shape support init * conv+bias or conv+bn dynamic shape support test=develop * modity trt engine opconvert test=develop * fix ci error test=develop
-
- 24 2月, 2020 1 次提交
-
-
由 GaoWei8 提交于
* Add an interface of disabling FC padding * fix bert regression * polish fc padding interface * recover pass function * fix argument error * fix mkldnn error
-
- 04 2月, 2020 1 次提交
-
-
由 石晓伟 提交于
-
- 09 1月, 2020 1 次提交
-
-
由 石晓伟 提交于
-
- 20 12月, 2019 1 次提交
-
-
由 Michał Gallus 提交于
* Disable memory opt pass when DNNL is on * Refine comment above mem optimization pass enablement test=develop
-
- 14 10月, 2019 1 次提交
-
-
由 Pei Yang 提交于
-
- 25 9月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969) * fix memory optimization type test=develop * 1. fix BUG: open trt and memory optim will trigger bug. 2. Clean memory optim bug. test=develop
-
- 17 9月, 2019 1 次提交
-
-
由 Pei Yang 提交于
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
-
- 03 9月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
* Add a interface to enable cudnn for inference. * Add cudnn_placement_pass. test=develop * Set the default value of cudnn_enabled_op_types to null. test=develop * Write the common basic class, placement_pass_base, to refine the codes. test=develop * Call EnableCUDNN in unittest. test=develop * Refine cudnn_placement_pass tester. * Enable the testing of cudnn_placement_pass in inference's unittest. test=develop * Add the check of op kernels. test=develop
-
- 11 7月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy test=develop * enhance MkldnnPostReset test=develop * add comments for mkldnn_cache_capacity field test=develop
-
- 08 7月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* Fix Mask rcnn predictor 1. refine memory optim algorithm to support the model with the block op. 2. output diff : modify the affine channel fuse 3. add condition_block_infer op add interface for setting trt calib table dir test=develop * add the missing files. test=develop
-