- 11 3月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* 1. add embedding eltwise layernorm fuse 2. add embedding eltwise layernorm op 3. refine inplace_add_relu 4. refine fc_eltwise_layernorm test=develop * 1. refine fc test=develop * fix comments test=develop * fix comments test=develop
-
- 09 3月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* change the ci trt from version 5. to 6.0 * paddle-trt dynamic shape support init * conv+bias or conv+bn dynamic shape support test=develop * modity trt engine opconvert test=develop * fix ci error test=develop
-
- 04 3月, 2020 1 次提交
-
-
由 石晓伟 提交于
* encapsulate the PaddleTensorToLoDTensor, test=develop * serialize the pd_tensor, test=develop * serialize tensors to file, test=develop
-
- 28 2月, 2020 1 次提交
-
-
由 tianshuo78520a 提交于
-
- 24 2月, 2020 1 次提交
-
-
由 GaoWei8 提交于
* Add an interface of disabling FC padding * fix bert regression * polish fc padding interface * recover pass function * fix argument error * fix mkldnn error
-
- 15 2月, 2020 1 次提交
-
-
由 flame 提交于
-
- 14 2月, 2020 1 次提交
-
-
由 flame 提交于
* support golang inference
-
- 13 2月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* 1. optim multihead matmul: fuse three fc to multihtead matmul test=develop * fix conflict test=develop * fix comments test=develop
-
- 05 2月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* add mutex for trt engine test=develop * add the test for copy_to_cpu test=develop
-
- 04 2月, 2020 1 次提交
-
-
由 石晓伟 提交于
-
- 14 1月, 2020 3 次提交
-
-
由 liu zhengxi 提交于
-
由 silingtong123 提交于
* remove the useless third_party library from C++ inference library * revert removing the install directory
-
由 zhouwei25 提交于
faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164)
-
- 09 1月, 2020 1 次提交
-
-
由 石晓伟 提交于
-
- 07 1月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* add TRT support for instance_norm op
-
- 06 1月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* add gelu plugin * align trt bert with gpu * add support for fused fc with relu, * add unittest for bert trt
-
- 30 12月, 2019 1 次提交
-
-
由 zhouwei25 提交于
-
- 27 12月, 2019 1 次提交
-
-
由 石晓伟 提交于
* fix multi-thread error of fc_gru_fuse_pass.cc, test=develop * export FLAGS and GLOG symbols, test=develop
-
- 20 12月, 2019 1 次提交
-
-
由 Michał Gallus 提交于
* Disable memory opt pass when DNNL is on * Refine comment above mem optimization pass enablement test=develop
-
- 16 12月, 2019 1 次提交
-
-
由 石晓伟 提交于
-
- 12 12月, 2019 1 次提交
-
-
由 joanna.wozna.intel 提交于
* Add reshape int8 op test=develop * Change test to CPUPlace test=develop * Correct tests test=develop
-
- 11 12月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
test=develop
-
- 10 12月, 2019 1 次提交
-
-
由 rensilin 提交于
* ZeroCopyTensor::mutable_data in the right device, test=develop * add unittest for zerocopy, test=develop
-
- 05 12月, 2019 1 次提交
-
-
由 Pei Yang 提交于
-
- 04 12月, 2019 1 次提交
-
-
由 Pei Yang 提交于
* make DisableGlogInfo able to mute all logs in inference.
-
- 03 12月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
test=develop
-
- 02 12月, 2019 1 次提交
-
-
由 Lv Mengsi 提交于
* fix transpose conv,test=develop * fix comments test=develop
-
- 27 11月, 2019 2 次提交
-
-
由 Zhaolong Xing 提交于
test=develop
-
由 Michał Gallus 提交于
* Implement Int8 FC * Integrate FC into INT8v2 test=develop * int8 FC: transpose weights before computing scales test=develop * Add support for activation_type string in FC test=develop * Disable MKL-DNN's FC in VGG16 and 19 test=develop * Disable FC quantization when mkldnn FC is disabled test=develop * Solve PADDLE_ENFORCES in FC int8 * Fix Paddle enforces and remove const cast test=develop * Fix style changes test=develop * Fix quantizer_tester test and add fc quantization test=develop * Fix FC test fail on CUDA * Remove unnecessary log from quantize placement pass test=develop * Add Thread ID to FC hash key test=develop * Add comments to MKL-DNN FC Kernel test=develop * Refactor quantizer test=develop * Fix linter issues test=develop * Fix crash in slim googlenet test=develop * Fix PADDLE_ENFORCE messages test=develop
-
- 26 11月, 2019 1 次提交
-
-
由 silingtong123 提交于
-
- 19 11月, 2019 1 次提交
-
-
由 zhouwei25 提交于
-
- 18 11月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* refine trt int8 for dynamic range set test=develop * refine trt int8 test=develop
-
- 08 11月, 2019 1 次提交
-
-
由 joanna.wozna.intel 提交于
* Add transpose2 INT8 for mkl-dnn test=develop * Fix test_transpose_int8_mkldnn test=develop * Revert "Merge branch 'develop' into transpose_int8_mkldnn_2" This reverts commit 34011bdb, reversing changes made to 2ce6473f. * Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"" This reverts commit 23754dd7. * Add template to TransposeMKLDNNHandler test=develop * Resolve conflict test=develop * Restore get_size and refactor test=develop
-
- 23 10月, 2019 2 次提交
- 18 10月, 2019 1 次提交
-
-
由 石晓伟 提交于
* support MLU nums, test=develop * change anakin apis, test=develop
-
- 14 10月, 2019 1 次提交
-
-
由 Pei Yang 提交于
-
- 13 10月, 2019 1 次提交
-
-
由 zhaoyuchen2018 提交于
* Add multihead fuse pass for ernie opt * Refine softmax test=develop * Refine cuda kernel * Refine cuda version * Refine cmake test=develop * refine header file * refine test case and pass * refine comments
-
- 12 10月, 2019 1 次提交
-
-
由 Adam 提交于
* Add ConvTranspose + BatchNorm fuse pass test=develop * Add tests for conv+bn and conv_transpose+bn passes test=develop
-
- 10 10月, 2019 1 次提交
-
-
由 石晓伟 提交于
-