提交 · 707d838b06b5165155b13e8a511be8d9a6ff220b · 机器未来 / Paddle

19 9月, 2022 1 次提交
- W
  
  cherry-pick 46152 (#46183) · 707d838b
  由 Wilber 提交于 9月 19, 2022
  
  707d838b
05 9月, 2022 2 次提交

New format quant model support for MKLDNN (#45416) · 4e4f4586

由 yeliang2258 提交于 9月 05, 2022

* support onnx format quantized model

* update code

* add test

* add test

* fix

* fix test

* fix cmake

* update code

* change scale file path to calibration file path

* update code

* update code

* fix build bug

* fix build bugs

* fix

* fix

4e4f4586

Update DlNNE engine (#45027) · 638965c5

由 denglin-github 提交于 9月 05, 2022

* add config param for enable_dlnne and support calibration mode
* remove useless file
* refine code and add annotation
* refine code of Warnning tips

638965c5

05 8月, 2022 1 次提交

update trt workspace size param (#44469) · bdce552b

由 Zhang Jun 提交于 8月 05, 2022

* update trt workspace size param

* update

* update

* update

* use int64_t

* use int64_t

* upate

* update

bdce552b

29 6月, 2022 1 次提交
- W
  
  convert to mixed model python api (#43881) · cbaebb04
  由 Wilber 提交于 6月 29, 2022
  
  cbaebb04
24 6月, 2022 1 次提交
- W
  revert 40531 (#43807) · 7985407b
  由 Wilber 提交于 6月 24, 2022
```
* revert 40531

* update
```
  7985407b
05 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：clang-format (#42840) · a3730dc8
  由 Sing_chan 提交于 6月 05, 2022
  
  a3730dc8
02 6月, 2022 1 次提交
- W
  [Paddle-Inference] new general transformer inference support (#43077) · 2810dfea
  由 Wangzheee 提交于 6月 02, 2022
```
* new general transformer inference support
```
  2810dfea
13 5月, 2022 1 次提交

[IPU] fix ipu and add python infer api, test=develop (#42724) · 9029fde7

由 Qi Li 提交于 5月 13, 2022

* [IPU] fix ipu and add python infer api, test=develop

* [IPU] add paddlepaddle-ipu package name, test=develop

9029fde7

25 4月, 2022 1 次提交
- L
  
  fix gcc warning of cast-function-type (#42235) · e95838dd
  由 Leo Chen 提交于 4月 25, 2022
  
  e95838dd
14 4月, 2022 1 次提交

add mkldnn int8 pass [step3] (#41599) · 8e2d4d30

由 baoachun 提交于 4月 14, 2022

* add mkldnn int8 pass [step3]

* Add test for compute_propagate_scales_mkldnn_pass

* update pass

* update api comment and python api
Co-authored-by: Nwozna <joanna.wozna@intel.com>

8e2d4d30

12 4月, 2022 1 次提交

add python share_data interface (#41626) · be4a2077

由 JingZhuangzhuang 提交于 4月 12, 2022

* add python share_data interface

* Update inference_api.cc

* Update inference_api.cc

* add python share_data interface

be4a2077

17 3月, 2022 1 次提交
- B
  
  support gpu mixed precision inference (#40531) · 06fee998
  由 baoachun 提交于 3月 17, 2022
  
  06fee998
10 3月, 2022 1 次提交

Inference add ONNXRuntime back-end (#39988) · 431afc39

由 heliqi 提交于 3月 10, 2022

* add onnxruntime predictor

* Add code comments

* support link paddle2onnx onnxruntime

* support onnxruntime with python

* support onnxruntime with python

* support onnxruntime with windows

* paddle2onnx compile with windows

* supoort windows compile

* supoort windows compile with onnxruntime

* supoort windows compile with paddle2onnx

* supoort mac compile

* compile with mac

* compile with mac

* add code comments

* fix remind word

* code optimization

* add test case

* add test case

* add inference demo_ci test case

* fix compile paddle2onnx with no python

* add inference demo_ci test case

* add inference demo_ci test case

* add inference infer_ut test case

* support c go api and test cases

* add converage test case

* add converage test case

* add capi test case

* add capi test case

431afc39

02 3月, 2022 1 次提交
- Y
  [fleet_executor] Add entrance of FleetExecutor in AnalysisPredictor for... · 244ae318
  由 Yuang Liu 提交于 3月 02, 2022
```
[fleet_executor] Add entrance of FleetExecutor in AnalysisPredictor for distributed inference (#39992)
```
  244ae318
11 2月, 2022 1 次提交
- L
  
  Add TensorRT inspector into Paddle-TRT (#38362) · 69793a27
  由 Leo Chen 提交于 2月 11, 2022
  
  69793a27
31 12月, 2021 1 次提交
- W
  
  fix python ascend run error. (#38605) · 1df354e7
  由 Wilber 提交于 12月 31, 2021
  
  1df354e7
20 10月, 2021 1 次提交

Add FasterTokenizer Operator (#34491) · 3f2d6a3f

由 Steffy-zxf 提交于 10月 20, 2021

Add Tokenizer related functionalities for Transformer model in order that the process of training and predicting is consistent.

* support the text string as an input Tensor
* support the "VOCAB"unordered_map<wstring, int> as an input Tensor to lookup tokens
* Tokenizer used for BERT. This tokenizer applies an end-to-end, text string to wordpiece tokenization.
* It first applies basic tokenization, followed by wordpiece tokenization.

3f2d6a3f

19 10月, 2021 1 次提交
- W
  Inference add type check in copy_from_cpu (#36429) · be6a8330
  由 Wilber 提交于 10月 19, 2021
```
* update

* fix ut error

* update ut
```
  be6a8330
22 9月, 2021 2 次提交
- T
  Fix copy elision warning (#35885) · 47d6bc86
  由 Tomasz Socha 提交于 9月 22, 2021
```
* Fix copy elision warning

* Remove redundand code
```
  47d6bc86
- J
  
  [Inference] Support NNAdapter and ascend310 (#35226) · 10e53044
  由 JingZhuangzhuang 提交于 9月 22, 2021
  
  10e53044
15 9月, 2021 1 次提交
- H
  
  add set-xpu-device-id function for inference config. (#35572) · a74d7fb6
  由 houj04 提交于 9月 15, 2021
  
  a74d7fb6
14 9月, 2021 1 次提交
- W
  
  [Inference] Add tuned trt_dynamic_shape mode. (#34806) · 7c96efed
  由 Wilber 提交于 9月 14, 2021
  
  7c96efed
04 9月, 2021 1 次提交
- W
  
  update inference trt ut framework (#35418) · e8772486
  由 Wilber 提交于 9月 04, 2021
  
  e8772486
31 8月, 2021 1 次提交
- S
  Revert "Revert "Add copy from tensor (#34406)" (#35173)" (#35256) · 6116f9af
  由 Shang Zhizhou 提交于 8月 31, 2021
```
* Revert "Revert "Add copy from tensor (#34406)" (#35173)"

This reverts commit 32c1ec42.

* add template instantiation
```
  6116f9af
27 8月, 2021 1 次提交
- Z
  Revert "Add copy from tensor (#34406)" (#35173) · 32c1ec42
  由 zhangchunle 提交于 8月 27, 2021
```
This reverts commit ac33c0ca.
```
  32c1ec42
26 8月, 2021 1 次提交

Add copy from tensor (#34406) · ac33c0ca

由 Shang Zhizhou 提交于 8月 26, 2021

* add api

* temp save

* revert

* copytocpu async ok

* fix style

* copy sync ok

* fix compile error

* fix compile error

* api done

* update python async api

* fix compile

* remove async python api; add c++ async unittest

* remove python async api

* update unittest

* update unittest

* add C++ unittest for copytensor

* add unittest

* update namespace utils to class TensorUtils

* add unittest

* update unittest

* update unittest

* update code style

* update code style

* update unittest

ac33c0ca

12 8月, 2021 1 次提交
- W
  
  [Inference] Inference python api support fp16 (#34676) · 6326c3ef
  由 Wilber 提交于 8月 12, 2021
  
  6326c3ef
19 7月, 2021 1 次提交
- W
  
  [Inference] Add config.Summary api (#34122) · 831c1c6c
  由 Wilber 提交于 7月 19, 2021
  
  831c1c6c
14 7月, 2021 1 次提交
- W
  
  Inference support Ascend910 (#34101) · 4e3fb219
  由 Wilber 提交于 7月 14, 2021
  
  4e3fb219
08 6月, 2021 1 次提交

add dynamic layer_norm plugin (#33293) · 45d1ae21

由 Shang Zhizhou 提交于 6月 08, 2021

* add dynamic layer_norm plugin

* fix bug

* fix numpy.allclose

* fix format

* fix code style

* remove shepe in dynamic shape

* code format

* remove layer norm fp16

* fix format

45d1ae21

25 4月, 2021 2 次提交

W

update lite subgraph api. (#32513) · 92dc9b2b
由 Wilber 提交于 4月 25, 2021

92dc9b2b

Nne integration (#32255) · feb2e476

由 denglin-github 提交于 4月 25, 2021

* Add dlnne engine runtime

* Fix log

* Remove <const_cast> and remove unrelated modify with dlnne, +clang-format

* Fix CMakeList format error

* Add copyright message

* Fix dlnne CMakeList.txt

* Add some paddlepaddle_pass to support more networks

* Fix some format bug

feb2e476

19 2月, 2021 1 次提交
- W
  
  fix python pass builder error. (#30946) · 0020d915
  由 Wilber 提交于 2月 18, 2021
  
  0020d915
03 2月, 2021 1 次提交

石

support xpu with analysis predictor, test=develop (#30832) · 2ac4143b

由石晓伟提交于 2月 03, 2021

* support xpu inference with analysis predictor, test=develop

* merge the cmake of the xpu toolchain, test=develop

* add c-apis, test=develop

* fix a bug in extern_xpu, test=develop

2ac4143b

25 1月, 2021 1 次提交

add DLA support：C++&&Python api (#30165) · ae0f88a9

由 Shang Zhizhou 提交于 1月 25, 2021

* add dla

* add dla done

* add python api
Co-authored-by: Nshangzhizhou <root@szth-rp-fanyi-opera49.szth.baidu.com>

ae0f88a9

04 1月, 2021 1 次提交
- C
  [Inference] zero_copy_tensor supports int8_t (#30053) · 68398abc
  由 cc 提交于 1月 04, 2021
```
* zero_copy_tensor supports int8_t
```
  68398abc
15 12月, 2020 1 次提交
- W
  
  fix none-contiguous bug for python api. (#29615) · 78dad786
  由 Wilber 提交于 12月 15, 2020
  
  78dad786
11 11月, 2020 1 次提交
- W
  
  [Inference] Add TryShrinkMemory interface. (#28409) · 1bf48365
  由 Wilber 提交于 11月 11, 2020
  
  1bf48365
03 11月, 2020 1 次提交

TensorRT中ernie模型推理性能优化，支持变长输入 (#28367) · ea851796

由 Shang Zhizhou 提交于 11月 03, 2020

* fp16 result ok

* change -DWITH_NVINFER_PLUGIN toconfig.EnableTensorRtOSS

* auto detect special slice op converter for ernie with trt oss

* ernie oss only support fp16

* fix special_slice_plugin serialize bug

* matmul in tensorrt ok

* ernie unittest ok

* add matmul tensorrt unittest

* remove demo code

ea851796

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致