提交 · 3e6d9dbbcac1b003253f9cb437e51e360970f407 · 机器未来 / Paddle

14 10月, 2021 1 次提交
- W
  inference support bert when exists matmul_v2 (#36424) · 3e6d9dbb
  由 Wilber 提交于 10月 14, 2021
```
* support bert when exists matmul_v2

* update
```
  3e6d9dbb
13 10月, 2021 1 次提交

[PaddleInference] Pass: add int8 flag for op (#36042) · d7858c99

由 Wangzheee 提交于 10月 13, 2021

* add_int_pass

* add_int8_flag_pass

* add_int8_flag_pass

* fix CMakeLists.txt

* fix test_trt_fc_fuse_quant_dequant_pass.py

* fix python/paddle/fluid/tests/unittests/ir/inference/test_trt_fc_fuse_quant_dequant_pass.py

* fix test_trt_fc_fuse_quant_dequant_pass.py

d7858c99

11 10月, 2021 1 次提交
- J
  
  added missing bf16 ops (#36291) · 14393876
  由 jakpiase 提交于 10月 11, 2021
  
  14393876
22 9月, 2021 1 次提交
- W
  
  fix: delete_quant_dequant_filter_op_pass, delete_quant_dequant_op_pass (#35879) · 5cda6b2b
  由 Wangzheee 提交于 9月 22, 2021
  
  5cda6b2b
06 9月, 2021 1 次提交

Add fusion_lstm INT8 PTQ (#35334) · 7ef04da6

由 joanna.wozna.intel 提交于 9月 06, 2021

* Add fusion_lstm INT8 PTQ

* Correct mkldnn_cache_capacity and enable fc_lstm_fuse_pass only for this test

* Change mkldnn_cache_capacity

7ef04da6

27 8月, 2021 1 次提交

Add fusion_gru and multi_gru to PTQ (Post-Training Quantization) (#33749) · 7debae3a

由 joanna.wozna.intel 提交于 8月 27, 2021

* Add calculation for gru op

* Correct the types

* Remove mkldnn only

* Correct mkldnn ifdef

* Remove mkldnn ifdef

* Separate mkldnn quantizer test

* Correct Windows test

* Check different cmake fix

* Revert cmake change

* Cmake change 2

* Cmake change 3

7debae3a

16 8月, 2021 1 次提交

Fix elementwise_add quantization (#34820) · ae80df91

由 joanna.wozna.intel 提交于 8月 16, 2021

* Remove force_fp32_output from elementwise_add quantization

* Fix cpu_quantize_placement test

* Review related changes

ae80df91

30 7月, 2021 1 次提交

Added reshape, reshape2, squeeze and squeeze2 BF16/FP32 FWD/BWD kernels (#34219) · 22c4c189

由 jakpiase 提交于 7月 30, 2021

* test version of matmul_v2

* added matmul_v2 grad kernel

* minor changes

* minor changes

* minor change for CI approval

* CI fix

* CI fix

* added squeeze and squeeze2 kernels

* CI fix

* CI fix

* CI fix

* disabled tests when compiled with cuda

* added setting format_tag by strides

* added sigmoid BF16 FWD/BWD and gelu BF16 BWD

* changes after review

* Revert "added sigmoid BF16 FWD/BWD and gelu BF16 BWD"

This reverts commit 6e3f76720b545abfcff9f6052b46b73a1e745cae.

* Revert "Merge branch 'matmul_v2_grad' into squeeze2_op"

This reverts commit 06fcf67843a4a7884eccdf67a02a03575e1d4cb8, reversing
changes made to 6e3f76720b545abfcff9f6052b46b73a1e745cae.

* minor change

* added reshape1/2 kernels

* moved some functions into private block

* CI fix

* CI fix

* CI fix

22c4c189

20 7月, 2021 1 次提交
- P
  
  optimize fusion pass logs to avoid duplication (#34261) · 52e2c83e
  由 Pei Yang 提交于 7月 20, 2021
  
  52e2c83e
07 7月, 2021 1 次提交
- J
  Added PRelu BF16/FP32 FWD/BWD kernels (#33878) · 375e5618
  由 jakpiase 提交于 7月 07, 2021
```
* added prelu bf16/fp32 fwd/bwd kernel
```
  375e5618
30 6月, 2021 1 次提交

Added matmul_v2 BF16/FP32 FWD kernel (#33750) · 24783c84

由 jakpiase 提交于 6月 30, 2021

* added matmul_v2 bf16/fp32 FWD kernel

added matmul_v2 bf16/fp32 FWD kernel

* added formatting

* removed some tests due to timeout in CI

* refactored tests

* merged tests classes into one file

* minor change

* removed test guard for CUDA

* remove skipIf

* changes after review

* formated one file

* minor change

* added skipping UT in CUDA place

24783c84

24 6月, 2021 1 次提交
- J
  [oneDNN] Fix to #33282 , added support of X input broadcasting to oneDNN elementwise ops (#33549) · 049dd853
  由 Jacek Czaja 提交于 6月 24, 2021
```
* - fix to #33282

* - Increased threshold for elementwise_mul_bf16 grad

* -disabled faulty UT

* - fix to approval
```
  049dd853
23 6月, 2021 1 次提交

Added split op bf16/fp32 oneDNN kernel (#33584) · 68106509

由 jakpiase 提交于 6月 23, 2021

* base changes for split op

* 90% of split functionality added

* full fp32 functionality

* added bf16 test

* added submemory caching

* added bf test to static mode whitelist

* minor change

* enabled split op for inference

* minor fix

* minor fix

68106509

12 6月, 2021 1 次提交

由 joanna.wozna.intel 提交于 6月 11, 2021

* Small changes related to BF16 fusion_gru and fusion_lstm

* Correct to pass arg by value

* Add conditions to rnn op

* Correct the spelling mistake

* Improving the test with checking activation

* Trigger CI

cd95ea82

28 4月, 2021 1 次提交

Nne integration (#32604) · abcb3f54

由 denglin-github 提交于 4月 28, 2021

* Add dlnne engine runtime

* Fix log

* Remove <const_cast> and remove unrelated modify with dlnne, +clang-format

* Fix CMakeList format error

* Add copyright message

* Fix dlnne CMakeList.txt

* Add some paddlepaddle_pass to support more networks

* Fix some format bug

* Add delete dropout_op pass

* Fix some format bug

* Fix format bug

abcb3f54

30 3月, 2021 1 次提交

[Paddle-TRT] TRT inference support for BERT/Transformer in paddle 2.0 api (#31744) · 14b7e3cf

由 Pei Yang 提交于 3月 30, 2021

* support multihead_matmul_fuse_pass_v3

* fix compile problems

* embedding_eltwise_ln pass support lookup_table_v2

* suppoort matmul and matmul_v2 in qkv matmul

14b7e3cf

01 3月, 2021 1 次提交
- A
  
  updated conv bn fuse pass to make it compatible with latest batch_norm op (#31272) · bfb8a642
  由 alncat 提交于 3月 01, 2021
  
  bfb8a642
23 2月, 2021 1 次提交

Unification of BF16 enablement process (#31034) · 781df300

由 joanna.wozna.intel 提交于 2月 23, 2021

* Unification of bfloat16 enablement process and refactor

* Remove unnecessary function

* Standardize the output name search

781df300

18 2月, 2021 1 次提交

Add Conv Transpose BF16 (#30877) · caf9d398

由 joanna.wozna.intel 提交于 2月 18, 2021

* Add conv transpose BF16

* Share function GetWeightsTz

* Adjust to review and fix op compatibility

* Add bias to unique handler name

* Remove errors related to paddle enforce

* Add conv2d_transpose to bf16 list and kernel refator

caf9d398

04 2月, 2021 1 次提交
- W
  use iwyu clean include second time, test=develop (#30829) · 35c5b23f
  由 wanghuancoder 提交于 2月 04, 2021
```
* use iwyu clean include second time, test=develop
```
  35c5b23f
03 2月, 2021 1 次提交
- A
  
  Layer normalization fuse pass. (#30721) · 4f066e31
  由 Adam Osewski 提交于 2月 03, 2021
  
  4f066e31
27 1月, 2021 1 次提交
- A
  
  modified conv+bn fuse pass to fix wrong mask in mask rcnn (#30704) · 5ace20fc
  由 alncat 提交于 1月 27, 2021
  
  5ace20fc
13 1月, 2021 1 次提交

Added support for inference using quantization aware trained dygraph (#30288) · 7bbf3ac5

由 alncat 提交于 1月 13, 2021

* added support for inference using qunatization aware trained dygraph

* added support for inference using qunatization aware trained dygraph
correct boost get usage

* Delete incorrect warning message (#30196)

* fix warning and no grad

* clean redundant API alias in 2.0 - part 2 (#30013)

* delete paddle.nn.functional.assign

* fix dynamic to static error

* just add the op error message for the matmul xpu (#30246)

 add the op error message for the matmul xpu

* Add Static Variable Clone (#30208)

Add clone method for static Variable so that this interface will be same as dygraph. It fixed some bugs in dy2stat

* use wget to replace curl to download the lcov file (#30229)

* use wget to replace curl to download the lcov file

* add cache for lcov

* fix test_pool3d_op timeout issue (#30248)

* Fix unittests bugs. (#30250)

* modify error message based on comments (#30189)

* modify error message based on comments

* edit code according to review.

* Correct spelling according to review.

* Fix bug for 'save mutiple method' (#30218)

* Fix bug for 'save mutiple method'

* To pass coverage.

* edit code to pass coverage.

* edit code to pass coverage.

* add unittest for coverage.

* change for coverage.

* edit for coverage.

* added support for inference using qunatization aware trained dygraph

* Alias from  paddle.fluid.layers.auc to paddle.static.auc (#30206)

* add alias from  fluid.layers.auc to static.auc

* Update __init__.py

* added support for inference using qunatization aware trained dygraph
correct boost get usage

* corrected boost get usage

* corrected naming issues and enforcing zero check

* correct paddle enforce message

* added more error checkings

* corrected error report message and optimized code

* corrected findvar usage

* corrected paddle_enforce in scope

* correct error messages

* correct error reporting format
Co-authored-by: NLielinJiang <50691816+LielinJiang@users.noreply.github.com>
Co-authored-by: NXiaoguangHu <46782768+XiaoguangHu01@users.noreply.github.com>
Co-authored-by: Nwawltor <fangzeyang0904@hotmail.com>
Co-authored-by: NHuihuang Zheng <zhhsplendid@gmail.com>
Co-authored-by: NYUNSHEN XIE <1084314248@qq.com>
Co-authored-by: NBai Yifan <me@ethanbai.com>
Co-authored-by: Ngongweibao <weibao.gong@gmail.com>
Co-authored-by: NWeiXin <weixin10@baidu.com>
Co-authored-by: NJiaqi Liu <liujiaqi06@baidu.com>

7bbf3ac5

29 12月, 2020 1 次提交
- C
  map matmul/squeeze2+matmul/reshape2+matmul to mul (#29911) · 6a0102b0
  由 cc 提交于 12月 29, 2020
```
* map matmul/squeeze2+matmul/reshape2+matmul to mul
```
  6a0102b0
24 12月, 2020 1 次提交
- J
  
  Added fc + activation fuse pass (currently only gelu, sigmoid and tanh are supported) (#29772) · edc06c6a
  由 jakpiase 提交于 12月 24, 2020
  
  edc06c6a
30 11月, 2020 1 次提交
- W
  
  Add quantization of multi_gru op and tests (#28615) · 4fd4095d
  由 Wojciech Uss 提交于 11月 30, 2020
  
  4fd4095d
26 11月, 2020 2 次提交
- J
  Add bf16 pool2d and unify bf16 unit tests (#29039) · b0d1ac16
  由 joanna.wozna.intel 提交于 11月 26, 2020
```
* Add bf16 pool2d and unify bf16 unit tests

* Add change default ops test
```
  b0d1ac16
- J
  Fix cpu_bfloat16_pass (#28730) · fddea674
  由 joanna.wozna.intel 提交于 11月 26, 2020
```
* Fix cpu_bfloat16_pass

* Add output_format

* Fix incorrect SetOutput

* Change fromating
```
  fddea674
25 11月, 2020 1 次提交
- W
  Add multi_gru_fuse_pass and tests (#28601) · 7b5a8e46
  由 Wojciech Uss 提交于 11月 25, 2020
```
* Add multi_gru_fuse_pass and tests

* fix date

* cleaned up headers
```
  7b5a8e46
24 11月, 2020 1 次提交
- W
  Add multi_gru_seq_fuse_pass and tests (#28604) · 991345b3
  由 Wojciech Uss 提交于 11月 24, 2020
```
* Add multi_gru_seq_fuse_pass and tests

* fix date

* removed unused functions
```
  991345b3
20 11月, 2020 1 次提交
- J
  Add bf16 matmul, fc, elementwise add and mul (#28729) · 8c0ea4bf
  由 joanna.wozna.intel 提交于 11月 20, 2020
```
* Add bf16 matmul, fc, elementwise add and mul

* Correct unit test
```
  8c0ea4bf
17 11月, 2020 1 次提交
- J
  
  [oneDNN] Layer norm bf16 kernel (#28619) · 6d8d3d4c
  由 Jacek Czaja 提交于 11月 17, 2020
  
  6d8d3d4c
06 11月, 2020 1 次提交
- J
  Add bfloat16 softmax and gelu (#28394) · 7821759d
  由 joanna.wozna.intel 提交于 11月 06, 2020
```
* Add bfloat16 softmax and gelu

* Add pass attr bfloat16_enabled_op_types

* Changes from review
```
  7821759d
05 11月, 2020 1 次提交
- J
  [oneDNN]Sum bf16 kernel (#28382) · ca415414
  由 Jacek Czaja 提交于 11月 05, 2020
```
* - Added sum bf16 oneDNN

test=develop

* - Fix to UT of sum bf16

test=develop
```
  ca415414
29 10月, 2020 1 次提交
- J
  
  Add bf16 transpose2, reshape2, concat ops (#28195) · 571a63e7
  由 joanna.wozna.intel 提交于 10月 29, 2020
  
  571a63e7
27 10月, 2020 1 次提交
- Z
  add Fuse bn add act pass (#28196) · fdc06f21
  由 Zhang Ting 提交于 10月 27, 2020
```
* add fuse_bn_add_act pass
```
  fdc06f21
26 10月, 2020 1 次提交
- A
  
  oneDNN BatchNorm + Act fusion pass. (#27912) · 7db747d9
  由 Adam Osewski 提交于 10月 26, 2020
  
  7db747d9
09 10月, 2020 1 次提交
- J
  
  [oneDNN] GRU BF16 kernel (#27731) · 606611d3
  由 Jacek Czaja 提交于 10月 09, 2020
  
  606611d3
01 10月, 2020 1 次提交
- W
  
  Added support for quantization of fusion_gru (#27518) · 966447e3
  由 Wojciech Uss 提交于 10月 01, 2020
  
  966447e3
26 9月, 2020 1 次提交
- J
  
  Add conv2d bfloat16 support (#27325) · b0ee1405
  由 joanna.wozna.intel 提交于 9月 26, 2020
  
  b0ee1405

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致