提交 · 9e006987626c265d13c72a384a8303bc2077e5cb · PaddlePaddle / Paddle

04 11月, 2022 1 次提交
- J
  Optimized oneDNN FC and added operator+unsqueeze2 and operator+reshape2 oneDNN fuse passes (#47391) · 9e006987
  由 jakpiase 提交于 11月 04, 2022
```
* tmp save

* minor chnage

* CI fix

* added FC optimizations

* latest update

* CI fix

* fixed bug with fusing fc
```
  9e006987
03 11月, 2022 3 次提交

Y
Fix ComputePropagateScalesMkldnnPass of MKLDNN (#47574) · 5fc92943
由 yeliang2258 提交于 11月 03, 2022
```
* add constant_folding_pass pass for mkldnn int8

* update UpdateScaleOpInOutScales
```
5fc92943

[PHI] Migrate softmax kernel (#47339) · b8ae3858

由 Sławomir Siwek 提交于 11月 03, 2022

* add extra attr property set

* add type_info for all context

* add onednn context to all context

* fix context compile error

* simplify conv kernel args

* pass runtime attr into dev_ctx

* fix marco error

* clear conv_grad_kernel extra args

* merge conv_grad_grad into conv_grad

* clear conv2d_grad_grad extra attrs

* remove redundant imports

* migrate softmax

* clear yaml and eager extra attr

* fix conv1d error

* change to thread local

* fix npu compile failed

* try to fix windows compile failed

* add conv2d onednn phi kernel

* fix ci bugs (#36)

* fix compile bugs (#38)

* fix extra input transform bug (#39)

* support dynamic created attr (#40)

* reset extra info gen code

* rm conv_grad_grad kernel

* reimpl pass attr adapting

* add int attr support

* remove vector inputnames creating

* merge dev

* fix map at error

* adjust attribute

* adapt funcs to PHI
Co-authored-by: NChen Weihang <chenweihang@baidu.com>
Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>

b8ae3858

W

bug fix (#47611) · 5160628c
由 wenbin 提交于 11月 03, 2022

5160628c

02 11月, 2022 1 次提交
- 丁
  
  Logsigmoid and Tanhshrink ops convert to trt (#47322) · b045fdfb
  由丁一提交于 11月 02, 2022
  
  b045fdfb
01 11月, 2022 1 次提交
- K
  fix memory copy in prepare_data of FusedMultiTransformer pass (#47306) · 9ad0e37e
  由 Kaipeng Deng 提交于 11月 01, 2022
```
* fix memory copy in prepare_data. test=develop
```
  9ad0e37e
31 10月, 2022 1 次提交
- F
  feat: add int8 support for vit (#47330) · 2953b708
  由 feng_shuai 提交于 10月 31, 2022
```
* feat: add int8 support for vit

* test:add test
```
  2953b708
27 10月, 2022 2 次提交

make all cpp tests dynamic linked to libpaddle.so [except windows] (#47088) · 2096448b

由 Leo Chen 提交于 10月 27, 2022

* make all cpp tests dynamic linked to libpaddle.so

* add comments

* keep old cc_test for some tests

* fix some ut

* make some ut use cc_test_old

* fix typos and fit for win32

* fix lib path

* fix some tests

* skip lite test

* fit for rocm

* fit for cinn

* fit for mac

* fit for win32

* skip inference ut

* skip  windows

* fix coverage

2096448b

C
Fix compile error of mkldnn and tensorrt (#47388) · 19feba38
由 Chen Weihang 提交于 10月 26, 2022
```
* fix compile error of mkldnn

* fix tensorrt error
```
19feba38

26 10月, 2022 3 次提交

Preln_Layernorm_Shift_Partition (#47099) · d17d0cd1

由 wenbin 提交于 10月 26, 2022

* prelnlayernorm_shift

* add ut

* remove paddle_enforce

* remove useless

* add UT

* remove UT

* add UT

* set timeout

d17d0cd1

FC/matmul(v2) + scale fuse pass (#47127) · c1c2be2d

由 Sławomir Siwek 提交于 10月 26, 2022

* fc/matmuls + scale fuse pass

* remove double-extension

* add unit tests

* comments from review

* codestyle

* add pass to int8 list

* new codestyle

* attr name typo

c1c2be2d

C
Remove the declaration of using LoDTensor in framework/lod_tensor.h (Part2) (#46953) · 1cb12ff5
由 Chen Weihang 提交于 10月 25, 2022
```
* remove using lodtensor part2

* resolve code format error

* resolve conflict

* resolve conflict

* replace added frameworrk tensor
```
1cb12ff5

24 10月, 2022 1 次提交
- Y
  Fix compilation bug caused by incorrect log information (#47254) · 40212582
  由 yeliang2258 提交于 10月 24, 2022
```
* fix log bugs

* more fix

* fix bugs
```
  40212582
21 10月, 2022 1 次提交
- A
  
  fix runtime error (#47133) · 016766cc
  由 Allen Guo 提交于 10月 21, 2022
  
  016766cc
20 10月, 2022 2 次提交
- K
  Add FusedMultiTransformer fuse pass for GPT3 (#45907) · 5a2e5179
  由 Kaipeng Deng 提交于 10月 20, 2022
```
* add fused_multi_transformer_encoder/decoder pass, run GPT-3 success
```
  5a2e5179
- S
  
  log only if > 0 (#47181) · d6208aad
  由 Sylwester Fraczek 提交于 10月 20, 2022
  
  d6208aad
19 10月, 2022 2 次提交
- R
  Support stream overlap for c_allreduce_sum (#47030) · d00b7d83
  由 Ruibiao Chen 提交于 10月 19, 2022
```
* Support stream overlap for c_allreduce_sum

* Test CI

* Add notes

* Add SingleStreamGuard for BuildOpFuncList
```
  d00b7d83
- W
  [Dy2St]Fix recurrent op eager deletion pass error in dy2st (#47105) · 94132190
  由 WangZhen 提交于 10月 19, 2022
```
* Fix recurrent op eager deletion pass error in dy2st

* Polish code

* Refine error message
```
  94132190
18 10月, 2022 2 次提交

Merge layernorm trt fuse (#46320) · 5e9f491e

由 Wang Bojun 提交于 10月 18, 2022

* first version, accuracy corrected

* disable debug print

* use blockReduceSum in phi

* add UT

* add opCompat

* code style

* code refine

* bug fix

* code refine

* test fix

* bugfix

* codesytle fix

* code style

* code-style

* code-style

* code-style

5e9f491e

FC + activation fuse passes (#45183) · b7a23adb

由 Sławomir Siwek 提交于 10月 18, 2022

* git

* style

* leave default relu in kernel

* style

* cleanup FCMKLDNN pattern

* merge conflicts

* update develop

* update develop

* add const

* rename to oneDNN and adjust attributes

* whitespace

b7a23adb

17 10月, 2022 4 次提交
- H
  Revert "add common subexpression elimination (#44386)" (#47062) · 7c6835ca
  由 hong 提交于 10月 17, 2022
```
This reverts commit 166ff39a.
```
  7c6835ca
- W
  Layernorm shift partition enhance (#46816) · 9e08633c
  由 Wang Bojun 提交于 10月 17, 2022
```
* first version of ln_s_p with s>0

* refine and UT

* pass opt draft

* pass opt

* code refine

* code-style

* bug fix

* fix ci test

* code style
```
  9e08633c
- J
  
  fix for conv_bias_mkldnn_pass (#47037) · acbda3e4
  由 jakpiase 提交于 10月 17, 2022
  
  acbda3e4
- P
  skip ReplaceAllReduceOp in GraphtoBlock when nccl_ctxs_ is nullptr (#46911) · 2e7dc666
  由 pangyoki 提交于 10月 17, 2022
```
* skip ReplaceAllReduceOp in GraphtoBlock when nccl_ctxs_ is nullptr

* update ut

* test_dist_allreduce_op failed

* fix test_dist_allreduce_op

* add ut

* fix nccl cpu compile

* fix
```
  2e7dc666
16 10月, 2022 1 次提交
- Z
  
  add common subexpression elimination (#44386) · 166ff39a
  由 ZeKai Zhou 提交于 10月 16, 2022
  
  166ff39a
13 10月, 2022 2 次提交

Fix quantize model deploy bugs when using MKLDNN (#45920) · 561fd8c8

由 yeliang2258 提交于 10月 13, 2022

* fix immutable op quantize bugs

* fix

* fix build bug

* fix test

* notest,test=inference

* fix ppyoloe acc drop bugs

* fix test

* fix test

* add test

* fix

* fix

* fix test

* fix refined name bug

* fix test

* bias fix

* fix matmul weight dequant bug

* re-ci

* fix tester

* fix test

* fix tester

* update weight dequantize func

* update code

* update test for converage

* update test

* update cmake

* update cmakelist

* update code

* rerun ci

* remove useless code

561fd8c8

Add unsigned int8 scale propagation (#46378) · c72b3bfa

由 joanna.wozna.intel 提交于 10月 13, 2022

* Add unsigned int8 propagation

* Add or modify unit tests

* Correct concat scale checking

* Apply review suggestions

* Corrections

c72b3bfa

12 10月, 2022 1 次提交
- W
  
  remove all control_vars in IR graph (#46888) · bf1dc548
  由 weishengying 提交于 10月 12, 2022
  
  bf1dc548
11 10月, 2022 2 次提交
- S
  add logging to fc residual fuse pass (#46760) · 21668cb2
  由 Sylwester Fraczek 提交于 10月 11, 2022
```
* add logging to fc residual fuse pass

* expand logging message to fc residual fuse pass

* Add test for fc residual not fusing with activation
```
  21668cb2
- C
  Remove LoDTensor using in fluid (Part 1) (#46663) · 940d8f25
  由 Chen Weihang 提交于 10月 11, 2022
```
* remove using lodtensor part1

* polish history code format
```
  940d8f25
10 10月, 2022 3 次提交
- S
  Add fc residual pattern (#46757) · 0c789ae5
  由 Sylwester Fraczek 提交于 10月 10, 2022
```
* fix fc pattern

remove use_bias
add residual input switch
fix references to pattern

* review fixes
```
  0c789ae5
- S
  add function FindInputNameByVarName (#46759) · 8eaff62d
  由 Sylwester Fraczek 提交于 10月 10, 2022
```
* Add methods that find input or output name by var name

* kind of bugfix - initialize variables

* ci fix

* review fixed
```
  8eaff62d
- Z
  
  [Paddle-TRT] support new quant format from slim (#46022) · 7987a905
  由 zhoutianzi666 提交于 10月 10, 2022
  
  7987a905
30 9月, 2022 2 次提交
- A
  [IPU] paddle-inference support custom-ops (#45235) · a6b4bee3
  由 Allen Guo 提交于 9月 30, 2022
```
* paddle-inference support custom-ops
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>

* fix tolower
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
```
  a6b4bee3
- H
  
  change mkldnn kernel layout, ALL_LAYOUT->ONEDNN (#46629) · abee2210
  由 HongyuJia 提交于 9月 30, 2022
  
  abee2210
29 9月, 2022 1 次提交
- Y
  Remove calibration file path when deploy quantize model (#46283) · d71f1b3f
  由 yeliang2258 提交于 9月 29, 2022
```
* remove calibration file path

* remove useless code
```
  d71f1b3f
28 9月, 2022 3 次提交

Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e

由 Chen Weihang 提交于 9月 28, 2022

* remove needless using tensor

* remove needless using tensor

* resolve conflict

* replace tensor using

* fix format error

* revert needless changing

* fix rocm and npu compile error

* fix cinn compile error

* fix format error

* fix mkldnn format error

* fix mkldnn format error

* fix cinn compile error

* fix cinn compile error

* fix cinn compile error

* resolve conflict

e12a905e

R
Convert GradMergeAllReduceOpHandle in GraphToBlock (#46544) · 6a706e63
由 Ruibiao Chen 提交于 9月 28, 2022
```
* Convert GradMergeAllReduceOpHandle in GraphToBlock

* Set FLAGS_CONVERT_GRAPH_TO_PROGRAM to False
```
6a706e63
L

remove const qualifier in function return (#46546) · 8c5b9cf8
由 Leo Chen 提交于 9月 28, 2022

8c5b9cf8

27 9月, 2022 1 次提交
- W
  [Paddle Inference]support n lookup_tables fuse to embeddinglayernorm(3) (#46243) · 4d772144
  由 Wangzheee 提交于 9月 27, 2022
```
* [Paddle Inference]support n lookup_tables fuse to embeddinglayernorm(3)
```
  4d772144

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功