提交 · bdd3dde32322eee59d642d7d33169ffb382c78a6 · PaddlePaddle / Paddle

18 10月, 2022 1 次提交

[code-gen] Support code-gen for opmaker of sparse op (#46993) · bdd3dde3

由 zyfncg 提交于 10月 18, 2022

* support generating code of opmaker for backward op invoke forward op

* gsupport code-gen of opmaker for sparse op

* refind logic of choose phi kernrel

* fix complie budg

* fix code_gen bug

* fix bug

* fix kernel signature code-gen

* fix complie bug of VarType

* fix complie bug of VarType

* fix test_sparse_conv_op

* fix test_sparse_norm_op

bdd3dde3

17 10月, 2022 7 次提交

Add enable_partial_send_recv switch in pipeline_configs (#46992) · b9a2f29c

由 Ghost Screaming 提交于 10月 17, 2022

* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
is wrong.

* Support allow_partial switch, which can be configure in
pipeline_configs. If sent tensor are not the same from
different hosts, they shouldn't been sent partially and
then concated as a whole tensor.

* Change name allow_partial to enable_partial_send_recv.

* Add global variable _enable_partial_send_recv

b9a2f29c

H
Revert "add common subexpression elimination (#44386)" (#47062) · 7c6835ca
由 hong 提交于 10月 17, 2022
```
This reverts commit 166ff39a.
```
7c6835ca
Y
[PHI]Modify DataLayout's namespace from paddle::experimental to phi (#46869) · ec749398
由 YuanRisheng 提交于 10月 17, 2022
```
* namespace modify

* update by comment
```
ec749398

Layernorm shift partition enhance (#46816) · 9e08633c

由 Wang Bojun 提交于 10月 17, 2022

* first version of ln_s_p with s>0

* refine and UT

* pass opt draft

* pass opt

* code refine

* code-style

* bug fix

* fix ci test

* code style

9e08633c

J

fix for conv_bias_mkldnn_pass (#47037) · acbda3e4
由 jakpiase 提交于 10月 17, 2022

acbda3e4

skip ReplaceAllReduceOp in GraphtoBlock when nccl_ctxs_ is nullptr (#46911) · 2e7dc666

由 pangyoki 提交于 10月 17, 2022

* skip ReplaceAllReduceOp in GraphtoBlock when nccl_ctxs_ is nullptr

* update ut

* test_dist_allreduce_op failed

* fix test_dist_allreduce_op

* add ut

* fix nccl cpu compile

* fix

2e7dc666

H

fix typo error in operator.cc (#46995) · 328236d2
由 HongyuJia 提交于 10月 17, 2022

328236d2

16 10月, 2022 1 次提交
- Z
  
  add common subexpression elimination (#44386) · 166ff39a
  由 ZeKai Zhou 提交于 10月 16, 2022
  
  166ff39a
14 10月, 2022 1 次提交
- S
  
  Update distributed_strategy.proto (#46531) · fcdc6777
  由 Shijie 提交于 10月 14, 2022
  
  fcdc6777
13 10月, 2022 5 次提交

Fix quantize model deploy bugs when using MKLDNN (#45920) · 561fd8c8

由 yeliang2258 提交于 10月 13, 2022

* fix immutable op quantize bugs

* fix

* fix build bug

* fix test

* notest,test=inference

* fix ppyoloe acc drop bugs

* fix test

* fix test

* add test

* fix

* fix

* fix test

* fix refined name bug

* fix test

* bias fix

* fix matmul weight dequant bug

* re-ci

* fix tester

* fix test

* fix tester

* update weight dequantize func

* update code

* update test for converage

* update test

* update cmake

* update cmakelist

* update code

* rerun ci

* remove useless code

561fd8c8

Y

fix bugs (#46951) · 20335b7c
由 YuanRisheng 提交于 10月 13, 2022

20335b7c
L
[new-exec] remove variable scope, stage2 (#43936) · 1230a3f4
由 Leo Chen 提交于 10月 13, 2022
```
* remove class ScopeBase

* reopen test
```
1230a3f4

[Kernel Selection] Remove hard code of PADDLE_WITH_MKLDNN (#46606) · ef1c8759

由 HongyuJia 提交于 10月 13, 2022

* remove PADDLE_WITH_MKLDNN, test white_list=abs

* fix unique_ptr

* fix op.Type()

* remove TODO in kernel_dispatch.h

* remove IndicateVarDataType function, update white_list

* remove mkldnn hard code

* add comments

* fix ==

* update mkldnn_op_list

* delete hard code of OPs

* update mkldnn_op_list

* update mkldnn_op_list, remove interp

* add error check for ExecutionContext

* update mkldnn_op_list, remove transpose2_grad

* remove interpolate mkldnn

* remove fill_constant mkldnn

* opt HasAttr in DygraphExecutionContext

* deprecated commit, test mkldnn_white_list

* deprecated commit, test mkldnn_white_list

* deprecated commit, test mkldnn_black_list

* update mkldnn_op_list, add assert error op

* solve cudnn related op

* fix error

* add mkldnn fallback in phi_utils.cc

* remove mkldnn fallback in phi_utils.cc

* opt code implementation

* polish Copyright License

ef1c8759

Add unsigned int8 scale propagation (#46378) · c72b3bfa

由 joanna.wozna.intel 提交于 10月 13, 2022

* Add unsigned int8 propagation

* Add or modify unit tests

* Correct concat scale checking

* Apply review suggestions

* Corrections

c72b3bfa

12 10月, 2022 5 次提交
- S
  fix wz review (#46937) · cdc44a54
  由 sunli 提交于 10月 12, 2022
```
* fix wz review

* update code
```
  cdc44a54
- L
  clean code of interpretercore (#46891) · 5303b66b
  由 Leo Chen 提交于 10月 12, 2022
```
* refactor

* refine code
```
  5303b66b
- W
  
  remove all control_vars in IR graph (#46888) · bf1dc548
  由 weishengying 提交于 10月 12, 2022
  
  bf1dc548
- [Zero-Dim] support input 0D Tensor for some unary api (#45992) · 05c2b9ba
  由 zhouweiwei2014 提交于 10月 12, 2022
```
* [Zero-Dim] support input 0D Tensor for unary api

* fix CI
```
  05c2b9ba
- S
  Optimize sub graph detector (#45040) · 50ca5bda
  由 sunli 提交于 10月 12, 2022
```
* optimize cinn subgraph detector

* fix update subgraph

* add annotation
```
  50ca5bda
11 10月, 2022 4 次提交
- S
  add logging to fc residual fuse pass (#46760) · 21668cb2
  由 Sylwester Fraczek 提交于 10月 11, 2022
```
* add logging to fc residual fuse pass

* expand logging message to fc residual fuse pass

* Add test for fc residual not fusing with activation
```
  21668cb2
- A
  
  [CINN] support expand with static expand_times (#46776) · 09fae5cd
  由 Aganlengzi 提交于 10月 11, 2022
  
  09fae5cd
- C
  Remove LoDTensor using in fluid (Part 1) (#46663) · 940d8f25
  由 Chen Weihang 提交于 10月 11, 2022
```
* remove using lodtensor part1

* polish history code format
```
  940d8f25
- Z
  Fix some bugs hidden in build_cinn_pass. (#46843) · a19b082e
  由 Zhen Wang 提交于 10月 11, 2022
```
* Fix some bugs hidden in build_cinn_pass.

* Update codes about OpTransInfo.

* Only support for the static reshape/reshape2 op.
```
  a19b082e
10 10月, 2022 5 次提交

[PHI]Add RNN yaml (#46812) · ab60fd8b

由 YuanRisheng 提交于 10月 10, 2022

* add yaml entry for rnn and rrnn_grad, move infershape function for rnn_grad to phi infer meta

* WIP: move rnn kernrl to phi

* Change the code generation to avoid converting from intializer list to tuple of heterogeneous types.
This is only triggered when an api has intermediate outputs, and the result of the outputs are of heterogeneous types.

* fix the bug that when none in a vector of tensors requires gradient, the conversion to InferShapeContext to InferMetaContext (a.k.a. BuildInferMetaContext) produces errorous results.

* fix ci bugs

* fix ci bugs

* fix ci bugs

* modify code according comment
Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>

ab60fd8b

reduce time cost on atomic in interpretercore (#46688) · dd3d45de

由 Leo Chen 提交于 10月 10, 2022

* reduce time cost on atomic in interpretercore

* clear code of PrepareAtomic in interpretercore

* refine threadpool cache

dd3d45de

Add fc residual pattern (#46757) · 0c789ae5

由 Sylwester Fraczek 提交于 10月 10, 2022

* fix fc pattern

remove use_bias
add residual input switch
fix references to pattern

* review fixes

0c789ae5

add function FindInputNameByVarName (#46759) · 8eaff62d

由 Sylwester Fraczek 提交于 10月 10, 2022

* Add methods that find input or output name by var name

* kind of bugfix - initialize variables

* ci fix

* review fixed

8eaff62d

Z

[Paddle-TRT] support new quant format from slim (#46022) · 7987a905
由 zhoutianzi666 提交于 10月 10, 2022

7987a905

09 10月, 2022 2 次提交
- Z
  
  Update device_worker.cc (#46723) · 57cdde13
  由 zmxdream 提交于 10月 09, 2022
  
  57cdde13
- Z
  
  interpretercore thread not always spin (#46687) · 2e217dbb
  由 zhangbo9674 提交于 10月 09, 2022
  
  2e217dbb
08 10月, 2022 1 次提交
- H
  
  fix typo (#46680) · 6e9bb9f9
  由 HongyuJia 提交于 10月 08, 2022
  
  6e9bb9f9
03 10月, 2022 1 次提交
- J
  Requantize to use Memory Desc in Tensors (#46608) · a579e523
  由 Jacek Czaja 提交于 10月 03, 2022
```
* - some more MD changes

* - lint

* - compilation fixes

* - compilation fixes

* - lint

* - fix
```
  a579e523
30 9月, 2022 4 次提交
- R
  
  Release memory cache after build_op_func_list in interpretercore (#46670) · 255890ff
  由 Ruibiao Chen 提交于 9月 30, 2022
  
  255890ff
- A
  [IPU] paddle-inference support custom-ops (#45235) · a6b4bee3
  由 Allen Guo 提交于 9月 30, 2022
```
* paddle-inference support custom-ops
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>

* fix tolower
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
```
  a6b4bee3
- Y
  fix bugs of tipc, test=kunlun (#46540) · d16360c8
  由 ykkk2333 提交于 9月 30, 2022
```
* migrate sigmoid with cross entropy, and tile xpu kernels to phi, test=kunlun

* migrate add_n kernep to phi, test=kunlun

* fix bugs of tipc, test=kunlun
```
  d16360c8
- H
  
  change mkldnn kernel layout, ALL_LAYOUT->ONEDNN (#46629) · abee2210
  由 HongyuJia 提交于 9月 30, 2022
  
  abee2210
29 9月, 2022 2 次提交
- Z
  [GPUPS]add afs OpenWriter (#46611) · c7d60ce4
  由 zmxdream 提交于 9月 29, 2022
```
* add afs OpenWriter

* update
```
  c7d60ce4
- Y
  Remove calibration file path when deploy quantize model (#46283) · d71f1b3f
  由 yeliang2258 提交于 9月 29, 2022
```
* remove calibration file path

* remove useless code
```
  d71f1b3f
28 9月, 2022 1 次提交

Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e

由 Chen Weihang 提交于 9月 28, 2022

* remove needless using tensor

* remove needless using tensor

* resolve conflict

* replace tensor using

* fix format error

* revert needless changing

* fix rocm and npu compile error

* fix cinn compile error

* fix format error

* fix mkldnn format error

* fix mkldnn format error

* fix cinn compile error

* fix cinn compile error

* fix cinn compile error

* resolve conflict

e12a905e

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功