提交 · e15ef948bf4fb94767ccfbc4a05039f908e3ba08 · PaddlePaddle / Paddle

10 2月, 2023 1 次提交
- W
  
  [XPU] bind op: atan & deformable_conv_v1 (#50373) · e15ef948
  由 wangshengxiang 提交于 2月 10, 2023
  
  e15ef948
09 2月, 2023 23 次提交

Z
[trt][inference]support int64 shapetensor as engine input (#50170) · 14a92c8c
由 Zhang Jun 提交于 2月 09, 2023
```
* update

* support int64 shape tensor as engine input

* add inference_predictor ut
```
14a92c8c
L

Modify full kernel for xpu. test=kunlun (#50209) · 18e0e01d
由 Leo Guo 提交于 2月 09, 2023

18e0e01d
R
[kunlun] support async send/recv via group (#50329) · 350cd82a
由 Roc 提交于 2月 09, 2023
```
Co-authored-by: Nzhangxiaoci <zhangxiaoci@baidu.com>
```
350cd82a
X

consider grad_op exist in forward program. (#50321) · 3862f347
由 xiongkun 提交于 2月 09, 2023

3862f347
J
Adjust mkldnn_placement_pass to check library type and data type (#49899) · ebdf3ef9
由 joanna.wozna.intel 提交于 2月 09, 2023
```
* Adjust mkldnn_placement_pass to check library type and data type

* Check if var has inputs

* Remove unrelated test

* Refactor
```
ebdf3ef9

remove paddle.fluid.dygraph.parallel.ParallelEnv (#50157) · 9dd1f4bf

由 zqw_1997 提交于 2月 09, 2023

* remove dygraph.parallel.ParallelEnv

* logger.py error: AttributeError: module 'paddle' has no attribute 'distributed'

* move the implenmentation to the root folder

* logger.py import ParallelEnv from paddle.parallel to avoid circular import

* add the comment of why import ParallelEnv from paddle.parallel in logger.py and remove the api interface in the paddle/parallel.py

* outdated Env and note removed

* decouple the logger.py and ParallelEnv

* remove another ref of parallel in init.py

9dd1f4bf

[PHI decoupling] move strided_memcpy.h to phi (#50346) · 17318c1a

由 Huang Jiyi 提交于 2月 09, 2023

* decouple strided_memcpy

* move strided_memcpy

* move strided_memcpy to phi

* fix namespace

* update

* fix gpu compile bugs

17318c1a

H

remove layout_utils in phi (#50355) · 90650534
由 Huang Jiyi 提交于 2月 09, 2023

90650534

Add MultiTenosrAdam OP (#49220) · 10654c77

由 yuehuayingxueluo 提交于 2月 09, 2023

* add multi_tenosr_adam

* update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py

* fix adam.py optimizer.py

* fix adamw.py

* fix test_multi_tensor_adam.py

* fix CI bug

* fix CI coverage

* fix ci bug

* fix betapow

* fix some bugs

* fix test_adamw_op.py

* fix CI coverage

* fix multi_tensor_adam_kernel.cc

* fix CI bug

* fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py

* fix code style

* update C++ parts

* remove python parts modification temporarily

* add C++ ut

* update betapow copy code logic

* fix ci ut

* fix windows ci

* fix coverage ci

* improve coverage rate

---------
Co-authored-by: Nsneaxiy <sneaxiy@126.com>

10654c77

Y
[audio] fix doc typo (#50343) · d676d552
由 YangZhou 提交于 2月 09, 2023
```
* fix typo

* add sox_io in audio test

* fix

* fix
```
d676d552
L

fix gc bug and start interceptor (#50344) · 5d5cb256
由 LiYuRio 提交于 2月 09, 2023

5d5cb256

Fix bugs in pass_base.py (#50136) · 5cae5fdd

由 yuehuayingxueluo 提交于 2月 09, 2023

* fix the processing order of passes in pass_base.py

* fix processing order

* add _PASS_PROCESS_ORDER_LIST

* delete some pass in _PASS_PROCESS_ORDER_LIST

* add assert in pass_base.py

* remove fuse_optimizer

* add _fusion_opt_list_rule

* add test_pass_base_list.py

* fix some bug

* add fused_attention

* add some passes to list

* fix ci bug

* fix ci bug

5cae5fdd

W

[rm fluid] for the non distribution (#50313) · 7edfac9e
由 wangzhen38 提交于 2月 09, 2023

7edfac9e

[Paddle-TRT] GroupNorm int8 nchw32 fake kernel (#50146) · d93c63a0

由 zhoutianzi666 提交于 2月 09, 2023

* add fmha_flashattention oss plugin

* add fmhca

* add oss fmhca

* code reconstruct and add ut

* code style refine

* fix ut and enforce check

* refine trt version check

refine compile

fix compile

* fix cross ut

* code refine

* use runtime trt version check

* bug fix and code refine

* compile fix

* merge develop

* add GN QDQ kernel

* support GN int8 fake kernel

* add with_int8

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8  UT

* add verison > 8000  in GN int8  UT

* add some check in .cu

* add stdlib.h in UT

* little change  in .cu

* remove rand_r use rand

* remove use rand

* setAxis(1)

* when int8 is on allow fall back to fp16

---------
Co-authored-by: Nwwbitejotunn <wang_bojun@outlook.com>

d93c63a0

W

clean communicator (#50339) · d9b70950
由 wangxiaoning 提交于 2月 09, 2023

d9b70950
K
[BugFix][ConditionalBlock] fix judgement about scope validation (#50086) · 61f9f136
由 kangguangli 提交于 2月 09, 2023
```
* fix judgement about scope validation

* fix ci bug: same address is not enough for data consistency

* remove useless check
```
61f9f136
P

Fix pscore test (#50349) · fe811625
由 pangengzheng 提交于 2月 09, 2023

fe811625
J

fix bn composite error shape (#50338) · e389f2fc
由 Jiabin Yang 提交于 2月 09, 2023

e389f2fc

[IR] Type system stage1: add class TypeId, class AbstractType, class TypeStorage (#50242) · f11c913e

由 zhangbo9674 提交于 2月 09, 2023

* add TypeID

* Specification comment code

* refine code

* add AbstractType

* add TypeStorage

* fix unittest bug

* change dir

* change dir

* refine code

* fix bug

* Refine code by comment

* delete unused code

* normative naming rules

* refine code by comment

* refine doc

* refine codestyle

f11c913e

W

fixoptminizer _set_auxiliary_var bug (#50335) · c44005f0
由 wanghuancoder 提交于 2月 09, 2023

c44005f0
Z

add logical_and, logical_or and logical_xor for xpu (#50228) · 0036316e
由 zhangyikun02 提交于 2月 09, 2023

0036316e
W
[TRT] Transpose layernorm fusion with different input format (#50082) · b2bb7ec9
由 Wang Bojun 提交于 2月 09, 2023
```
* trans_layernorm
```
b2bb7ec9
傅

fix set_value_65965 (#50340) · b3f60f39
由傅剑寒提交于 2月 09, 2023

b3f60f39

08 2月, 2023 16 次提交
- P
  fuse quantize+transpose and transpose+dequantize (#49509) · 197a4ffe
  由 Paulina Gacek 提交于 2月 08, 2023
```
* QuantTranpose pattern is being found by pass

* quant + transpose fuse

* code style changes

* UT written, reorder fixed

* Dequantize + transpose2 fuse  added

* pass name changed

* UT added & shift corrected

* got rid of redundancy

* review changes

* AsIntermediate corrected

* compat added
```
  197a4ffe
- S
  Add bf16 support for fused matmul (#50254) · b47923b4
  由 Sławomir Siwek 提交于 2月 08, 2023
```
* add support for bf16 fused_ops

* fused_matmul only
```
  b47923b4
- W
  [code style]fix cpplint codestyle (#50314) · 209d534d
  由 wangxiaoning 提交于 2月 08, 2023
```
* fix codestyle

* fix std
```
  209d534d
- Z
  [inference][trt] Disable ShapeTensor for nearest_interp_v2 when trt version < 8.2 (#50258) · fa284076
  由 Zhang Jun 提交于 2月 08, 2023
```
* update

* update

* format code

* update

* Update test_trt_convert_nearest_interp_v2.py
```
  fa284076
- Y
  
  Fused attention pass mp support (#50320) · e44ff495
  由 Yuang Liu 提交于 2月 08, 2023
  
  e44ff495
- Z
  [pglbox]hidden unzip (#50292) · a7539508
  由 zmxdream 提交于 2月 08, 2023
```
* hidden unzip

* fix

* fix
```
  a7539508
- W
  
  Export custom operator-related function symbols (#50238) · f9c801ff
  由 weishengying 提交于 2月 08, 2023
  
  f9c801ff
- H
  
  Use inference, save construct time (#50163) · 7a82b6de
  由 HongyuJia 提交于 2月 08, 2023
  
  7a82b6de
- T
  
  test=document_fix (#50323) · cd6ebca6
  由 tianshuo78520a 提交于 2月 08, 2023
  
  cd6ebca6
- T
  
  test=document_fix (#50322) · be2a1d94
  由 tianshuo78520a 提交于 2月 08, 2023
  
  be2a1d94
- Z
  Fix bn performance degradation (#50287) · 6f1ec935
  由 zhangkaihuo 提交于 2月 08, 2023
```
* fix bn performance degradation
```
  6f1ec935
- C
  [feature] use prim flag in shell (#50309) · 8c14b02b
  由 cyber-pioneer 提交于 2月 08, 2023
```
* add flag

* change flag

* use prim flag

* fix code

* fix softmax prim flag

* set case timeout
```
  8c14b02b
- L
  
  Optimize gc in executor (#50301) · 9268f392
  由 LiYuRio 提交于 2月 08, 2023
  
  9268f392
- H
  [Tensor Support unsigned] Tensor::data() supports unsigned int and bfloat16 (#50257) · 80dc81c5
  由 HongyuJia 提交于 2月 08, 2023
```
* support unsigned int and bfloat16

* update unit test

* update DenseTensor datatype

* unsupport more datatype of mutable_data(Place)

* fix unittest
```
  80dc81c5
- R
  
  fix gcc12 error (#48176) · ffa2ec19
  由 risemeup1 提交于 2月 08, 2023
  
  ffa2ec19
- R
  Add eager jit dir (#49369) · 32804fe8
  由 risemeup1 提交于 2月 08, 2023
```
* add_eager_and_jit,test=ljd_test

* test

* test,test=ljd_test

* test,test=ljd_test

* test,test=ljd_test

* test,test=ljd_test

* test,test=ljd_test

* test,test=ljd_test

* add_eager_jit_dir,test=ljd_test

* fix conflict,test=ljd_test

* test,test=ljd_test

* get new precise_map,test=ljd_test

* test
```
  32804fe8

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功