Commits · 62fe3cf5305d66524d08969cb7d896ee20ac3ef4 · PaddlePaddle / Paddle

10 Feb, 2023 7 commits
- Fix Python IndexError of Case14: paddle.nn.functional.glu (#50016) · 62fe3cf5
  Committed by LoneRanger on Feb 10, 2023
```
* Add a range check on the dimension argument for split

* Validate the axis value for glu and add unit tests

* Improve the unit tests for glu

* fix glu
```
  62fe3cf5
- [Dy2St]Fix func.__self__ problem in FunctionSpec (#50404) · 3374600e
  Committed by Aurelius84 on Feb 10, 2023
  
  3374600e
- [Zero-Dim] support input 0D Tensor for std/var (#49735) · 86cc694f
  Committed by mhy-666 on Feb 10, 2023
```
* add test_std

* add test_var

* fix std/var assertequal

* fix std/var assertequal

* fix std/var assertequal

* -madd api name to reduce_api

* fix

* fix var

* fix

* fix

* fix stat

* fix unitest

* fix stat/var

* fix stat/var, unittest

* fix stat/std, unittest

* add unittest of var,std, fix stat/var,std

* fix stat/var, unittest

* fix

* fix unittest

* fix

* fix

* fix

* fix unittest
```
  86cc694f
- fix default_attr=nullptr bug (#50383) · efef3035
  Committed by HongyuJia on Feb 10, 2023
  
  efef3035
- [phi decoupling] remove AllocatorFacade in phi (#50380) · d1bfb4b7
  Committed by Huang Jiyi on Feb 10, 2023
```
* remove AllocatorFacade in phi

* fix include

* fix bugs
```
  d1bfb4b7
- [phi decoupling] rm gradient_accumulator in phi (#50385) · 13f57ec0
  Committed by Huang Jiyi on Feb 10, 2023
```
* rm gradient_accumulator in phi

* update
```
  13f57ec0
- [XPU] bind op: atan & deformable_conv_v1 (#50373) · e15ef948
  Committed by wangshengxiang on Feb 10, 2023
  
  e15ef948
09 Feb, 2023 23 commits

[trt][inference]support int64 shapetensor as engine input (#50170) · 14a92c8c
Committed by Zhang Jun on Feb 09, 2023
```
* update

* support int64 shape tensor as engine input

* add inference_predictor ut
```
14a92c8c

Modify full kernel for xpu. test=kunlun (#50209) · 18e0e01d
Committed by Leo Guo on Feb 09, 2023

18e0e01d
[kunlun] support async send/recv via group (#50329) · 350cd82a
Committed by Roc on Feb 09, 2023
```
Co-authored-by: zhangxiaoci <zhangxiaoci@baidu.com>
```
350cd82a

consider grad_op exist in forward program. (#50321) · 3862f347
Committed by xiongkun on Feb 09, 2023

3862f347
Adjust mkldnn_placement_pass to check library type and data type (#49899) · ebdf3ef9
Committed by joanna.wozna.intel on Feb 09, 2023
```
* Adjust mkldnn_placement_pass to check library type and data type

* Check if var has inputs

* Remove unrelated test

* Refactor
```
ebdf3ef9

remove paddle.fluid.dygraph.parallel.ParallelEnv (#50157) · 9dd1f4bf

Committed by zqw_1997 on Feb 09, 2023

* remove dygraph.parallel.ParallelEnv

* logger.py error: AttributeError: module 'paddle' has no attribute 'distributed'

* move the implementation to the root folder

* logger.py import ParallelEnv from paddle.parallel to avoid circular import

* add the comment of why import ParallelEnv from paddle.parallel in logger.py and remove the api interface in the paddle/parallel.py

* outdated Env and note removed

* decouple the logger.py and ParallelEnv

* remove another ref of parallel in init.py

9dd1f4bf

[PHI decoupling] move strided_memcpy.h to phi (#50346) · 17318c1a

Committed by Huang Jiyi on Feb 09, 2023

* decouple strided_memcpy

* move strided_memcpy

* move strided_memcpy to phi

* fix namespace

* update

* fix gpu compile bugs

17318c1a

remove layout_utils in phi (#50355) · 90650534
Committed by Huang Jiyi on Feb 09, 2023

90650534

Add MultiTenosrAdam OP (#49220) · 10654c77

Committed by yuehuayingxueluo on Feb 09, 2023

* add multi_tenosr_adam

* update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py

* fix adam.py optimizer.py

* fix adamw.py

* fix test_multi_tensor_adam.py

* fix CI bug

* fix CI coverage

* fix ci bug

* fix betapow

* fix some bugs

* fix test_adamw_op.py

* fix CI coverage

* fix multi_tensor_adam_kernel.cc

* fix CI bug

* fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py

* fix code style

* update C++ parts

* remove python parts modification temporarily

* add C++ ut

* update betapow copy code logic

* fix ci ut

* fix windows ci

* fix coverage ci

* improve coverage rate

---------
Co-authored-by: sneaxiy <sneaxiy@126.com>

10654c77

[audio] fix doc typo (#50343) · d676d552
Committed by YangZhou on Feb 09, 2023
```
* fix typo

* add sox_io in audio test

* fix

* fix
```
d676d552

fix gc bug and start interceptor (#50344) · 5d5cb256
Committed by LiYuRio on Feb 09, 2023

5d5cb256

Fix bugs in pass_base.py (#50136) · 5cae5fdd

Committed by yuehuayingxueluo on Feb 09, 2023

* fix the processing order of passes in pass_base.py

* fix processing order

* add _PASS_PROCESS_ORDER_LIST

* delete some pass in _PASS_PROCESS_ORDER_LIST

* add assert in pass_base.py

* remove fuse_optimizer

* add _fusion_opt_list_rule

* add test_pass_base_list.py

* fix some bug

* add fused_attention

* add some passes to list

* fix ci bug

* fix ci bug

5cae5fdd

[rm fluid] for the non distribution (#50313) · 7edfac9e
Committed by wangzhen38 on Feb 09, 2023

7edfac9e

[Paddle-TRT] GroupNorm int8 nchw32 fake kernel (#50146) · d93c63a0

Committed by zhoutianzi666 on Feb 09, 2023

* add fmha_flashattention oss plugin

* add fmhca

* add oss fmhca

* code reconstruct and add ut

* code style refine

* fix ut and enforce check

* refine trt version check

refine compile

fix compile

* fix cross ut

* code refine

* use runtime trt version check

* bug fix and code refine

* compile fix

* merge develop

* add GN QDQ kernel

* support GN int8 fake kernel

* add with_int8

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 fake kernel

* add GN int8 UT

* add version > 8000 in GN int8 UT

* add some check in .cu

* add stdlib.h in UT

* little change  in .cu

* remove rand_r use rand

* remove use rand

* setAxis(1)

* when int8 is on allow fall back to fp16

---------
Co-authored-by: wwbitejotunn <wang_bojun@outlook.com>

d93c63a0

clean communicator (#50339) · d9b70950
Committed by wangxiaoning on Feb 09, 2023

d9b70950
[BugFix][ConditionalBlock] fix judgement about scope validation (#50086) · 61f9f136
Committed by kangguangli on Feb 09, 2023
```
* fix judgement about scope validation

* fix ci bug: same address is not enough for data consistency

* remove useless check
```
61f9f136

Fix pscore test (#50349) · fe811625
Committed by pangengzheng on Feb 09, 2023

fe811625

fix bn composite error shape (#50338) · e389f2fc
Committed by Jiabin Yang on Feb 09, 2023

e389f2fc

[IR] Type system stage1: add class TypeId, class AbstractType, class TypeStorage (#50242) · f11c913e

Committed by zhangbo9674 on Feb 09, 2023

* add TypeID

* Specification comment code

* refine code

* add AbstractType

* add TypeStorage

* fix unittest bug

* change dir

* change dir

* refine code

* fix bug

* Refine code by comment

* delete unused code

* normative naming rules

* refine code by comment

* refine doc

* refine codestyle

f11c913e

fixoptminizer _set_auxiliary_var bug (#50335) · c44005f0
Committed by wanghuancoder on Feb 09, 2023

c44005f0

add logical_and, logical_or and logical_xor for xpu (#50228) · 0036316e
Committed by zhangyikun02 on Feb 09, 2023

0036316e
[TRT] Transpose layernorm fusion with different input format (#50082) · b2bb7ec9
Committed by Wang Bojun on Feb 09, 2023
```
* trans_layernorm
```
b2bb7ec9

fix set_value_65965 (#50340) · b3f60f39
Committed by 傅剑寒 on Feb 09, 2023

b3f60f39

08 Feb, 2023 10 commits
- fuse quantize+transpose and transpose+dequantize (#49509) · 197a4ffe
  Committed by Paulina Gacek on Feb 08, 2023
```
* QuantTranpose pattern is being found by pass

* quant + transpose fuse

* code style changes

* UT written, reorder fixed

* Dequantize + transpose2 fuse  added

* pass name changed

* UT added & shift corrected

* got rid of redundancy

* review changes

* AsIntermediate corrected

* compat added
```
  197a4ffe
- Add bf16 support for fused matmul (#50254) · b47923b4
  Committed by Sławomir Siwek on Feb 08, 2023
```
* add support for bf16 fused_ops

* fused_matmul only
```
  b47923b4
- [code style]fix cpplint codestyle (#50314) · 209d534d
  Committed by wangxiaoning on Feb 08, 2023
```
* fix codestyle

* fix std
```
  209d534d
- [inference][trt] Disable ShapeTensor for nearest_interp_v2 when trt version < 8.2 (#50258) · fa284076
  Committed by Zhang Jun on Feb 08, 2023
```
* update

* update

* format code

* update

* Update test_trt_convert_nearest_interp_v2.py
```
  fa284076
- Fused attention pass mp support (#50320) · e44ff495
  Committed by Yuang Liu on Feb 08, 2023
  
  e44ff495
- [pglbox]hidden unzip (#50292) · a7539508
  Committed by zmxdream on Feb 08, 2023
```
* hidden unzip

* fix

* fix
```
  a7539508
- Export custom operator-related function symbols (#50238) · f9c801ff
  Committed by weishengying on Feb 08, 2023
  
  f9c801ff
- Use inference, save construct time (#50163) · 7a82b6de
  Committed by HongyuJia on Feb 08, 2023
  
  7a82b6de
- test=document_fix (#50323) · cd6ebca6
  Committed by tianshuo78520a on Feb 08, 2023
  
  cd6ebca6
- test=document_fix (#50322) · be2a1d94
  Committed by tianshuo78520a on Feb 08, 2023
  
  be2a1d94

PaddlePaddle / Paddle synced successfully about 1 year ago
