Commits · d03ef0541959391e7414d6d8780f6248383fef18 · PaddlePaddle / Paddle

22 Aug, 2022 6 commits
- Extend conv_concat_relu to support all activations (#45089) · d03ef054
  Committed by Sławomir Siwek on Aug 22, 2022
```
* merge conv_concat_relu to conv_act

* fix typo

* extend unit test

* reuse existing gpd

* codestyle

* enforce mkldnn conv
```
- [Paddle-TRT] support output_padding in conv2d_transpose and conv3d_transpose (#45004) · 25d25b00
  Committed by zhoutianzi666 on Aug 22, 2022
- [Eager] some python c api use final state (#45221) · d2ef888b
  Committed by wanghuancoder on Aug 22, 2022
```
some python c api use final state
```
- remove trt_skip_layernorm_fuse_pass from gpu passes (#45293) · 25d58db6
  Committed by Yuanle Liu on Aug 22, 2022
- [jit] add jit layer function default constructor (#45169) · e3574f72
  Committed by Hui Zhang on Aug 22, 2022
```
* fix jit layer function

* fix comment

* fix comment
```
- [CustomDevice] fix custom ccl (#45276) · 307ad60d
  Committed by ronnywang on Aug 22, 2022
20 Aug, 2022 2 commits
- [Eager] pylayer detach output tensor if it is equal with input (#45065) · bba13e21
  Committed by wanghuancoder on Aug 20, 2022
```
* pylayer detach output tensor if it is equal with input

* pylayer detach output tensor if it is equal with input
```
- 【autograd】add max_p primitive operator for new autograd (#45178) · 197f4048
  Committed by Sing_chan on Aug 20, 2022
```
* add max_p without test

* add test of max_p

* make max_p consistent with paddle.maximum
```
19 Aug, 2022 9 commits

fix layernormTrt meanVar alloc bug (#45255) · 6fb34e74
Committed by Wang Bojun on Aug 19, 2022
```
* fix layernormTrt meanVar alloc bug
```

polish REGISTER_OPERATOR parameter of fill_any (#45263) · 1c4134f6
Committed by HongyuJia on Aug 19, 2022

Fix random op dependency and lr_shedule bugs for standalone executor (#45265) · 6d4ae007
Committed by Ruibiao Chen on Aug 19, 2022
```
* Fix random op depenency and lr_shedule bugs for standalone executor

* Fix CI errors

* Fix CI errors

* Fix CI errors
```

Trt groupnorm dynamic plugin (#44911) · 1aa6adb1
Committed by Wang Bojun on Aug 19, 2022
```
* add group_norm dyanmic plugin
```

[XPU] c_allreduce support int. update bkcl to 1.0.5. test=kunlun (#45248) · 9f1f1b0a
Committed by houj04 on Aug 19, 2022

Make up beam_search_decode operator test cases on xpu and cpu environment (#45264) · 3deab77f

Committed by mengqingchun02 on Aug 19, 2022

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* make up beam_search_decode operator test cases on xpu and cpu environment. test=kunlun

[XPU] add merged_momentum unittest and change momentum (#45241) · e0f1c9f2

Committed by dongfangshenzhu on Aug 19, 2022

* add merged_momentum *test=kunlun

* add merged_momentum *test=kunlun

* add fp16 to merged_momentum,*test=kunlun

* change dist_model.cc

* add merged_momentum unittest and  change momentum,test=kunlun

* add merged_momentum unittest and  change momentum,test=kunlun

* add merged_momentum unittest and  change momentum,test=kunlun

* add merged_momentum unittest and  change momentum,test=kunlun

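fix some auto code generation bugs (#45232) · 9556c688

Committed by Charles-hit on Aug 19, 2022

* Fix an index-out-of-range error that occurred when generating dygraph code for an output with no configured name.

* decide forward_return[0] is not none

* In the backward yaml, when the forward has only one output and no name is configured for it, auto-generate the output name as out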

* modify code style

Support beam search decode op in XPU environment (#44917) · adaffb7b

Committed by mengqingchun02 on Aug 19, 2022

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* fix beam_search operator bugs on xpu. test=kunlun

* fix beam_search operator bugs on xpu. test=kunlun

* fix beam_search operator bugs on xpu. test=kunlun

* fix beam_search operator bugs on xpu. test=kunlun

* support beam_search_decode operator on xpu. test=kunlun

* support beam_search_decode operator on xpu. test=kunlun

* support beam_search_decode operator on xpu. test=kunlun

* support beam_search_decode operator on xpu. test=kunlun

* support beam_search_decode operator on xpu. test=kunlun

* support beam_search_decode operator on xpu. test=kunlun

* support beam_search_decode operator on xpu. test=kunlun

* support beam_search_decode operator on xpu. test=kunlun

* support beam_search_decode operator on xpu. test=kunlun

* support beam_search_decode operator on xpu. test=kunlun

* support beam_search_decode operator on xpu. test=kunlun

* support beam_search_decode operator on xpu. test=kunlun

* support beam_search_decode operator on xpu. test=kunlun

* support beam_search_decode operator on xpu. test=kunlun

* support beam_search_decode operator on xpu. test=kunlun

18 Aug, 2022 7 commits

[inference]predictor add GetInputType interface (#45143) · a8ae87f1

Committed by heliqi on Aug 18, 2022

* predictor add GetInputType interface

* predictor change GetInputType to GetInputTypes

* predictor add tester

* predictor add tester

* predictor change GetInputType to GetInputTypes

* predictor change GetInputType to GetInputTypes

* predictor add tester

[Eager] Add get_tensor_from_selected_rows (#45227) · d257acc6

Committed by Weilong Wu on Aug 18, 2022

* [Eager] add get_tensor_from_selected_rows

* add PADDLE_ENFORCE to check SelectedRows

* use _ prefix in temp

fix typo of pybind.cc (#45239) · 41294cb5
Committed by OccupyMars2025 on Aug 18, 2022


apply buffer_shared_inplace_pass and inplace_addto_op_pass pass to program in... · d8d124b6

Committed by pangyoki on Aug 18, 2022

apply buffer_shared_inplace_pass and inplace_addto_op_pass pass to program in Standalone Executor (#45085)

* apply inplace addto in python apply_pass

* fix

* apply inplace pass for program

* skip feed and fetch var

* fix block_desc.move_from

* fix block desc

* alltoall remove inplace

* fix

[OpAttr]Squeeze axes support Tensor (#45189) · c93451f4
Committed by Aurelius84 on Aug 18, 2022
```
* [OpAttr]Squeeze axes support Tensor

* add support_tensor

* fix unittest

* fix coverage
```

change to async mode for xpu multi-card training in static graph mode, test=kunlun (#45024) · 41bdf41d

Committed by zhangxiaoci on Aug 18, 2022

* change to async mode for xpu multi-card training in static graph mode

* minor bugfix

* irrelevant. move to another pr

* move change to other pr

* fix stream issue

* fix 'stream not meet with current context' error

* fix branch diverge, test=kunlun

fix infer tans scope (#45203) · 2d0bb2c3

Committed by JingZhuangzhuang on Aug 18, 2022

* fix infer tans scop

* fix infer trans scope

* fic infer trans scope

* fic infer trans scope
Co-authored-by: Ndingjiawei <327396238@qq.com>

17 Aug, 2022 10 commits
- refine eager_gen for amp (#45211) · e31a0a50
  Committed by zyfncg on Aug 17, 2022
- add dependency of phi_enforce (#45191) · aa96f70e
  Committed by Sing_chan on Aug 17, 2022
- [OpAttr]Add SupportTensor for OpMaker with whitelist mechanism (#45084) · 2594935a
  Committed by Aurelius84 on Aug 17, 2022
```
* [OpAttr]Add SupportTensor for OpMaker

* fix typo

* fix code style

* add SupportTensor for concat op

* add unittest for register Tensor

* add shape checker and split attribute
```
- fix multi stream error. (#45196) · a79d4a75
  Committed by Wilber on Aug 17, 2022
```
* fix multi stream error.
```
- Reuse addKernel to replace TensorAdd (#45161) · 0e3b49d4
  Committed by Leo Chen on Aug 17, 2022
```
* use addKernel

* fix compile

* remove elementwiseAddto

* add return

* fix custom place
```
- fix:op version (#45192) · d0cd0a11
  Committed by feng_shuai on Aug 17, 2022
- [Eager]fix_stop_gradient (#45154) · cccba68c
  Committed by wanghuancoder on Aug 17, 2022
```
* fix_stop_gradient
```
- [MLU] fix copy error (#45194) · 75690584
  Committed by fwenguang on Aug 17, 2022
- add instance norm op for xpu (#45097) · 216d25ac
  Committed by ykkk2333 on Aug 17, 2022
```
* xpu unittest grad compute supports more types, *test=kunlun

* add instance norm xpu, *test=kunlun
```
- Fix squared_l2_norm wrong stream bug (#45174) · 951010a2
  Committed by sneaxiy on Aug 17, 2022
```
* fix squared_l2_norm bug

* update buffer.h
```
16 Aug, 2022 6 commits

[Phi] Move amp ops into phi (#45079) · b4f67757

Committed by Chen Weihang on Aug 16, 2022

* move check finite and unscale kernel into phi

* move infershape into phi

* move update_loss_scaling kernel into phi

* remove original kernels

* move update loss scaling infershape into phi

* add header for xpu and npu

* solve coverage failed

* fix npu test failed

* remove mutable data in cu file

* fix new executor failed

* add valid check for meta tensor output

[Eager] Forword only add dygraph func (#45153) · 933db9d4

Committed by Weilong Wu on Aug 16, 2022

* [Eager draft] forward_only interface migrate to autograd_api

* strings api add dygraph forward function

* rm useless comments

* draft version for check CI

* fix ci

* forward-only no need compute_require_grad and pass stop_gradient, rm useless comments

* polish yaml and using CPUPlace = phi::CPUPlace

* rm useless comments

* polish yaml and update some test case

* rm useless funcs

* polish eager_gen code

* polish code

convert multihead to oss (#45019) · f706d95d

Committed by feng_shuai on Aug 16, 2022

* convert multihead to oss

* fix:bug

* fix:delete const cast

* fix:don't support bias_qk

* add vit pass

* fix:convert bug and add preln_residual_bias

* support length=-1

* add UT for convert

* add no_bias_qk support for gpu_multihead_op

* delete infer_shape depends on bias_qk

* oss just can be used in T4 and A*

* fix:change api for ROCM CI

support fp16 softmax on custom place (#45177) · a0bbfbd4
Committed by Aganlengzi on Aug 16, 2022

Fix problem that the shape of tensor is not inited correctly when backward in static graph (#45030) · e26f80ad
Committed by feifei-111 on Aug 16, 2022
```
* fix_shape

* code style

* fix assert

* fix to_tensor badreturn
```

fix new quant (#45155) · 2fb65e44
Committed by Wangzheee on Aug 16, 2022

PaddlePaddle / Paddle synced successfully about 1 year ago