提交 · 60ee518aa9232929916f3671110f79eb944f8967 · BaiXuePrincess / Paddle

17 1月, 2023 3 次提交

disable scatter zero_dim test (#49853) · 86fa1715
由 zhouweiwei2014 提交于 1月 17, 2023

86fa1715
W
[Dy2St]Support call backward() without params in dy2st (#49812) · 2f24b2d8
由 WangZhen 提交于 1月 17, 2023
```
* Support call backward() without params in dy2st
```
2f24b2d8

【Prim】Add multiply,expand,div vjp rules (#49831) · 39c6765a

由 Xiaoxu Chen 提交于 1月 17, 2023

* support elementwise base func

* fix compiling error and add test

* support vjp for div using comp

* remove additional change

* fix dy2st error with magic num

* fix dy magic num

* another magic

* another magic

* another magic

* add skip rename strategy

* support add vjp

* support add with new axis cal

* support sub vjp

* [prim] add multiply vjp rules

* [prim] add multiply vjp rules

* [prim] fix no infershape with composite in _append_backward_ops

* [prim] add expand vjp rule

* [prim] add exp vjp rule

* uncomment infer shape for reshape/sum static prim api

* [prim] fix tanh nullptr error

* remove some print message

* fix magic number in run_program relative tests @JiaBinYang

* [prim] add expand,multiply,exp vjp rules

* fix only support single direction reduce error

* infer reduce dims using out dims
Co-authored-by: NJiabinYang <360788950@qq.com>

39c6765a

16 1月, 2023 6 次提交
- W
  
  [PHI] channel_shuffle add yaml (#49808) · 56dbe426
  由 Weilong Wu 提交于 1月 16, 2023
  
  56dbe426
- W
  
  add add_n for the 0d tensor (#49854) · 65b0181e
  由 wawltor 提交于 1月 16, 2023
  
  65b0181e
- Y
  [Paddle-TRT] support nhwc (#49633) · e43f7102
  由 Yuanle Liu 提交于 1月 16, 2023
```
* add trt_support_nhwc_pass
```
  e43f7102
- Q
  
  add prod for kunlun (#49816) · bd03652f
  由 QingshuChen 提交于 1月 16, 2023
  
  bd03652f
- Z
  
  add sqrt_comp_grad composite rule (#49769) · 70378584
  由 zqw_1997 提交于 1月 16, 2023
  
  70378584
- X
  
  【prim】vjp for reduce sum (#49736) · 292f3f77
  由 xiaoguoguo626807 提交于 1月 16, 2023
  
  292f3f77
15 1月, 2023 1 次提交

【Prim】Enhance tests (#49814) · 090aa45d

由 Jiabin Yang 提交于 1月 15, 2023

* support elementwise base func

* fix compiling error and add test

* remove additional param

* support vjp for div using comp

* remove additional change

* fix dy2st error with magic num

* fix dy magic num

* another magic

* another magic

* add more test

* fix windows problem

* another magic

* fix windows compile

* invoke ci

* add skip rename strategy

* support add vjp

* fix test_tanh

* support add with new axis cal

* fix resnet and some test

* add composite log

* support sub vjp

* enhance_tests

* support more dtype for full

090aa45d

13 1月, 2023 16 次提交
- W
  
  [Phi] heaviside add yaml (#49807) · 4b7aeba4
  由 Weilong Wu 提交于 1月 13, 2023
  
  4b7aeba4
- C
  
  New feature: add register composite rule of ops (#49605) · 6ed8221a
  由 cyber-pioneer 提交于 1月 13, 2023
  
  6ed8221a
- W
  add oss flash fmha and fmhca support (#49438) · a48b8e2c
  由 Wang Bojun 提交于 1月 13, 2023
```
* add fmha_flashattention oss plugin
```
  a48b8e2c
- [Zero-Dim]simplify static unittest (#49805) · 650a0836
  由 zhouweiwei2014 提交于 1月 13, 2023
  
  650a0836
- R
  [Zero-Dim] add where, atan2, median 0-Dim ut (#49692) · 1508cae7
  由 ronnywang 提交于 1月 13, 2023
```
* add where, atan2, median 0d ut

* add where, atan2, median 0d ut

* update

* update

* update
```
  1508cae7
- Z
  [inference][trt]set output data type of trt network (#49712) · 690d7a69
  由 Zhang Jun 提交于 1月 13, 2023
```
* update trt engine to set in/out data type

* update

* Update engine.cc

* Update engine.cc

* update

* set engine output type before freeze the network

* update

* update trt autoscan ut

* update

* update ut

* fix equal bug, update ut

* fix cast and equal ut

* update cast ut using TRT < 8.4

* set datatype from scope

* check output var is nullptr

* Update op_converter.h

* update tensorrt_engine_op_test ut

* update
```
  690d7a69
- J
  【Prim】Support elementwise related VJP with primitives (#49784) · 561f9013
  由 Jiabin Yang 提交于 1月 13, 2023
```
* support elementwise base func

* fix compiling error and add test

* remove additional param

* support vjp for div using comp

* remove additional change

* fix dy2st error with magic num

* fix dy magic num

* another magic

* another magic

* add more test

* fix windows problem

* another magic

* fix windows compile

* invoke ci

* add skip rename strategy

* support add vjp

* fix test_tanh

* support add with new axis cal

* fix resnet and some test

* add composite log

* support sub vjp
```
  561f9013
- W
  
  fix a bug of stage2 offload. (#49767) · 1c8531ce
  由 wuhuachaocoding 提交于 1月 13, 2023
  
  1c8531ce
- J
  kunlun add support for c_concat and c_split (#49757) · a09b9a3f
  由 jameszhang 提交于 1月 13, 2023
```
* kunlun add support for c_concat and c_split

* replace mutable_data() and ShareDataWith()
```
  a09b9a3f
- Y
  
  add xpu adagrad and where_grad kernels (#49701) · a99c3cd4
  由 ykkk2333 提交于 1月 13, 2023
  
  a99c3cd4
- J
  fix xpu unittest issue (#49760) · ddc8a726
  由 jameszhang 提交于 1月 13, 2023
```
* fix xpu unittest issue: zero_dim_tensor

* deal with leftout issue introduced by #49470
```
  ddc8a726
- L
  
  Add unitest for set_value, set_value_grad. test=kunlun (#49773) · 5e722245
  由 Leo Guo 提交于 1月 13, 2023
  
  5e722245
- [Zero-Dim] add static graph gradient test method for 0D Tensor input (#49755) · 5fd115f3
  由 zhouweiwei2014 提交于 1月 13, 2023
  
  5fd115f3
- W
  
  add prelu & prelu_grad op for xpu (#49672) · 8d512b8f
  由 wangshengxiang 提交于 1月 13, 2023
  
  8d512b8f
- W
  [PHI] rrelu add yaml (#49779) · 8447f876
  由 Weilong Wu 提交于 1月 13, 2023
```
* [PHI] rrelu add yaml

* polish

* polish
```
  8447f876
- W
  
  update reader in sharding unit test. (#49652) · 163c6a9e
  由 wuhuachaocoding 提交于 1月 13, 2023
  
  163c6a9e
12 1月, 2023 7 次提交

lerp support 0 Tensor (#49667) · 8cd0d5b3

由 sunli 提交于 1月 12, 2023

* lerp support 0 Tensor

* fix lerp grad

* fix lerp zero test

* fix 0D + ND/ND + 0D

* fix check

* update code

* fix lerp infer shape

* static backward test

* updata static graph test

8cd0d5b3

Z

move fuild.contrib.mixed_precision to paddle.static.amp (#49412) · 69d01eb9
由 zhangkaihuo 提交于 1月 12, 2023

69d01eb9

Fix reduce func bug in process_group_bkcl (#49749) · 8e291bf7

由 jameszhang 提交于 1月 12, 2023

* Fix reduce func bug in process_group_bkcl

Also catch up with a recent process_group PR that failed to add XPU branch.
Note that reduce is still accomplished by allreduce for xpu. Fix this should
xccl lib be updated.

* fix compile issue for non-XPU

8e291bf7

W
more preln_gn patterns (#49728) · adcb0039
由 wenbin 提交于 1月 12, 2023
```
* compile fix

* fix compile

* compile fix

* add more preln
```
adcb0039

[Zero-Dim] support input 0D Tensor for fmax/fmin/complex api (#49730) · a015f815

由 FlyingQianMM 提交于 1月 12, 2023

* [Zero-Dim] support input 0D Tensor for maximum,minimum,allclose,sigmoid_focal_loss

* [Zero-Dim] add backward test for sigmoid_focal_loss with 0-D input Tensor

* [Zero-Dim] support input 0D Tensor for fmax,fmin,complex

a015f815

Y

change test class's name (#49729) · 280677c5
由 yuehuayingxueluo 提交于 1月 12, 2023

280677c5
Z
[AutoParallel] recovery annotation (#49665) · 5c9c1a39
由 zhaoyingli 提交于 1月 12, 2023
```
* recovery annotation

* bugfix
```
5c9c1a39

11 1月, 2023 6 次提交

N

Update the style of print for low precision op list (#49648) · 395520f1
由 niuliling123 提交于 1月 11, 2023

395520f1

add FusedLinear pass (#49606) · 0f08a432

由 yuehuayingxueluo 提交于 1月 11, 2023

* add FusedLinear pass

* add fused_op_list and renname PASSES to OP_FUSION

* add fused_passes_list to constants.py

* add test_passes.py

* fix test_fused_passes.py

* fix add if float(paddle.version.cuda()) >= 11.6:

* renamed test_fused_passes.py

* fix CMakeList.txt

0f08a432

[Dy2St] 移除 ProgramTranslator (#49628) · 2bb28f31

由 Ryan 提交于 1月 11, 2023

* add enable_to_static and drop some methods of ProgramTranslator

* fix code style

* fix cant import enable_to_static and update unitest

* change unitest and rollback code of PT

* fix can't import as of utils

* roll back PT

* fix roll back

* add some unitest

* add unitest and fix codestyle bug in api.py

* finish all unitest

* remove ProgramTranslator

* fix code style

* restore test_program_translator

* api.py remove get_func

* TestDygraphToStaticCode

* fix check_type and import err

* roll back PT without getcode

* roll back pt with get_code

* convert_to_static

* fix import __all__

2bb28f31

L

fix hsigmoid_loss (#49549) · 8f0adcb5
由 Linjie Chen 提交于 1月 11, 2023

8f0adcb5
L
Add input check for NLLLoss (#49547) · 08bf1b49
由 Linjie Chen 提交于 1月 11, 2023
```
* fix nll_loss

* fix nll_loss

* update

* update

* update

* fix
```
08bf1b49

姜

rm retain_grad_flag for tests part0 (#49655) · a504508c

由姜永久提交于 1月 11, 2023

* rm retain_grad_flag for tests

* modify transpose op

* retain grads for xpu tests

* lint

* modify xpu test

a504508c

10 1月, 2023 1 次提交

Use `CommContextManager` to init comm op using gloo backend (#49666) · 05df6973

由 Wen Sun 提交于 1月 10, 2023

* refactor: gloo comm context migration

* fix: headers & avoid mutable_data usage

* fix: cmake gloo dep

* style: rename funcs

* refactor: move to new files

* fix: gloo deps

* refactor: simplify create device

05df6973

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致