提交 · 0e9a48c7a65b6255d79fda3f69faa09327694622 · PaddlePaddle / Paddle

16 3月, 2023 3 次提交

由 Vegetable dog 提交于 3月 16, 2023

* update rnn.py

* update common.py

* update rnn.py

* update common.py

* fix CI

0e9a48c7

add index select op (#51498) · 543da561

由 xjmxyt 提交于 3月 16, 2023

* add index select op

* add to op teller

* add trt version control

* delete useless code

543da561

【Prim】Fix dropout CINN amp error (#51688) · 94cd1ba2

由 Jiabin Yang 提交于 3月 16, 2023

* support amp logic for layer_norm and softmax

* fix layer_norm amp

* fix layernorm api and dropout fp16

* fix layernorm api and dropout fp16

* fix bn, ln dtype in float16

* fix dropout fp16

* fix comment

* fix cinn dropout amp error

94cd1ba2

15 3月, 2023 22 次提交

add assign composite backward op (#51430) · 297182f7

由 SylarTiaNII 提交于 3月 15, 2023

* add assign composite backward op

* fix log msg

* code style

* fix comp rule

* replace assign with by_pass

297182f7

【Prim】Custom softmax grad (#51474) · f124c86f

由 Jiabin Yang 提交于 3月 15, 2023

* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)

* [CINN]Enhance CacheKey hash logic by considering input dtypes

* add unittest

* fix typo

* fix typo

* fix map.at

* fix find

* fix test

* fix cinn cache key structure realize

* using ordered map for attributes

* add test by review advice

---------
Co-authored-by: Njiangcheng <thisjiang@qq.com>

* [prim] enable dygraph_to_static to support custom_vjp

* Pr 50885 (#7)

* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)

* [CINN]Enhance CacheKey hash logic by considering input dtypes

* add unittest

* fix typo

* fix typo

* fix map.at

* fix find

* fix test

* fix cinn cache key structure realize

* using ordered map for attributes

* add test by review advice

---------
Co-authored-by: Njiangcheng <thisjiang@qq.com>

* [prim] enable dygraph_to_static to support custom_vjp

* fix code in a dy2static-friendly way.

* [dystatic] add hooker for prim

---------
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
Co-authored-by: Njiangcheng <thisjiang@qq.com>
Co-authored-by: Ncxxly <chenxx_id@163.com>

* [prim] enable dygraph_to_static to support custom_vjp

* fix cast prim and vjp dtype mapping error bug

* Cxx prim custom vjp (#8)

* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)

---------
Co-authored-by: Njiangcheng <thisjiang@qq.com>

* [prim] enable dygraph_to_static to support custom_vjp

* Pr 50885 (#7)

* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)

* [CINN]Enhance CacheKey hash logic by considering input dtypes

---------
Co-authored-by: Njiangcheng <thisjiang@qq.com>

* [prim] enable dygraph_to_static to support custom_vjp

* fix code in a dy2static-friendly way.

* [dystatic] add hooker for prim

---------
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
Co-authored-by: Njiangcheng <thisjiang@qq.com>
Co-authored-by: Ncxxly <chenxx_id@163.com>

* [prim] enable dygraph_to_static to support custom_vjp

* fix cast prim and vjp dtype mapping error bug

* [dy2static-ci] fix dy2static ci errors.

---------
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
Co-authored-by: Njiangcheng <thisjiang@qq.com>
Co-authored-by: Ncxxly <chenxx_id@163.com>

* [Prim] enable whitelist and blacklist for custom_vjp

* support softmax grad

* remove additional code

* add test back

---------
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
Co-authored-by: Njiangcheng <thisjiang@qq.com>
Co-authored-by: Ncxxly <chenxx_id@163.com>
Co-authored-by: Nxiongkun <807377414@qq.com>

f124c86f

Add option for setup.py (#51443) · 0b1a8a83

由 risemeup1 提交于 3月 15, 2023

* add option for setup.py

* add option for setup.py

* add option for setup.py

* add option for setup.py

* add ennv_dict.py and dist/ to .gitignore

* add ennv_dict.py and dist/ to .gitignore

* modify .gitignore

0b1a8a83

optimizing setup.py develop command (#51528) · e4e3675f

由 risemeup1 提交于 3月 15, 2023

* optimizing setup.py develop command

* add libpaddle.so

* modify setup.py

* add python/paddle/distributed/fleet/.gitignore

* add libpaddle.so to .gitignore

* add *.so to python/paddle/libs/.gitignore

* add new gitignore

e4e3675f

L

increase timeout limit of test_resnet_prim_cinn, test=document_fix (#51696) · be9515f2
由 Leo Chen 提交于 3月 15, 2023

be9515f2
L
support set_default_dtype bf16 (#51650) · 410f1629
由 Leo Chen 提交于 3月 15, 2023
```
* support set_default_dtype bf16

* support float
```
410f1629

feat: add rsqrt composite rule (#51432) · c9ca7c35

由 Kang Zhao 提交于 3月 15, 2023

* feat: add relu composite rule

* feat: add relu composite rule, maximum op

* feat: add relu composite rule, maximum op

* feat: add relu composite rule, polish comments

* feat: add relu composite rule, polish comments

* feat: add relu composite rule, add python api of relu

* feat: add relu composite rule, commit hook

* fix: maximum type error & ban cinn test

* fix: maximum input sequence bugs

* resolve conflicts

* fix: code style bugs

* add: relu fp16 test

* feat: add rsqrt composite rule

* feat: add rsqrt composite rule

* resolve conflicts of composite rule

* fix: delete check eager

c9ca7c35

K
remove unit tests about GraphExecutionOptimizer (#51575) · 09ae2852
由 kangguangli 提交于 3月 15, 2023
```
* remove unit tests about GraphExecutionOptimizer

* remove test file
```
09ae2852
K
remove parallel_executor related unit tests (#51632) · 892f94bc
由 kangguangli 提交于 3月 15, 2023
```
* remove parallel_executor related unit tests

* fix CI
```
892f94bc
W
support to_static for SpectralNorm (#51622) · 4283e19e
由 wangna11BD 提交于 3月 15, 2023
```
* support to_static for SpectralNorm
```
4283e19e
W

[AMP OP&Test]fix index_select bf16 test (#51652) · e5616448
由 wangxiaoning 提交于 3月 15, 2023

e5616448

【Prim】Support amp logic for layer_norm and softmax (#51473) · 64076727

由 Jiabin Yang 提交于 3月 15, 2023

* support amp logic for layer_norm and softmax

* fix layer_norm amp

* fix layernorm api and dropout fp16

* fix layernorm api and dropout fp16

* fix bn, ln dtype in float16

* fix dropout fp16

* fix comment

64076727

K
[with_data_parallel][part13] remove with_data_parallel in example code (#51588) · 14f1973d
由 kangguangli 提交于 3月 15, 2023
```
* remove with_data_parallel in example code

* revert python/paddle/fluid/data_feeder.py

* fix static.nn.fc api
```
14f1973d
W
support gather test on prim and cinn (#51376) · 5b3c7ee7
由 Weilong Wu 提交于 3月 15, 2023
```
* support gather test on prim and cinn

* reset timeout for gather
```
5b3c7ee7
C
[Prim] add pow composite rule (#51070) · 2d9e103e
由 chenjian 提交于 3月 15, 2023
```
* add pow composite rule

* fix test

* fix unit test

* update test

* fix test

* update
```
2d9e103e
W

fix python syntax issue (#51658) · 9045b882
由 Weilong Wu 提交于 3月 15, 2023

9045b882
Y

[AMP OP&Test] Support bf16/fp16 for roll op and add ut. (#51565) · 1fbf423a
由 Yuang Liu 提交于 3月 15, 2023

1fbf423a
G

fix quantization int8 weight save bug (#51500) · 8fc9a19f
由 Guanghua Yu 提交于 3月 15, 2023

8fc9a19f

【AMP OP&Test】Add fp16 test for divide, matmul, pnorm (#51005) · c2b24166

由 Siming Dai 提交于 3月 15, 2023

* add fp16 test for divide, matmul, pnorm

* add cumsum fp16 unittest

* fix threshold

* revert cumsum

* fix code-style

* fix according to review

* fix kernel not found

c2b24166

G

add inplace sigmoid_ and multiply_ (#50267) · b3caa233
由 Guoxia Wang 提交于 3月 15, 2023

b3caa233
Z
Delete hardswish_raw op (#51634) · 3e636ec9
由 zhangyuqin1998 提交于 3月 15, 2023
```
* Delete hardswish_raw op

* fix ut
```
3e636ec9
W
refine amp scaler (#51340) · 1e232e27
由 wanghuancoder 提交于 3月 15, 2023
```
* refine _found_inf
```
1e232e27

14 3月, 2023 15 次提交
- V
  
  Adjust tolerance without modify grad (#51459) · 145a6cbb
  由 Vvsmile 提交于 3月 14, 2023
  
  145a6cbb
- P
  
  delete numpy version (#49556) · 117df481
  由 pangyoki 提交于 3月 14, 2023
  
  117df481
- [Zero-Dim] correct some code to adapt to 0D Tensor (#51562) · 6737226f
  由 zhouweiwei2014 提交于 3月 14, 2023
  
  6737226f
- C
  add split and split_with_num composite rule (#51341) · bb9eb20f
  由 ccrrong 提交于 3月 14, 2023
```
* add split_with_num composite rule

* add split_with_num composite rule

* add split composite rule

* update

* update test

* update test

* delete split_with_num_grad
```
  bb9eb20f
- Q
  
  implement expand as using tile (#51577) · 300b687a
  由 qizhaoaoe 提交于 3月 14, 2023
  
  300b687a
- L
  Optimization for layerNormGrad [Part1] (#51282) · 7a3d05d9
  由 limingshu 提交于 3月 14, 2023
```
* first commit

* fix code bugs in for_loop

* fix bugs in cuLoadAddStridedInputs.

* optimization for LayerNormBackwardComputeGradInput

* add unitest for validating the optimization

* fix windows ci error
```
  7a3d05d9
- G
  
  [Divide by 0 Error] add DataNormKernel check (#51583) · e4ba5f86
  由 gouzil 提交于 3月 14, 2023
  
  e4ba5f86
- P
  cuda graph support multi-stream for new executor (#51389) · 579fb5fd
  由 pangyoki 提交于 3月 14, 2023
```
* cuda graph support multi-stream for new executor

* fix windows compile error

* delete create_cuda_graph_stream
```
  579fb5fd
- Z
  
  fix cmakelist (#51546) · 26007b1d
  由 zhaoyingli 提交于 3月 14, 2023
  
  26007b1d
- Y
  [AMP OP&Test] Append bf16/fp16 support 4 elementwise_max (#51151) · 143eceeb
  由 YuhangLi 提交于 3月 14, 2023
```
* wisemax fp16 support

* add bf16 support 4 elementwise_max

* append broadcast 4 op 4 fp16 / bf16

* fix elewise_max ut bf16 numeric delta

* append fp/bf16 uts

* add fp/bf16 uts

* change bf16 uts delta

* fix some issue

* add prim 4 fp16
```
  143eceeb
- W
  
  fix rank=1 (#51413) · b4f49aa1
  由 wangxiaoning 提交于 3月 14, 2023
  
  b4f49aa1
- W
  
  fix test_layernorm_shift_partition_pass time out (#51612) · b642461d
  由 wenbin 提交于 3月 14, 2023
  
  b642461d
- X
  
  [dy2static] fix the speed problem introduced by #50883 (#51606) · 46d6080d
  由 xiongkun 提交于 3月 14, 2023
  
  46d6080d
- W
  [TRT] Fix conv2d filter of trt elementwiseadd_trans fusion UT (#51294) · dca81a43
  由 Wang Bojun 提交于 3月 14, 2023
```
* fix conv2d filter
```
  dca81a43
- X
  【prim】test composite rules with -1 shape (#51435) · 82a7c33e
  由 xiaoguoguo626807 提交于 3月 14, 2023
```
* init

* modify
```
  82a7c33e

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功