Commits · cabf3921e36bea2e9167624a298287c80f143e2c · PaddlePaddle / Paddle

29 Mar, 2023 · 2 commits

Add group_norm composite rule (#51874) · cabf3921

Committed by Yichen Zhang on Mar 29, 2023

* add group_norm composite rule

* add test for scale_grad and bias_grad

* resolve conflicts

* remove amp in composite_rule.py

* add float16 test

* deal with NHWC format

* keep the composite rule in float16 identical as original kernel

* resolve conflicts

cabf3921

Fix generate_kernels.py in CUDA 12.0 (#52232) · f2c96bc2
Committed by sneaxiy on Mar 29, 2023
```
* fix generate_kernels.py in CUDA 12.0

* fix attrs bug
```
f2c96bc2

28 Mar, 2023 · 5 commits
- [Executor] remove api `paddle.static.ParallelExecutor` (#51701) · e9c3da9e
  Committed by kangguangli on Mar 28, 2023
```
* remove api `class ParallelExecutor`

* remove other references
```
  e9c3da9e
- fix a typo, `sheduler` -> `scheduler` (#52149) · e492ee24
  Committed by Nyakku Shigure on Mar 28, 2023
  
  e492ee24
- [CodeStyle][PLR1701] unify multiple isinstance expressions as one (#52150) · c1838da6
  Committed by Kim on Mar 28, 2023
  
  c1838da6
- [CodeStyle][PLC0414] remove self-alias and some discussion (#52122) · 888b8b6b
  Committed by 张春乔 on Mar 28, 2023
  
  888b8b6b
- 【Prim】Optimize composite rule by making scalar shape as 1 (#51960) · 45acb717
  Committed by Jiabin Yang on Mar 28, 2023
```
* optimize composite rule by making scalar shape as []1

* fix shape usage for 0D

* fix rules

* fix 0D error

* fix flatten 0D error

* fix bn eval mode

* fix bn test

* fix flatten
```
  45acb717
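
The PLR1701 cleanup listed above (#52150) merges chained `isinstance` checks into a single call. A minimal sketch of that kind of rewrite, using hypothetical function names that are not taken from the Paddle source:

```python
# Illustrative only: is_number_before / is_number_after are made-up names.

def is_number_before(value):
    # Style flagged by PLR1701: several isinstance calls joined with `or`.
    return isinstance(value, int) or isinstance(value, float)


def is_number_after(value):
    # Equivalent form: one isinstance call with a tuple of types.
    return isinstance(value, (int, float))


samples = [1, 2.5, "3", None, True]
# Both forms agree on every sample (note that bool is a subclass of int).
assert [is_number_before(v) for v in samples] == [is_number_after(v) for v in samples]
```

The tuple form evaluates the same condition in one call, which is why the change is treated as a pure style fix.
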
27 Mar, 2023 · 1 commit
- fix eval branch of composite rule of batch_norm (#52154) · 20befdef
  Committed by cyber-pioneer on Mar 27, 2023
  
  20befdef
25 Mar, 2023 · 1 commit
- [CodeStyle][PLR0402] import a.b to from a import b (#52125) · 8c17fc0b
  Committed by 张春乔 on Mar 25, 2023
  
  8c17fc0b
24 Mar, 2023 · 1 commit

Memory Efficient Attention (#51867) · e5ad3859

Committed by ZhangDY-6483 on Mar 24, 2023

* first version, notest

* return final rst, notest

* use infinity() instead of max

* ut structure

* start up of ut

* generate lse

* update

* add depense

* reconstruct cmake

* move file

* add memory efficient attention and fix blasimpl

* update

* update cmake

* add namespace

* update cmake

* use .cu

* update for pad3d

* bug fix

* bug fix

* update

* bug fix

* update enforce

* add test case

* merge the lse pad

* fix kernel_fn of backward

* fix PADDLE_ENFORCE_EQ and phi_api

* fix PADDLE_ENFORCE

* fix PADDLE_ENFORCE

* rerun coverage

* fix memory efficient attention test

* rerun ci

* add cuda version condition

* add cuda version condition

* delete WIP test

* replace PADDLE_ENFORCE

* edit the namespace of datatype in multiple.cc

* rerun

* rerun

---------
Co-authored-by: liuyuang <liuyuang@baidu.com>

e5ad3859

23 Mar, 2023 · 3 commits

[Prim] add meshgrid composite rule (#51061) · 53bb883d

Committed by chenjian on Mar 23, 2023

* add meshgrid composite rule

* add meshgrid composite rule

* update

* add into CMakeLists

* fix

* update

* update

* optimize code

* fix meshgrid op

* update test

53bb883d

[CodeStyle][C403] Unnecessary list comprehension (rewrite as a set comprehension) (#51968) · ca7394cd
Committed by Infinity_lee on Mar 23, 2023

ca7394cd
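
As a rough illustration of what the C403 rule named in this commit title targets, the sketch below builds the same set twice: once through an intermediate list and once with a set comprehension. The variable names and sample data are hypothetical and not drawn from the Paddle code base.

```python
# Illustrative only: op_names is made-up sample data.
op_names = ["relu", "sqrt", "relu", "pow"]

# Style flagged by C403: a list comprehension wrapped in set().
before = set([name.upper() for name in op_names])

# Rewritten form: a set comprehension, with no temporary list.
after = {name.upper() for name in op_names}

# Both expressions produce the same set.
assert before == after
```
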

[CodeStyle][C408][C409][C410] Fix unnecessary <dict/list/tuple> call and... · cf391b81

Committed by PuQing on Mar 23, 2023

[CodeStyle][C408][C409][C410] Fix unnecessary <dict/list/tuple> call and unnecessary <list/tuple> passed to <list/tupule>() (#51928)

* autofix

* add select config

* autofix C410

* add C410 select

cf391b81

22 Mar, 2023 · 3 commits
- add fused dropout add (#51752) · 6ba0507d
  Committed by ShenLiang on Mar 22, 2023
  
  6ba0507d
- [CodeStyle][UP018] Unnecessary call to `str` (#51922) · 52a31b87
  Committed by iSerendipity on Mar 22, 2023
  
  52a31b87
- [CodeStyple][B011] replace assert false with raise AssertionError (#51935) · 2922aa67
  Committed by Ainavo on Mar 22, 2023
```
* replace assert false with AssertionError

* modify the redundant parts of the config file
```
  2922aa67
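
The B011 change listed above (#51935) replaces `assert False` with an explicit `raise AssertionError`. The reason usually given for this rule is that `assert` statements are stripped when Python runs with `-O`, whereas an explicit `raise` always executes. A minimal sketch with hypothetical function names:

```python
# Illustrative only: dispatch_before / dispatch_after are made-up names.

def dispatch_before(kind):
    if kind == "a":
        return 1
    # Removed entirely under `python -O`, so the function would fall
    # through and return None instead of failing.
    assert False, f"unexpected kind: {kind}"


def dispatch_after(kind):
    if kind == "a":
        return 1
    # Always raises, regardless of optimization flags.
    raise AssertionError(f"unexpected kind: {kind}")


# The supported branch behaves identically in both versions.
assert dispatch_before("a") == dispatch_after("a") == 1
```
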
21 Mar, 2023 · 3 commits
- remove unnecessary generator set and dict (#51845) · cdc5896f
  Committed by Ainavo on Mar 21, 2023
  
  cdc5896f
- [prim] simplify batch_norm composite rule (#51827) · f47a5f7f
  Committed by cyber-pioneer on Mar 21, 2023
```
* simplify batch_norm composite rule

* polish code
```
  f47a5f7f
- [Zero-Dim] Support 0D for... · c74aaf67
  Committed by zhouweiwei2014 on Mar 21, 2023
```
[Zero-Dim] Support 0D for numel/rank/size/optimizer/create_parameter/create_global_var, fix some usage to adapt 0D (#51566)
```
  c74aaf67
20 Mar, 2023 · 7 commits

[CodeStyle][UP008] remove super call with parameters (#51812) · 81f3f6b5
Committed by Ainavo on Mar 20, 2023
```
* remove super call with parameters

* fix bug
```
81f3f6b5
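
For readers unfamiliar with UP008, the rule named in this commit title concerns the Python 2 style `super(Class, self)` call, which Python 3 lets you write as a bare `super()`. The classes below are hypothetical and only sketch the pattern:

```python
# Illustrative only: Base, Before and After are made-up classes.

class Base:
    def __init__(self):
        self.ready = True


class Before(Base):
    def __init__(self):
        # Older style with explicit arguments, flagged by UP008.
        super(Before, self).__init__()


class After(Base):
    def __init__(self):
        # Zero-argument form, resolved to the same parent in Python 3.
        super().__init__()


# Both subclasses run Base.__init__ and end up in the same state.
assert Before().ready and After().ready
```
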

【prim】New layer_norm grad (#51750) · 802a81d0

Committed by xiaoguoguo626807 on Mar 20, 2023

* Add flatten composite rule

* get the right xshape and pass func test

* add cinn unit test

* Remove cinn test, wait for it to be added after repair

* add comp test to test_flatten_contiguous_range_op.py

* remove func test on composite_ops

* Add comments to maybe_wrap_dim func

* remove commented code

* fix the problem with 0D tensor case

* add flatten split rule comment

* fix syntax issues

* block flatten on resnet_prim_cinn

* init change

* tmp commit

* add layer_norm InferMeta check

* cast type modify

* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)

* [CINN]Enhance CacheKey hash logic by considering input dtypes

* add unittest

* fix typo

* fix typo

* fix map.at

* fix find

* fix test

* fix cinn cache key structure realize

* using ordered map for attributes

* add test by review advice

---------
Co-authored-by: jiangcheng <thisjiang@qq.com>

* [prim] enable dygraph_to_static to support custom_vjp

* Pr 50885 (#7)

* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)

* [CINN]Enhance CacheKey hash logic by considering input dtypes

* add unittest

* fix typo

* fix typo

* fix map.at

* fix find

* fix test

* fix cinn cache key structure realize

* using ordered map for attributes

* add test by review advice

---------
Co-authored-by: jiangcheng <thisjiang@qq.com>

* [prim] enable dygraph_to_static to support custom_vjp

* fix code in a dy2static-friendly way.

* [dystatic] add hooker for prim

---------
Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
Co-authored-by: jiangcheng <thisjiang@qq.com>
Co-authored-by: cxxly <chenxx_id@163.com>

* [prim] enable dygraph_to_static to support custom_vjp

* fix cast prim and vjp dtype mapping error bug

* recover

* big tol

* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)

* [CINN]Enhance CacheKey hash logic by considering input dtypes

* add unittest

* fix typo

* fix typo

* fix map.at

* fix find

* fix test

* fix cinn cache key structure realize

* using ordered map for attributes

* add test by review advice

---------
Co-authored-by: jiangcheng <thisjiang@qq.com>

* [prim] enable dygraph_to_static to support custom_vjp

* Pr 50885 (#7)

* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)

* [CINN]Enhance CacheKey hash logic by considering input dtypes

* add unittest

* fix typo

* fix typo

* fix map.at

* fix find

* fix test

* fix cinn cache key structure realize

* using ordered map for attributes

* add test by review advice

---------
Co-authored-by: jiangcheng <thisjiang@qq.com>

* [prim] enable dygraph_to_static to support custom_vjp

* fix code in a dy2static-friendly way.

* [dystatic] add hooker for prim

---------
Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
Co-authored-by: jiangcheng <thisjiang@qq.com>
Co-authored-by: cxxly <chenxx_id@163.com>

* [prim] enable dygraph_to_static to support custom_vjp

* fix cast prim and vjp dtype mapping error bug

* Cxx prim custom vjp (#8)

* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)

---------
Co-authored-by: jiangcheng <thisjiang@qq.com>

* [prim] enable dygraph_to_static to support custom_vjp

* Pr 50885 (#7)

* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)

* [CINN]Enhance CacheKey hash logic by considering input dtypes

---------
Co-authored-by: jiangcheng <thisjiang@qq.com>

* [prim] enable dygraph_to_static to support custom_vjp

* fix code in a dy2static-friendly way.

* [dystatic] add hooker for prim

---------
Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
Co-authored-by: jiangcheng <thisjiang@qq.com>
Co-authored-by: cxxly <chenxx_id@163.com>

* [prim] enable dygraph_to_static to support custom_vjp

* fix cast prim and vjp dtype mapping error bug

* [dy2static-ci] fix dy2static ci errors.

---------
Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
Co-authored-by: jiangcheng <thisjiang@qq.com>
Co-authored-by: cxxly <chenxx_id@163.com>

* [Prim] enable whitelist and blacklist for custom_vjp

* debug log

* clear log

* fix

* nothing

* less memory

* recover utils

* fix

* modify threshold value

* skip layer_norm for test_bert

* back to bert success state

* add epsion

* delete unnecessary compute

* modify amp dtype

* modify * order

* delete sqrt check and fp16

---------
Co-authored-by: xuyongsheng <xuyongsheng@baidu.com>
Co-authored-by: xysheng-baidu <121540080+xysheng-baidu@users.noreply.github.com>
Co-authored-by: Aurelius84 <zhangliujie@baidu.com>
Co-authored-by: jiangcheng <thisjiang@qq.com>
Co-authored-by: cxxly <chenxx_id@163.com>
Co-authored-by: xiongkun <807377414@qq.com>

802a81d0

[Zero-Dim] fix Tensor.numpy, cntrol whether to hack process to 1D (#51757) · d7035454
Committed by zhouweiwei2014 on Mar 20, 2023

d7035454

【fluid clean】Move out layers and layers helper (#49415) · 1d5cad23

Committed by GGBond8488 on Mar 20, 2023

* remove no used fluid beam_search_decoder

* move Layer and related helper to paddle.nn.common

* modify Layer references from dygraph.layers.Layer to paddle.nn.common.layers

* stash changge

* remove fluid layer_object_helper, layers.py

* remove fluid layers init

* add setip

* fix unitest

* delete layers in fluid.dygraph

* merge paddle.tensor.stat,py

* fix circle import

* fix curcle import

* remove redundant in_dygraph_mode import

* revoce paddle.nn.common.* in fluid.__init__

* recovery nn.rnn

* paddle.frame use lazy import import paddle.jit to avoid circle import

* remove left dygraph.layers ref

* merge develop

* fix import error

* fix test error

* fxi merge error

* fix test fluid.Layer

* fix test error

* fix test error

* fix import error

* fix import error

* fix comments

* fix circle import

* fix rnn import error

* fix circle import

1d5cad23

add composite rules for squeeze op (#51539) · 89ff0d59

Committed by warrentdrew on Mar 20, 2023

* add composite rule for squeeze

* fix pre commit

* fix pre commit

* simplify rules

* arrange code

* fix int axis

* simplify squeeze axis rules

* bugfix

* fix pre commit

89ff0d59

Fluid clean move out fill constant (#49511) · c985b1ac

Committed by GGBond8488 on Mar 20, 2023

* migrate fill_constant to paddle.tensor

* move fill_constant to paddle.tensor and repalce the reference

* add missing fill_constant replacement

* fix typro

* remove unused import fill_constant

* fix zeros import error

* fix circle import

* fix layers.zeros

* fix unitest

* fix unitests

* fix unitest

* use paddle.full replace fill_constant in samplecode

* fix sample code

* recovery xpu test

* recovery xpu test

* fix circle import

* fix utils import error

* fix utils error

* fix circle import

* redo

* fix circle import

* fix prim fill constant import

* fix type error

* fix increase error

* fix test error

* fix fill_constant

c985b1ac

support relue custom vjp (#51742) · 604b7a53
Committed by Jiabin Yang on Mar 20, 2023

604b7a53

17 Mar, 2023 · 4 commits

fluid clean: remove fluid.ir and fluid.io (#51167) · 00877381

Committed by qizhaoaoe on Mar 17, 2023

* fluid clean: remove fluid.ir to framework.ir and some funcs form fluid.layer.io to incubate.

* delete fluid.ir

00877381

[Prim] support batch_norm vjp (#51283) · ff40a7e5

Committed by cyber-pioneer on Mar 17, 2023

* add bn vjp

* fix example

* fix code

* fix code

* fix cinn case

* fix code

* fix example

* fix code

* fix example

* fix example

ff40a7e5

Add sqrt composite rule (#51080) · aba9c4d4

Committed by mhy-666 on Mar 17, 2023

* add sqrt composite rule/test

* add sqrt composite rule/test

* fix ops/sqrt, add cinn test

* fix sqrt_comp

* fix sqrt_comp

* fix sqrt_comp

* fix

* fix codestyle

* fix codestyle

* add fp16 test

* add ops/sqrt

* fix

* fix

* fix unitest

* fix

* fix

* fix

aba9c4d4

[CodeStyle][B009][B010] use normal property access instead of getattr/setattr (#51530) · 2f2b1f23
Committed by Nyakku Shigure on Mar 17, 2023

2f2b1f23
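
The B009/B010 rules named in this commit title apply when `getattr` or `setattr` is called with a constant attribute name, where plain attribute access does the same thing. A minimal sketch, assuming a hypothetical `Config` class and attribute name:

```python
# Illustrative only: Config and use_cinn are made-up names.

class Config:
    pass


# Style flagged by B010 / B009: constant attribute names passed as strings.
cfg_before = Config()
setattr(cfg_before, "use_cinn", True)
value_before = getattr(cfg_before, "use_cinn")

# Rewritten form: normal attribute assignment and access.
cfg_after = Config()
cfg_after.use_cinn = True
value_after = cfg_after.use_cinn

# Both objects end up with the same attribute value.
assert value_before == value_after
```

`getattr` and `setattr` remain the right tool when the attribute name is only known at run time or when a default value is needed.
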

16 Mar, 2023 · 2 commits

Comp index select (#51215) · d1e2c61b
Committed by Roc on Mar 16, 2023

d1e2c61b

【Prim】Fix dropout CINN amp error (#51688) · 94cd1ba2

Committed by Jiabin Yang on Mar 16, 2023

* support amp logic for layer_norm and softmax

* fix layer_norm amp

* fix layernorm api and dropout fp16

* fix layernorm api and dropout fp16

* fix bn, ln dtype in float16

* fix dropout fp16

* fix comment

* fix cinn dropout amp error

94cd1ba2

15 Mar, 2023 · 4 commits

feat: add rsqrt composite rule (#51432) · c9ca7c35

Committed by Kang Zhao on Mar 15, 2023

* feat: add relu composite rule

* feat: add relu composite rule, maximum op

* feat: add relu composite rule, maximum op

* feat: add relu composite rule, polish comments

* feat: add relu composite rule, polish comments

* feat: add relu composite rule, add python api of relu

* feat: add relu composite rule, commit hook

* fix: maximum type error & ban cinn test

* fix: maximum input sequence bugs

* resolve conflicts

* fix: code style bugs

* add: relu fp16 test

* feat: add rsqrt composite rule

* feat: add rsqrt composite rule

* resolve conflicts of composite rule

* fix: delete check eager

c9ca7c35

【Prim】Support amp logic for layer_norm and softmax (#51473) · 64076727

Committed by Jiabin Yang on Mar 15, 2023

* support amp logic for layer_norm and softmax

* fix layer_norm amp

* fix layernorm api and dropout fp16

* fix layernorm api and dropout fp16

* fix bn, ln dtype in float16

* fix dropout fp16

* fix comment

64076727

[Prim] add pow composite rule (#51070) · 2d9e103e
Committed by chenjian on Mar 15, 2023
```
* add pow composite rule

* fix test

* fix unit test

* update test

* fix test

* update
```
2d9e103e
refine amp scaler (#51340) · 1e232e27
Committed by wanghuancoder on Mar 15, 2023
```
* refine _found_inf
```
1e232e27

14 Mar, 2023 · 4 commits
- implement expand as using tile (#51577) · 300b687a
  Committed by qizhaoaoe on Mar 14, 2023
  
  300b687a
- 【prim】test composite rules with -1 shape (#51435) · 82a7c33e
  Committed by xiaoguoguo626807 on Mar 14, 2023
```
* init

* modify
```
  82a7c33e
- [Prim] enable whitelist and blacklist for custom_vjp · 300f36c0
  Committed by cxxly on Mar 05, 2023
  
  300f36c0
- fix cast prim and vjp dtype mapping error bug · 5dda91a8
  Committed by cxxly on Mar 02, 2023
  
  5dda91a8

PaddlePaddle / Paddle synced successfully about 1 year ago
