提交 · 5745a63f2a68f0c976ffab317f43c4454134fb31 · PaddlePaddle / Paddle

18 5月, 2023 1 次提交

support auto generate for op layer_norm (#53178) · 4f07b653

由 RedContritio 提交于 5月 18, 2023

* simplify layer_norm_op.cc

* support auto generate for op layer_norm

* update unittest for composite_layer_norm

* remove layer_norm_op.cc from scripts

* replace layer_norm_op with generated_op

* add get_expected_kernel for layer_norm

* update cmake kernel register function for layer_norm_mkldnn_op

4f07b653

22 4月, 2023 1 次提交

[Zero-Dim] support output 0D for... · b406a7db

由 wangfengsheng1999 提交于 4月 22, 2023

[Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase (#52850)

* [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase

* [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase

* add test case

* modify dot/metric.accuracy/static.accuracy/static.auc

* modfiy inner/tensordot bug

* test 9 api

* [Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase

* fix bug

* support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy

* code style

* fix bug

* fix test_dot_op bug

* fix accuracy bug

* fix bug

* fix bug

* fix bug

* fix bug

* codestyle

* fix dot bug

* fix dot bug

* fix dot bug

* code style

* fix dot bug

* fix dot bug

* fix dot bug

* fix dot bug

* fix dot bug

* fix dot bug

* modify code

b406a7db

06 4月, 2023 1 次提交
- S
  Fix flash attention bug (#52551) · 8ac5a6b6
  由 sneaxiy 提交于 4月 06, 2023
```
* fix flash attn

* fix another API
```
  8ac5a6b6
29 3月, 2023 1 次提交

Add group_norm composite rule (#51874) · cabf3921

由 Yichen Zhang 提交于 3月 29, 2023

* add group_norm composite rule

* add test for scale_grad and bias_grad

* resolve conflicts

* remove amp in composite_rule.py

* add float16 test

* deal with NHWC format

* keep the composite rule in float16 identical as original kernel

* resolve conflicts

cabf3921

20 3月, 2023 1 次提交

【prim】New layer_norm grad (#51750) · 802a81d0

由 xiaoguoguo626807 提交于 3月 20, 2023

* Add flatten composite rule

* get the right xshape and pass func test

* add cinn unit test

* Remove cinn test, wait for it to be added after repair

* add comp test to test_flatten_contiguous_range_op.py

* remove func test on composite_ops

* Add comments to maybe_wrap_dim func

* remove commented code

* fix the problem with 0D tensor case

* add flatten split rule comment

* fix syntax issues

* block flatten on resnet_prim_cinn

* init change

* tmp commit

* add layer_norm InferMeta check

* cast type modify

* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)

* [CINN]Enhance CacheKey hash logic by considering input dtypes

* add unittest

* fix typo

* fix typo

* fix map.at

* fix find

* fix test

* fix cinn cache key structure realize

* using ordered map for attributes

* add test by review advice

---------
Co-authored-by: Njiangcheng <thisjiang@qq.com>

* [prim] enable dygraph_to_static to support custom_vjp

* Pr 50885 (#7)

* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)

* [CINN]Enhance CacheKey hash logic by considering input dtypes

* add unittest

* fix typo

* fix typo

* fix map.at

* fix find

* fix test

* fix cinn cache key structure realize

* using ordered map for attributes

* add test by review advice

---------
Co-authored-by: Njiangcheng <thisjiang@qq.com>

* [prim] enable dygraph_to_static to support custom_vjp

* fix code in a dy2static-friendly way.

* [dystatic] add hooker for prim

---------
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
Co-authored-by: Njiangcheng <thisjiang@qq.com>
Co-authored-by: Ncxxly <chenxx_id@163.com>

* [prim] enable dygraph_to_static to support custom_vjp

* fix cast prim and vjp dtype mapping error bug

* recover

* big tol

* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)

* [CINN]Enhance CacheKey hash logic by considering input dtypes

* add unittest

* fix typo

* fix typo

* fix map.at

* fix find

* fix test

* fix cinn cache key structure realize

* using ordered map for attributes

* add test by review advice

---------
Co-authored-by: Njiangcheng <thisjiang@qq.com>

* [prim] enable dygraph_to_static to support custom_vjp

* Pr 50885 (#7)

* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)

* [CINN]Enhance CacheKey hash logic by considering input dtypes

* add unittest

* fix typo

* fix typo

* fix map.at

* fix find

* fix test

* fix cinn cache key structure realize

* using ordered map for attributes

* add test by review advice

---------
Co-authored-by: Njiangcheng <thisjiang@qq.com>

* [prim] enable dygraph_to_static to support custom_vjp

* fix code in a dy2static-friendly way.

* [dystatic] add hooker for prim

---------
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
Co-authored-by: Njiangcheng <thisjiang@qq.com>
Co-authored-by: Ncxxly <chenxx_id@163.com>

* [prim] enable dygraph_to_static to support custom_vjp

* fix cast prim and vjp dtype mapping error bug

* Cxx prim custom vjp (#8)

* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)

---------
Co-authored-by: Njiangcheng <thisjiang@qq.com>

* [prim] enable dygraph_to_static to support custom_vjp

* Pr 50885 (#7)

* [CINN]Enhance CacheKey hash logic by considering input dtypes (#50557)

* [CINN]Enhance CacheKey hash logic by considering input dtypes

---------
Co-authored-by: Njiangcheng <thisjiang@qq.com>

* [prim] enable dygraph_to_static to support custom_vjp

* fix code in a dy2static-friendly way.

* [dystatic] add hooker for prim

---------
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
Co-authored-by: Njiangcheng <thisjiang@qq.com>
Co-authored-by: Ncxxly <chenxx_id@163.com>

* [prim] enable dygraph_to_static to support custom_vjp

* fix cast prim and vjp dtype mapping error bug

* [dy2static-ci] fix dy2static ci errors.

---------
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
Co-authored-by: Njiangcheng <thisjiang@qq.com>
Co-authored-by: Ncxxly <chenxx_id@163.com>

* [Prim] enable whitelist and blacklist for custom_vjp

* debug log

* clear log

* fix

* nothing

* less memory

* recover utils

* fix

* modify threshold value

* skip layer_norm for test_bert

* back to bert success state

* add epsion

* delete unnecessary compute

* modify amp dtype

* modify * order

* delete sqrt check and fp16

---------
Co-authored-by: Nxuyongsheng <xuyongsheng@baidu.com>
Co-authored-by: Nxysheng-baidu <121540080+xysheng-baidu@users.noreply.github.com>
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
Co-authored-by: Njiangcheng <thisjiang@qq.com>
Co-authored-by: Ncxxly <chenxx_id@163.com>
Co-authored-by: Nxiongkun <807377414@qq.com>

802a81d0

16 3月, 2023 1 次提交
- C
  rename flash_attn_raw to flash_attn_unpadded (#51704) · 0b778bdc
  由 Chitsing KUI 提交于 3月 16, 2023
```
* rename flash_attn_raw to flash_attn_unpadded

* fix static api

* fix static return
```
  0b778bdc
14 3月, 2023 1 次提交
- W
  
  fix rank=1 (#51413) · b4f49aa1
  由 wangxiaoning 提交于 3月 14, 2023
  
  b4f49aa1
01 3月, 2023 1 次提交

Integration flash attention (#49869) · 61611786

由 Chitsing KUI 提交于 3月 01, 2023

* flash attn

* seed

* almost

* softmax

* fix workspace

* add unitest; linux only

* fix setup

* fix datatype include

* fix setup typo

* fix def scope

* new error api

* use paddle fork

* fix attr bug; complete ut

* update flash hash

* fix rng reset

* fix offset

* fix comments

61611786

12 1月, 2023 1 次提交

lerp support 0 Tensor (#49667) · 8cd0d5b3

由 sunli 提交于 1月 12, 2023

* lerp support 0 Tensor

* fix lerp grad

* fix lerp zero test

* fix 0D + ND/ND + 0D

* fix check

* update code

* fix lerp infer shape

* static backward test

* updata static graph test

8cd0d5b3

16 12月, 2022 1 次提交
- Y
  
  0d tensor for scatter_ and scatter_nd (#49072) · 74582aaa
  由 Yuang Liu 提交于 12月 16, 2022
  
  74582aaa
03 12月, 2022 1 次提交
- Y
  
  Scatter 0D index for gather, 0D index and 0D updates for scatter. (#48452) · f9815bfe
  由 Yuang Liu 提交于 12月 03, 2022
  
  f9815bfe
17 11月, 2022 1 次提交
- Y
  [PHI]Standardise some C++ API (Part5) (#47860) · f3650201
  由 YuanRisheng 提交于 11月 17, 2022
```
* standard api

* fix xpu bugs
```
  f3650201
15 11月, 2022 1 次提交
- Y
  
  Update for scatter support fake 2d index (#47946) · e65bac28
  由 Yuang Liu 提交于 11月 15, 2022
  
  e65bac28
14 11月, 2022 1 次提交
- [Zero-Dim] support input 0D Tensor as scalar attribute for some api (#47689) · e0be4b94
  由 zhouweiwei2014 提交于 11月 14, 2022
```
* [Zero-Dim] support input 0D Tensor as scalar attribute for some api

* fix doc
```
  e0be4b94
01 11月, 2022 1 次提交
- Y
  [PHI]Standardise some C++ API (Part2) (#47510) · 399047d7
  由 YuanRisheng 提交于 11月 01, 2022
```
* standard_api

* add hardtanh
```
  399047d7
31 10月, 2022 1 次提交
- Y
  [PHI]Standardise some C++ API (#47385) · 60e0c506
  由 YuanRisheng 提交于 10月 31, 2022
```
* standard api

* fix ci bugs

* fix ci bugs

* fix ce bugs
```
  60e0c506
17 10月, 2022 1 次提交
- Y
  [PHI]Modify DataLayout's namespace from paddle::experimental to phi (#46869) · ec749398
  由 YuanRisheng 提交于 10月 17, 2022
```
* namespace modify

* update by comment
```
  ec749398
12 8月, 2022 3 次提交

Z

refix index resize in multiclassnms3 (#45095) · 49e2a4d8
由 zhiboniu 提交于 8月 12, 2022

49e2a4d8
Z

fix extra output of kernels for inference (#45048) · 1cb883da
由 zyfncg 提交于 8月 12, 2022

1cb883da

[geometric]Add paddle.geometric.send_ue_recv API (#43174) · 615b15a3

由 Siming Dai 提交于 8月 12, 2022

* add init file

* add op definition and infermeta

* add kernel definition funcs

* add broadcast infer shape

* add gpu forward kernel

* delete SUB and DIV

* add x_grad

* add template

* add e_grad for min and max

* fix small bug

* temp commit

* temp commit

* add e_grad for sum and mean

* fix some compile bug

* fix compile bugs

* fix compile problem

* add sum forward unittest

* fix broadcast error, add kernel sig, register e_grad, change unit test

* fix grad

* add temp grad fix

* temp commit

* add min max unittest

* add max, min unittest, fix mul bug

* add cpu forward sum and mean

* add forward min max, fix mean unittest

* add cpu backward min max

* fix code-style

* add backward sum mean

* fix rocm ci

* set uniitest timeout

* fix bug of x broadcast to e, gpu grad

* fix bug of x broadcast to e, cpu grad

* rename BOOST_GET_CONST macro

* fix rocm ci

* mv graph_send_e_recv to graph_send_ue_recv

* move out_size to IntArray

* add eager op test

* fix max pool type bug, add unittest for api

* revise api doc

* add fp16 for atomic min and max, add unittest

* add unittest

* add fp16 support for graph_send_recv

* fix unittest fp16 bug

* change OutSizeTensor to Out_size

* move E to Y

* add copyright, fix comment

* review code

* fix thread block size

* fix thread block size

* change api attribute name: pool_type to reduce_op, compute_type to message_op

* change api attribute name, move pool_type to reduce_op, move compute_type to message_op

615b15a3

09 8月, 2022 1 次提交

[geometric]Add paddle.geometric.send_u_recv API (#44580) · 34b43555

由 Siming Dai 提交于 8月 09, 2022

* change out_size to INTArray

* fix out_size eager bug

* add unittest for out_size tensor

* add deprecated for paddle.incubate.graph_send_recv, add paddle.geometric.send_u_recv and unittests

* fix lowest bug

* fix according review comment

* add default value in yaml

* change api file name

* change name

34b43555

03 8月, 2022 1 次提交
- Z
  transfer op multiclass_nms3 to phi (#44765) · 15ce2c1b
  由 zhiboniu 提交于 8月 03, 2022
```
* add cmake enforce

* transfer multiclass_nms3  to phi
```
  15ce2c1b
01 8月, 2022 1 次提交
- Z
  
  Revert for cmake static library errors on XPU KP #44762 · f15d930a
  由 zhiboniu 提交于 8月 01, 2022
  
  f15d930a
29 7月, 2022 1 次提交
- Z
  
  phi_multiclass_nms3 (#44613) · a9919903
  由 zhiboniu 提交于 7月 29, 2022
  
  a9919903
28 7月, 2022 1 次提交

[PHI] Move spectral_norm to phi (#44577) · 768e50c9

由 Lin Manhui 提交于 7月 28, 2022

* Add kernel declarations

* Copy kernel implementation code

* Transfer implementation code

* Fix: Move out_grad to first

* Register new kernels

* Remove old kernels

* Move out_grad to last

* Fix bugs

* Transfer infermeta

* Add yaml files

* Add blank line

* Fix code style

* Optimize directory structure
Co-authored-by: NBobholamovic <linmanhui@baidu.com>

768e50c9

26 7月, 2022 1 次提交
- L
  
  [Phi] Migrate box coder to phi. (#44550) · 98f8fa4c
  由 lyq 提交于 7月 26, 2022
  
  98f8fa4c
12 7月, 2022 1 次提交
- C
  [PHI] Clean glog header in public header (#44216) · b0c9f24a
  由 Chen Weihang 提交于 7月 12, 2022
```
* clean glog header in public header

* move marco pos
```
  b0c9f24a
08 6月, 2022 1 次提交
- Y
  [Phi]Move group op kernel into PHI and add yaml / unittest (#43104) · 99c6497b
  由 YuanRisheng 提交于 6月 08, 2022
```
* move_group_norm

* move group norm backward

* fix code format

* modify code according comment
```
  99c6497b
05 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：clang-format (#42840) · a3730dc8
  由 Sing_chan 提交于 6月 05, 2022
  
  a3730dc8
30 5月, 2022 1 次提交
- A
  [fix] addmm supports 1-d input (#42959) · 849d937b
  由 Aganlengzi 提交于 5月 30, 2022
```
* addmm supports 1-d input

* fix coverage

* fix

* more ut
```
  849d937b
27 5月, 2022 1 次提交

[Phi] Change optional tensor from `optional<const Tensor&>` to `optional<Tensor>` (#42939) · 6d78524c

由 zyfncg 提交于 5月 27, 2022

* refactor the optional tensor

* remove optiona<MetaTensor> in InferMeta

* fix bug

* fix optional<vector<Tensor>>

* fix bug

* fix rmsprop

* fix amp of eager_gen

* polish code

* fix deleted code

* fix merge conflict

* polish code

* remove is_nullopt_

* fix merge conflict

* fix merge conflict

6d78524c

26 5月, 2022 1 次提交
- Y
  
  move instance_norm_double_grad (#43021) · b2b78cd4
  由 YuanRisheng 提交于 5月 26, 2022
  
  b2b78cd4
12 4月, 2022 1 次提交

Add layer norm yaml (#41589) · 43d5cca6

由 hong 提交于 4月 12, 2022

* add layer norm infermeta

* add layer norm yaml

* polish layer norm infer meta

* add layer norm to black list

43d5cca6

07 4月, 2022 1 次提交
- Y
  [Phi]Add hard_swish/kron/linspace/logit yaml file (#41298) · 90cb337e
  由 YuanRisheng 提交于 4月 07, 2022
```
* add yaml

* perfect converage
```
  90cb337e
03 4月, 2022 1 次提交
- Z
  Add randperm and range yaml (#41265) · fd1ecfc5
  由 zyfncg 提交于 4月 03, 2022
```
* add randperm and range yaml

* add eager test for randperm
```
  fd1ecfc5
31 3月, 2022 1 次提交
- C
  
  fix conflict (#40851) · 74894cd7
  由 csy0225 提交于 3月 31, 2022
  
  74894cd7
28 3月, 2022 1 次提交
- Y
  
  [phi] move infershape: flip/maxout/take_along_axis/put_along_axis (#40974) · b6661d3a
  由 Yang 提交于 3月 28, 2022
  
  b6661d3a
22 3月, 2022 1 次提交

[phi] Update graph_send_recv OP (#40509) · 67b46e45

由 Siming Dai 提交于 3月 22, 2022

* add out_size shape for graph_send_recv

* fix bug in register kernel: no const int& support

* add out_size in infermeta

* change unittest

* fix unittest

* fix out_size default value

* fix doc

* delete arg mapping

* add sig

* move -1 to 0

* move -1 to 0

67b46e45

18 3月, 2022 1 次提交
- Z
  [Phi] Move infershape of roi_pool to phi (#40682) · 579173d8
  由 zyfncg 提交于 3月 18, 2022
```
* move infershape of roi_pool to phi

* polish code
```
  579173d8
16 3月, 2022 1 次提交
- Z
  [Phi] Move roi_align grad kernel and infershape from fuild to phi (#40556) · 3898080e
  由 zyfncg 提交于 3月 16, 2022
```
* move roi_align_grad kernel

* move roi_align grad kernel and infershape to phi

* remove roi_align infershape
```
  3898080e

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功