提交 · 2bcbf8b0a8c4d6a4af521ad784ace8d1b3a60188 · Crayon鑫 / Paddle

11 10月, 2022 3 次提交

[cherry-pick] [PHI] relu6_grad kernel (#46501) (#46862) · 2bcbf8b0

由 Sławomir Siwek 提交于 10月 11, 2022

* [PHI] Migrate gelu kernels (#45596)

* gaussian random

* mkldnn to onednn renaming

* fix merge conflicts

* remove fluid code

* onednn renaming

* gelu fwd

* sort activations

* gelu gradient

* remove unused macros

* merge conflicts

* fix merge conflicts

* remove extra contraint from gelu op

* [PHI] relu6_grad kernel (#46501)

* Relu6

* remove fluid handler

* add individual kernel signature

* coding style

* replace bounded_relu with clip

* whitespace

* code style

2bcbf8b0

S
Revert pool+grad oneDNN kernel conversion (#45989) (#46860) · 7b3837e6
由 Sławomir Siwek 提交于 10月 11, 2022
```
Co-authored-by: NPiotr Paturej <48731682+piotrekobi@users.noreply.github.com>
```
7b3837e6
Y
[BugFix]Fix concat bugs when call onednn kernel (#46518) (#46845) · 6a6c7493
由 YuanRisheng 提交于 10月 11, 2022
```
* fix concat bug

* fix ci bugs

* fix ci bugs
```
6a6c7493

10 10月, 2022 5 次提交

[cherry-pick] [PHI] Migrate concat+grad, expand+grad, fill_constant … oneDNN... · fdd0d6d0

由 Sławomir Siwek 提交于 10月 10, 2022

[cherry-pick] [PHI] Migrate concat+grad, expand+grad, fill_constant … oneDNN kernels (#45863) (#46727)

* [PHI] Migrate concat+grad, expand+grad, fill_constant, nearest_interp and bilinear_interp oneDNN kernels (#45863)

* Migrate concat+grad, expand+grad, fill_constant, nearest_interp_v2 and bilinear_interp_v2 oneDNN kernels to PHI

* Remove old namespace variable

* Fix invalid out dims error

* Add mutable_data method to concat output

* Add check for -1 dim before computing out_dims

* Capitalize oneDNNGetDataType function name

* Change fill_constant kernel to correct PHI kernel

* Attempt to fix dims error

* Fix fill_constant (full) kernel

* update dependencies
Co-authored-by: NPiotr Paturej <48731682+piotrekobi@users.noreply.github.com>

fdd0d6d0

[cherry-pick] [PHI] Migrate sgd and stack oneDNN kernels (#46374) (#46729) · 25d61cd1

由 Sławomir Siwek 提交于 10月 10, 2022

* [PHI] Migrate sgd and stack oneDNN kernels (#46374)

* Convert slice+grad oneDNN fluid kernels to PHI

* Change mutable_data to Alloc

* Refactor licences

* update dependencies
Co-authored-by: NPiotr Paturej <48731682+piotrekobi@users.noreply.github.com>

25d61cd1

[PHI] Migrate slice, slice_grad, split, pad and pad3d oneDNN kernels (#46101) (#46726) · 51a91fee

由 Sławomir Siwek 提交于 10月 10, 2022

* Convert split, pad and pad3d kernels

* Convert slice+grad oneDNN fluid kernels to PHI

* change out->mutable_data to dev_ctx.Alloc
Co-authored-by: NPiotr Paturej <48731682+piotrekobi@users.noreply.github.com>

51a91fee

S
[PHI] migrate softmax_grad kernel (#46257) (#46725) · 44ecae6c
由 Sławomir Siwek 提交于 10月 10, 2022
```
* init

* remove softmaxop

* merge dev

* correct dir

* style
```
44ecae6c

[PHI] Shape op migration (#46051) (#46724) · 3cc3f60f

由 Sławomir Siwek 提交于 10月 10, 2022

* First approach

* Shape kernel corrected

* Compilation error fixed

* Resize corrected

* Registered types added

* Mistake corrected & types added

* sum kernel deleted
Co-authored-by: NPaulina Gacek <paulina.gacek.pl@gmail.com>

3cc3f60f

29 9月, 2022 3 次提交

傅
[cherry-pick] Add FP16 support for uniform in dygraph mode on Nvidia GPU (#46641) · a58663f3
由傅剑寒提交于 9月 29, 2022
```
Add FP16 support for uniform in dygraph mode on Nvidia GPU
Dev PR link PR46212
```
a58663f3

[cherry-pick] Open the clip_extra flag in save_inference_model (#46577) · d67da3dc

由 zyfncg 提交于 9月 29, 2022

* set flag of clip_extra in save_inference_model to true (#46151)

* open the clip_extra flag in paddle.static.save_inference_model, test=allcase (#46456)

* Open the clip_extra flag in TracedLayer.save_inference_model (#46473)

* open the clip_extra flag in paddle.static.save_inference_model, test=allcase

* set the defalut value of clip_extra in TracedLayer from False to True, test=allcase

* update english doc of paddle.static.save_inference_model, test=document_fix (#46484)

* Fix clip_extra logic in remove_training_info (#46534)

* fix clip_extra code in remove_training_info

* revert rnn opmaker clear

d67da3dc

L
[CherryPick][Fix] Remove std::trunc() in FloorDivideFunctor and... · f5956bec
由 Lin Manhui 提交于 9月 29, 2022
```
[CherryPick][Fix] Remove std::trunc() in FloorDivideFunctor and InverseFloorDivideFunctor (#45051) (#46504)
```
f5956bec

28 9月, 2022 1 次提交

[cherry-pick] Clear extra attrs of some ops in OpMaker (#46150, #46321,... · b2e4211d

由 zyfncg 提交于 9月 28, 2022

[cherry-pick] Clear extra attrs of some ops in OpMaker (#46150, #46321, #46418, #46451, #46457) (#46553)

* Clear extra attributes of some Op in OpMaker (Part4) (#46060)

* clear extra attr of some ops in opmaker

* revert clear use_cudnn for pool

* fix test_operator_desc

* fix Attr interface of OperatorBase

* clear extra attrs of condition op in opmaker (#46150)

* Clear extra attrs of lookup_table_v2 in OpMaker (#46321)

* clear extra attrs of look_up_table_v2 in opmaker

* fix bug

* clear extra attrs of quantize op in opmaker (#46418)

* delete repeated item

* clear extra attrs of distribute op in opmaker (#46451)

* clear extra atts of sequence_softmax in opmaker (#46457)

b2e4211d

27 9月, 2022 2 次提交

Z

fix shard_index kernel (#46491) (#46511) · 5711bbee
由 zhaoyingli 提交于 9月 27, 2022

5711bbee

[cherry-pick] clear extra attrs of some ops in OpMaker (#45845, #45984, 46060) (#46218) · 0cc2251f

由 zyfncg 提交于 9月 27, 2022

* Clear extra attrs of elementwise op in OpMaker (#45845)

* clear extra attrs of elementwise op in opmaker

* fix op_debug_string_test

* fix bug of grad_add

* fix sort of runtime attrs

* Clear extra attrs of scale in OpMaker (#45984)

* clear extra attr of scale in opmaker

* fix sum bug

* fix merge conflict

* fix minus

* Clear extra attributes of some Op in OpMaker (Part4) (#46060)

* clear extra attr of some ops in opmaker

* revert clear use_cudnn for pool

* fix test_operator_desc

* fix Attr interface of OperatorBase

* fix code stype

0cc2251f

26 9月, 2022 1 次提交
- H
  [cherrypick] Fix elementwise_sub sign reverse for mkldnn (#46107) · 6990edfe
  由 Hui Zhang 提交于 9月 26, 2022
```
* fix sub sign reverse for mkldnn

* refactor code as comment

* remove useless
```
  6990edfe
20 9月, 2022 10 次提交

H
[cherry-pick][xpu] update xdnn activations (#46282) · a43f960e
由 houj04 提交于 9月 20, 2022
```
* [XPU] update xdnn activations. (#46246)

* [XPU] update xpu cmake. test=kunlun
```
a43f960e
H
[PolishComments] Polish some code comments (#46032) (#46261) · 42e56f65
由 HongyuJia 提交于 9月 20, 2022
```
* polish code comments

* polish data_device_transform.cc
```
42e56f65

[Cherry-pick] Fix amp error cp (#46272) · da173c40

由 Jiabin Yang 提交于 9月 20, 2022

* [Eager] Fix ocr (#46124)

* fix linspace error in amp

* fix log

* fix amp error

* fix ocr error which caused by amp

* add more check

* rename dtype ns

* [Eager Bug fix]Fix Detection (#46147)

* fix linspace error in amp

* fix log

* fix amp error

* Revert "Simplify size op impl (#45808)"

This reverts commit c252b1de.

* fix_seg

* fix detection
Co-authored-by: NChen Weihang <sunny_cwh@163.com>
Co-authored-by: NChen Weihang <sunny_cwh@163.com>

da173c40

[Release/2.4][Cherry-pick] Fix bug of reduce_sum op (#46160) · 759736df

由 Ghost Screaming 提交于 9月 20, 2022

* Fix bug of reduce_sum op. When input.numel() > INT32_MAX,
its result is wrong.

* Cherry-pick of PR 46045

* Fix bug of reduce_sum kp op.

* Fix bug of reduce_sum kp operator compilation.
If compilation device is XPU, eigen kernel should be ignored.

759736df

W
Fix TransDataBackend Error when call unsqueeze using MKL Tensor (#46094) (#46186) · 50340302
由 WangZhen 提交于 9月 20, 2022
```
* Fix TransDataBackend Error when call unsqueeze using MKL Tensor

* Add UT

* Refine UT
```
50340302

[Cherry-pick] Sparse add InferMeta (#46235) · fd8ec4a1

由 zhangkaihuo 提交于 9月 20, 2022

cherry-pick : #46016, #46021, #45974

* [Sparse]Sparse add support gpu (#45974)

* [Sparse]Remove unused code (#46021)

* [Sparse] Add infer meta (#46016)

fd8ec4a1

J
[Eager] Fix linspace error in amp (#46088) (#46206) · 38c0fd02
由 Jiabin Yang 提交于 9月 20, 2022
```
* fix linspace error in amp

* fix log

* fix amp error
```
38c0fd02

(cherry-pick)Support some op refuse forward and fix some bugs (#46211) · bc92d5f5

由 Charles-hit 提交于 9月 20, 2022

* support cast op backward refuse forward and fix some bugs (#46173)

* support cast op backward refuse forward

* Fix the bug of high order unit test framework

* support sign op backward refuse forward (#46002)

bc92d5f5

[Cherry-pick] Update layoutautotune for inplace (#45826) (#46226) · c0324e82

由 niuliling123 提交于 9月 20, 2022

cherry-pick from #45826
LayoutAutotune 支持 inplace 类型的OP
 根据 Add eager layout autotune #45409 修改意见调整UseAutotune
将LayoutAutotune判断放到controller中，与AMP 判断保持一致

c0324e82

Fix wrong eigen header include (#46082) (#46202) · ac8cce20

由 zyfncg 提交于 9月 20, 2022

* fix wrong eigen header include

* fix complie bug

* fix nan_inf_utils_detail

* fix resource_manager

* fix conv_miopen_helper

ac8cce20

19 9月, 2022 7 次提交
- R
  [vision.ops.nms] Fix return order error and duplicate results with specific... · be84cac7
  由 RichardWooSJTU 提交于 9月 19, 2022
```
[vision.ops.nms] Fix return order error and duplicate results with specific inputs (#46148) (#46193)

* fix return order error and duplicate results with specific inputs
```
  be84cac7
- W
  
  Add symbolic shape deduction function for general Plugin mechanism (#46179) · a0566010
  由 weishengying 提交于 9月 19, 2022
  
  a0566010
- C
  (cherry-pick)support some op backward refuse forward (#46201) · adab3c59
  由 Charles-hit 提交于 9月 19, 2022
```
* add unit test for sum higher level op (#45961)

* support slice op backward refuse forward and add high level unit test (#45960)

* support tile op backward refuse forward (#45942)

* support expand_v2 op backward refuse forward (#45941)

* support concat backward refuse forward (#45940)
```
  adab3c59
- J
  [Cherry-pick] Support bmm and bmm_grad in xpu (#45887) (#46132) · 1c7e95cc
  由 Jiabin Yang 提交于 9月 19, 2022
```
* [PHI] Support bmm and bmm_grad in xpu (#45887)

* support bmm and bmm_grad in xpu

* add error removal

* test=kunlun

* refactor code for better structure

* test=kunlun

* add fp16 kernel for bmm

* test=kunlun

* test=kunlun
```
  1c7e95cc
- M
  Add INT8 support for fused_multi_transformer_op (#45284) (#46169) · db368d5b
  由 minghaoBD 提交于 9月 19, 2022
```
Co-authored-by: NRichardWooSJTU <37864677+RichardWooSJTU@users.noreply.github.com>
```
  db368d5b
- S
  
  fix broadcast kernel (#46158) · 860f6077
  由 sneaxiy 提交于 9月 19, 2022
  
  860f6077
- C
  Revert "Simplify size op impl (#45808)" (#46168) · dabb8f23
  由 Chen Weihang 提交于 9月 19, 2022
```
This reverts commit c252b1de.
```
  dabb8f23
17 9月, 2022 1 次提交
- Y
  
  fix compilation errors on mac arm64 (#46135) · f6dd2014
  由 Yuanle Liu 提交于 9月 17, 2022
  
  f6dd2014
16 9月, 2022 2 次提交

（cherry-pick）Fix split infershape in static mode and add convert rules for... · 4e09e402

由 Charles-hit 提交于 9月 16, 2022

（cherry-pick）Fix split infershape in static mode and add convert rules for fill_any_like op (#46079)

* Fix split bug in static mode (#45906)

* fix split bug in static mode

* modify code style

* modify code style

* add unit test for split

* add convert rules for fill_any_like op in paddle science (#45985)

* add convert rules for fill_any_like op in paddle science

* add unit test for fill_any_like op in paddle science

* modify fill_any_like convert rule

* modify fill_any_like convert rule dtype

4e09e402

[Cherry-pick] Normalize yaml name and label (#46052) · 8caaf85a

由 Chen Weihang 提交于 9月 16, 2022

* normalize yaml file name (#45894)

* Clear extra attributes of activation op in OpMaker (#45772)

* clear extra attr of activation op in opmaker

* fix syntax bug

* fix mkldnn kernel

* fix merge conflict

* fix bug

* [PHI] Normalize yaml op label (#45976)

* normalize yaml op label

* revert op_compat yaml change

* fix prelu and rnn compat problem

* replace api by op

* support assign op backward refuse forward (#45879)

* normize yaml backward op label (#46028)
Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com>
Co-authored-by: NCharles-hit <56987902+Charles-hit@users.noreply.github.com>

8caaf85a

15 9月, 2022 2 次提交
- W
  Support 0 shapes input Tensor for MKL slice (#45930) (#46072) · 903c87bd
  由 WangZhen 提交于 9月 15, 2022
```
Support 0 shapes input Tensor for MKL slice kernel
```
  903c87bd
- C
  Fix arm fp16 compile error (#45991) (#46048) · 91677eb4
  由 Chen Weihang 提交于 9月 15, 2022
```
* fix arm fp16 compile error

* polish macro impl
```
  91677eb4
14 9月, 2022 3 次提交
- J
  cherry pick delay tensorrt log (#45958) · 2ca65904
  由 JingZhuangzhuang 提交于 9月 14, 2022
```
* cherry pick delay tensorrt log
* Update trt_plugin.h
```
  2ca65904
- [chery-pick] Fix namespace error (#45925) (#46029) · 925e84bf
  由 engineer1109 提交于 9月 14, 2022
```
修复cuda11.7编译出错的问题
```
  925e84bf
- Y
  
  fix transformer bug, test=kunlun (#45983) · 20d168d9
  由 ykkk2333 提交于 9月 14, 2022
  
  20d168d9

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致