提交 · 1d6fd81dbbdf63cf702ddb265208118f012e8f78 · PaddlePaddle / Paddle

16 2月, 2022 1 次提交

[Pten]Remove reshape and elementwise_add's registry code in Fluid (#39317) · c6478270

由 YuanRisheng 提交于 2月 16, 2022

* remove reshape and elementwise_add registry

* delete code

* fix bugs when run ci ut

* remove log

* fix bugs when run unit test

* fix bugs when run unit test

* fix bugs when run cinn

* fix bugs when run ci-mac-python3

* fix compile bugs

* fix compile bugs

* fix compile bugs

* fix bugs when run kunlun

* fix bugs when compile

* update code according comment

c6478270

15 2月, 2022 2 次提交

J

disabled unnecessary int reorders profiling (#39498) · 3581c075
由 jakpiase 提交于 2月 15, 2022

3581c075

[PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404

由 Aurelius84 提交于 2月 15, 2022

* #1 migrate dist-related type()-> dtype()

* move datatype function from pten -> fluid/framework

* change type() in imperative into convert(dtype())

* modify xx_tensor->type into xx_tensor->dtype

* change the set_type interface and the caller

* modify xx_tensor.type into xx_tensor.dtype

* fix mutable_data(place, dtype())

* change caller of mutable_data in pten and distributed

* change the caller of mutable_data in fluid/framework

* change the caller of mutable_data in imperative directory

* mutable_data: inference

* update the call of mutable_data

* transfer MakePenScalarArray MakePtenScalar ResetHolderWithType

* pass the compile. the next step is remove VarType in Pten

* fix all and remove VarType from pten. success in linux. Next task is other platform

* fix conflict with develop

* fix compiled error

* Fix reset conversion

* fix conflict

* fix compiled problem

* fix typo

* Fix << in tensor_utils.cc

* fix type->dtype

* fix unittest

* fix tensor init constructor

* fix DataTypeSize for BFloat16

* fix code style

* fix npu compiled error

* fix npu

* compile npu sucessfully

* fix conflict

* fix conflict
Co-authored-by: Nxiongkun <xiongkun03@baidu.com>

7e7e9404

11 2月, 2022 1 次提交

Added shape (U)INT8/BF16/FP32 oneDNN kernel (#36033) · 52bbaae9

由 jakpiase 提交于 2月 11, 2022

* added shape oneDNN kernel

* removed unnecessary import from test

* added skipping tests for GPU

* refactoring

* refactored shape kernel

* added tests in new framework

* removed one line

* minor change

* added newline at EOF

* added formatting

* added attributes as extra

52bbaae9

08 2月, 2022 1 次提交

Fix to #38126 (#39097) · f884edb9

由 Jacek Czaja 提交于 2月 08, 2022

* - 38126 potential fix

* - fix

* - build fix

* - another candidate fix

* - compilation fix

* - another fix

* - Fix to activation of NHWC being first oneDNN op in chain on oneDNN ops

* - compilation fix

* - added NHWC reotating for elementwise being first op

* - compilation fix

* - compilation fix

* - Added UT

* - cosmetic fixes

f884edb9

24 1月, 2022 1 次提交

Remved redundant defintions of likely/unlikely (#38911) · 43919d0a

由 Jacek Czaja 提交于 1月 24, 2022

* - more unlikely

* - compilation fix

* - removed redundant definition

* - fix

* - Fixes

* - compilation fix for windows

43919d0a

18 1月, 2022 2 次提交

Mish FP32/BF16 kernel, conv and fc fuse passes (#38623) · 1d18bc2c

由 Sławomir Siwek 提交于 1月 18, 2022

* Mish

* Change exp() library

* mish fuse pass

* mish attrs

* fixes

* mishop maker

* remove attrs

* mish kernal for bf16

* fc+mish fuse

* fix code format error

* Resolve merge conflicts

* Update mish operator version

* update mish variable to new naming convention

1d18bc2c

[Unify Tensors PR ] Merged Tensor into DenseTensor, test=allcases (#38914) · 2052f1e3

由 Zhanlue Yang 提交于 1月 18, 2022

* Merged LoDTensor with Tensor,test=allcases

* Patched python level LoDTensor

* Patched python level LoDTensor

* Merge Tensor into DenseTensor

* Fixed namespace issues,test=allcases

* Fixed merge issues

* Fixed inference issues

* Fixed NPU test issues

* Fixed merge issues

2052f1e3

17 1月, 2022 1 次提交
- J
  
  fix for conv2D training error (#38938) · 944ea436
  由 jakpiase 提交于 1月 17, 2022
  
  944ea436
15 1月, 2022 1 次提交

[Unify Tensors PR ] Merged LoDTensor with Tensor, test=allcases (#38880) · 88966b28

由 Zhanlue Yang 提交于 1月 15, 2022

* Merged LoDTensor with Tensor,test=allcases

* Patched python level LoDTensor

* Fixed example code failure

* Polished function names, removed duplicated forward declarations

88966b28

13 1月, 2022 1 次提交

Added mul BF16/FP32 FWD/BWD oneDNN kernel (#38552) · fc6eed5b

由 jakpiase 提交于 1月 13, 2022

* base changes for mul reimplementation

* empty commit

* tmp save

* full implementation of mul bf16/fp32 fwd bwd

* CI fix

* CI rerun

* changed unity build cmake to avoid gpu issues

* removed mul mkldnn from unity build

* added skipping tests if not cpu_bf16

* CI fix

* CI fix

* CI fix

fc6eed5b

12 1月, 2022 1 次提交
- S
  Fix conv act int8 scale (#38331) · 4825addd
  由 Sylwester Fraczek 提交于 1月 12, 2022
```
* fix conv act int8 scale

* add unit test for conv+hard_swish
```
  4825addd
06 1月, 2022 1 次提交
- J
  Added exp FP32 FWD/BWD oneDNN kernel and optimized other oneDNN grad kernels (#38624) · 718183f1
  由 jakpiase 提交于 1月 06, 2022
```
* added exp activation and use_dst_for_bwd kernels

* CI RERUN

* minor change
```
  718183f1
05 1月, 2022 2 次提交
- W
  
  add depthwise_conv2d op for mkldnn (#38484) · e1cc2236
  由 wangxinxin08 提交于 1月 05, 2022
  
  e1cc2236
- J
  Fix for matmul_v2 oneDNN op broadcasting when inputs dims have different lengths (#38665) · 67923124
  由 jakpiase 提交于 1月 05, 2022
```
* fix for matmul_v2 broadcasting

* fix for output shape not broadcasted
```
  67923124
04 1月, 2022 1 次提交
- J
  
  added sqrt bf16 fwd/bwd (#38599) · 2d2609ea
  由 jakpiase 提交于 1月 04, 2022
  
  2d2609ea
30 12月, 2021 1 次提交

Added Conv2D BF16 BWD oneDNN kernel (#38507) · ed8ba011

由 jakpiase 提交于 12月 30, 2021

* working test for padding only

* added full conv2d grad kernel

* removed some trash

* minor change

* Ci fix

* format fix

ed8ba011

23 12月, 2021 1 次提交
- J
  Make GetBlob assuming elements are cached (#38336) · 7da5368d
  由 Jacek Czaja 提交于 12月 23, 2021
```
* First set of fixes

* - Make more likely to GetBlob find a blobs

* - Lint
```
  7da5368d
22 12月, 2021 1 次提交
- J
  
  Add nearest_interp/v2 int8 and uint8 support (#37985) · 56e2a6a6
  由 joanna.wozna.intel 提交于 12月 22, 2021
  
  56e2a6a6
20 12月, 2021 1 次提交
- S
  
  fix use of implicitly deleted constructor (#38225) · 23d9e947
  由 Sylwester Fraczek 提交于 12月 20, 2021
  
  23d9e947
14 12月, 2021 3 次提交
- S
  add map_matmul and fc_act_fuse passes to quant2_int8_mkldnn_pass (#38023) · 8f800dc0
  由 Sylwester Fraczek 提交于 12月 14, 2021
```
* add map_matmul passes to quant2_int8_mkldnn_pass

* fix fc+act fuse (activation scale)

* ci fix, c++17 structured bindings not available

* fix ci static check
```
  8f800dc0
- B
  add conv_gelu_mkldnn_fuse_pass (#38107) · 206a33b3
  由 baoachun 提交于 12月 14, 2021
```
* add conv_gelu_mkldnn_fuse_pass

* add post ops
```
  206a33b3
- S
  add reshape+transpose+matmul_v2 only (#37847) · a922168a
  由 Sylwester Fraczek 提交于 12月 14, 2021
```
* reshape+transpose+matmul_v2

* in_name->input_name

* fix pr-ci-static-check
```
  a922168a
07 12月, 2021 1 次提交
- Z
  Quantize slice op (#37630) · 2bd0f3c7
  由 Zuza 提交于 12月 07, 2021
```
* quantize slice op

* correct test

* fix code formatting
```
  2bd0f3c7
30 11月, 2021 3 次提交
- S
  refactoring matmul_v2 mkldnn hierarchy (#37622) · fab92824
  由 Sylwester Fraczek 提交于 11月 30, 2021
```
* refactoring matmul hierarchy

* review fix

* review fix

* review_FIX-part2
```
  fab92824
- S
  Add new unittests for gIOHW format in conv_transpose_mkldnn_op (#37344) · d93ee063
  由 Sławomir Siwek 提交于 11月 30, 2021
```
* Add new unittests

* Replace I with O channel for filter groups

* Undo changes affecting other operators

* Fix oneDNN namespace typo

* Fix code format error
```
  d93ee063
- G
  support data_format='NHWC' for prelu channel mode (#37019) · 3f2a665a
  由 Guoxia Wang 提交于 11月 30, 2021
```
* support data_format='NHWC' for prelu channel mode
```
  3f2a665a
29 11月, 2021 1 次提交
- P
  
  Add third batch of deprecated mkldnn namespace name changes (#37558) · 1ba81500
  由 piotrekobiIntel 提交于 11月 29, 2021
  
  1ba81500
24 11月, 2021 1 次提交
- P
  Changed second batch of deprecated mkldnn header and function names to new oneDNN names (#37351) · 7db7a0ec
  由 piotrekobiIntel 提交于 11月 24, 2021
```
* Add second batch of deprecated mkldnn namespace and macro changes

* Unlock CI

* Fix temporary namespace alias placing
```
  7db7a0ec
22 11月, 2021 1 次提交

disable copying of datatype when sharing buffer between two tensors. (#37247) · 9ec1432d

由 Feiyu Chan 提交于 11月 22, 2021

* disable copying of datatype when sharing buffer between two tensors.
* fix for mkldnn operator kernels (elementwise_add, sum, softplus, softmax, scale, activation), mannually set the data type when reusing memory by ShareBufferWith.

9ec1432d

17 11月, 2021 2 次提交

Replace custom IOHW -> OIHW reorder with build-in oneDNN reorder (#37175) · 162ac048

由 Sławomir Siwek 提交于 11月 17, 2021

* Use oneDNN reorder instead of custom one

* Fix whitespace typo

* Fix Code format error

* Incorporating feedback

* Remove unncessary reorder

* Support GIOHW format

* Fix code format error

162ac048

Changed first batch of deprecated mkldnn headers and function names to new oneDNN names (#37040) · ce3ee9bb

由 piotrekobiIntel 提交于 11月 17, 2021

* Change first batch of mkldnn headers and namespace names to dnnl

* Revert changes to tensor.h, which require approval

* Format changes with pre-commit

* Add int32 tests

* Fix int32 tests and call GetDataFromTensor for int32

* Fix test

ce3ee9bb

16 11月, 2021 2 次提交
- A
  Added BF16 Pool2d grad (#37081) · f95d44a2
  由 arlesniak 提交于 11月 16, 2021
```
* Added BF16 Pool2d grad

* upstream pulled

* fix for CI

* fixes after review
```
  f95d44a2
- J
  
  added onednn elu kernel (#37149) · ae40ee32
  由 jakpiase 提交于 11月 16, 2021
  
  ae40ee32
11 11月, 2021 1 次提交

Added softplus + activation oneDNN fuse pass (#36657) · a346c4dc

由 jakpiase 提交于 11月 11, 2021

* added softplus + activation fuse plass

* minor change

* implemented reviewer suggestion

* minor fix

* minor fix

* added scale_out parameter

* minor fix

* fix for iScan CI

* conditionally disabled logs

* refactored pass builder

a346c4dc

10 11月, 2021 1 次提交

Added stack FP32 FWD oneDNN kernel (#37002) · 99f9224c

由 jakpiase 提交于 11月 10, 2021

* added stack oneDNN FP32 op

* minor change

* CI fix

* added skipping for gpus

* fix for stack op

* CI fix

* CI fix

* Added comment

* CI fix

99f9224c

05 11月, 2021 2 次提交

J
Added caching of scales for bias in conv2d int8 (#36980) · 3705b12c
由 Jacek Czaja 提交于 11月 05, 2021
```
* - Cached bias scales

* - Fix

* - fixes after review

* - second round of fixes after internal review
```
3705b12c

Disable pool&conv_transpose&quantize caching (#36695) · db6c00c4

由 Jacek Czaja 提交于 11月 05, 2021

* - WIP

- compilation fix

- fix

- fixes

- fix

- fix

- fix again

- fix

- another fix

- another compilation fix

- fix

- fix

- fix

- lint

* - pool2d partially stripped from cache

- pool2d partially stripped of caching

* - compilation fix

* - compilation fix

* - Fix to UT of caching

* - Enabling test_conv3d_mkldnn

* - conv_transpose stripped of cache

* - compilation fix

* - fix

* - fix

* - compilation fix

* - fix

* Reverted disabling caching of conv2d

* - compilation fix

* - ut reverted

db6c00c4

02 11月, 2021 2 次提交
- J
  [Need review] Optimized and refactored oneDNN layer_norm kernel (#36917) · b6edaff8
  由 jakpiase 提交于 11月 02, 2021
```
* optimization for layernorm

* further refactoring

* added reviewer suggestions
```
  b6edaff8
- J
  [Need review] Added conv + hard_sigmoid oneDNN fuse pass (#36869) · 53690719
  由 jakpiase 提交于 11月 02, 2021
```
* added conv + hard_sigmoid fuse pass

* Removed IsOptional() statements

* Reverted removing optional
```
  53690719

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功