提交 · ba368bf69621bdcf2c42fbb8a0ae25272f4df648 · BaiXuePrincess / Paddle

29 8月, 2019 1 次提交
- L
  clean up intel labeled TODOs (#19476) · ba368bf6
  由 lidanqing 提交于 8月 29, 2019
```
test=develop
```
  ba368bf6
21 8月, 2019 1 次提交

Add generalized Conv+Activation MKLDNN fuse pass creation Part2 (#19237) · 97d1db18

由 Adam 提交于 8月 21, 2019

* Add generalized Conv+Activation MKLDNN fuse pass creation Part2
test=develop

* Undefined behaviour of GetAttrIfExists<> FIX
test=develop

97d1db18

15 8月, 2019 1 次提交
- A
  Add generalized Conv+Activation MKLDNN fuse pass creation (#19072) · b837689e
  由 Adam 提交于 8月 15, 2019
```
test=develop
```
  b837689e
12 8月, 2019 1 次提交
- J
  Replace Relu with bounded Relu in MobileNetV2 quantization (#18988) · bce72c7f
  由 joanna.wozna.intel 提交于 8月 12, 2019
```
test=develop
```
  bce72c7f
30 7月, 2019 1 次提交
- J
  [MKL-DNN] Fix int8 performance regression (#18758) · cfcb96d2
  由 Jacek Czaja 提交于 7月 30, 2019
```
test=develop

- optimization of TID to string

test=develop
```
  cfcb96d2
25 7月, 2019 1 次提交

change ComputeINT8 to template version to remove checking dst_datatype code (#18756) · 9ecd8ee7

由 lidanqing 提交于 7月 25, 2019

* change INT8 to template so that checking dst_dt with if-else could be removed. CI will be enabled after fixing reviews

* reverse user_residual_memory_p and user_bias_memory_p declaration scope
test=develop

9ecd8ee7

09 7月, 2019 1 次提交

Fix/gcc 4.8 ubt link error (#18558) · 667f88f9

由 Jiabin Yang 提交于 7月 09, 2019

* test=develop, fix docker with paddle nccl problem

* test=develop, fix/gcc_4.8_ubt_link_error

* test=develop, fix code format

667f88f9

28 6月, 2019 1 次提交

Fix potential mkldnn concat/pool/conv kernel issues (#18393) · 681d3553

由 Leo Zhao 提交于 6月 28, 2019

1. some key generation method is not aligned with PR#17965
2. enlarge ptr lifetime to avoid memory release if SetBlob fails
   otherwise it will get core dump.

test=develop

681d3553

13 6月, 2019 1 次提交

refactor the function ConvFwdPrimitiveDesc (#17897) · f8ecc3de

由 lidanqing 提交于 6月 13, 2019

* refractor the function ConvFwdPrimitiveDesc
test=develop

* change according to review
test=develop

* use pointer way without boost::optional
test=develop

* pass vector to function by reference instead of raw vector
test=develop

* change pointer to shared_ptr
test=develop

f8ecc3de

10 6月, 2019 1 次提交
- Z
  Remove attribute in Allocator::Allocate (#17878) · 3ece61f7
  由 Zeng Jinle 提交于 6月 10, 2019
```
* remove attribute in Allocator::Allocate, test=develop

* fix travis ci error, test=develop
```
  3ece61f7
07 6月, 2019 1 次提交
- Y
  Fix the accuracy issue while using float precision to get the scale. (#17884) · 14a32bf0
  由 Yihua Xu 提交于 6月 07, 2019
```
test=develop
```
  14a32bf0
28 5月, 2019 1 次提交

Improve mobilenetv2 INT8 performance by using INT8 relu as post-op (#17570) · 04b6c29e

由 lidanqing 提交于 5月 28, 2019

* add INT8 conv+relu6 fuse and enbale mobilentv2 INT8 test
test=develop

* change fasle and 0.0 to fuse_brelu and brelu_threshold
test=develop

change the "fuse_relu||fuse_brelu" to "unsigned_output"
test=develop

* Use relu instead of brelu as INT8 post-op because INT8 brelu is not enabled in mkldnn v0.18
test=develop

* continuous-integration fix
test=develop

04b6c29e

22 5月, 2019 1 次提交

Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130) · 2281ebf0

由 guomingz 提交于 5月 22, 2019

* Relu6 is the bottleneck op for Mobilenet-v2. As the mkldnn supports the conv/relu6 fusion, we implement it fusion via cpass way. Due to the int8 enabling for this fusion will be supported in MKLDNN v0.20, so this PR is focused on the fp32 optimization.

Below table shows the benchmark(FPS) which measured on skx-8180(28 cores)
Batch size | with fusion | without fusion
-- | -- | --
1 | 214.7 | 53.4
50 | 1219.727 | 137.280

test=develop

* Fix the format issue

test=develop

* Add the missing nolint comments.

test=develop

* Fix the typos.

test=develop

* Register the conv_brelu_mkldnn_fuse_pass for the MKLDNN engine.

test=develop

* Adjust the indentation.

test=develop

* Add the test_conv_brelu_mkldnn_fuse_pass case.

test=develop

* Slightly update the code per Baidu comments.
Let the parameter definition embedded into the code.
That's will make the code easy to understand.

test=develop

2281ebf0

16 4月, 2019 1 次提交

[MKL-DNN] Added reusing of primitive descriptors (fp32) (#16667) · 87a44b11

由 Jacek Czaja 提交于 4月 15, 2019

* - Reuse of conv PD

- conv transpose pd reused

- Added PD reusing of softmax and Batch Norm

- Refactoring and removal of not needed routines of mkl-dnn ops

test=develop

- Fix to reusing conv

test=develop

- Lint fixes

test=develop

- Further lint fixes

test=develop

- Lint  fixes

test=develop

- lint fixes

test=develop

- Lint workaround

test=develop

* - Fix after review on including boost as third party header

test=develop

* - Fix after review. Name change to something more descriptive

test=develop

87a44b11

28 3月, 2019 1 次提交

[MKL-DNN] Tensor modifications revert (#16462) · 26323274

由 Jacek Czaja 提交于 3月 28, 2019

* Revert "[MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233)"

This reverts commit 13816dd4.
Apart from enabling transformer for MKL-DNN

* Revert "- MKL-DNN pooling updated to set_prim_desc"

This reverts commit c63f6b20.

Conflicts:
	paddle/fluid/operators/mkldnn/concat_mkldnn_op.cc

* Revert "[MKL-DNN] MKL-DNN specific Tensor modification (#15429)"

test=develop

This reverts commit dec9cf53.

* - concat compilation fix

- lint

test=develop

- Lint fixes

test=develop

- Lint fixes

test=develop

- Fix Transpose MKLDNN op

test=develop

26323274

19 3月, 2019 1 次提交
- Z
  add allocator flags · 22715487
  由 zhhsplendid 提交于 3月 19, 2019
```
test=develop
```
  22715487
18 3月, 2019 1 次提交

Add cpu_quantize_pass for C-API quantization (#16127) · 2579ade4

由 Wojciech Uss 提交于 3月 18, 2019

* Add cpu_quantize_pass for C-API quantization

test=develop

* add cpu_quantize_pass test

* fix lint: add include memory unorderd_map and unordered_set

test=develop

* fuse_relu 1

test=develop

* tuned 2 without squash

* fixes

test=develop

* remove unused vars

test=develop

* refactored

test=develop

* fix lint c-style cast -> C++ style cast

test=develop

* remove QuantMax and c style casts

test=develop

* last usage of QuantMax removed

test=develop

* Fix Analysis Predictor UT

Check if memory_optimize_pass has already been added
to the analysis config before adding a new one, so
that it is not added multiple times.
test=develop

* change map to unordered_map

fix the forgotten part of cpu_quantize_pass_tester.cc

test=develop

* removed quantized attribute

* fixed cpu_quantize_pass_tester and op attr comments

test=develop

* removed redundant line

test=debug

* removed gmock

test=develop

* fix after merge

2579ade4

26 2月, 2019 1 次提交

- MKL-DNN pooling updated to set_prim_desc · c63f6b20

由 Jacek Czaja 提交于 2月 04, 2019

- MKLDNN ops revisited

- disabled softmax modifications

- disabled elementwise_add

- reverted LRN modifications

- reverted SUM primitive

- Partial reviing of softmax

- Enable softmax

- Softmax changes

- LRN is back

- LRN partially disabled

- LRN is back

- LRN fix

- compilation fixes

- Sum fixed(hopefully)

- Enabling (partially) elementwise_add

- Fixes to elemenwise_add

- Lint fixes

quantize fix

- compilation fix

test=develop

Disabling pooling

- Disabled quantize op

test=develop

c63f6b20

25 2月, 2019 2 次提交

L
Enable function coverage for U8/S8 ConvMKLDNNOpKernel · 4acc5220
由 liangan1 提交于 2月 25, 2019
```
test=develop
```
4acc5220

[MKL-DNN] MKL-DNN specific Tensor modification (#15429) · dec9cf53

由 Jacek Czaja 提交于 2月 25, 2019

* - Implemented draft of primitive desc keeping in Tensor

test=develop

- TransposeMKLDNNHandler::AcquireSrcMemory was reimplemented

- Added nchw and nc formats setting for sake of compatiblity

Fixed unit tests

- Worakaround to problem with 5D data in conv

- Added 3D and 1D MKL-DNN formats for name handles for tensor

test=develop

- Fix to UTs

test=develop

- Conv fp32 op was updated

Cosmetic fixes

test=develop

- tensor mkldnn cosmetics

test=develop

- Moved most of mkl-dnn specific code from Tensor to mkl-dnn utils

* - Lint fixes

test=develop

* - setting prim dec in Tensor , sets also layout to kMKLDNN

test=develop

* - Moved creation of prim desc totally out of Tensor

test=develop

* - Cosmetic fixes adter review

test=develop

dec9cf53

29 1月, 2019 1 次提交
- K
  Make separate folders for mkldnn codes · b1bdcd4d
  由 Krzysztof Binias 提交于 1月 28, 2019
```
test=develop
```
  b1bdcd4d
21 1月, 2019 1 次提交

Memory optimization of depthwise conv op and group norm op (#15313) · 9f8f0fc2

由 Dun 提交于 1月 21, 2019

* mem opt

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* refine code  test=develop

* refine code  test=develop

* refine code  test=develop

* refine code  test=develop

* refine with cub test=develop

* fix mkldnn test && remove comments && test=develop

* polish code && test=develop

* add only_forward test && test=develop

9f8f0fc2

10 1月, 2019 1 次提交

Conv int8 residual (#15145) · 8f17c714

由 xiaolil1 提交于 1月 10, 2019

* Enable basic MKL-DNN INT8 Conv OP
test=develop

* Modify test case
test=develop

* Clean unittest code
test=develop

* Fix test
test=develop

* Modify test
test=develop

* Enable MKL-DNN INT8 Conv with Relu Fusion OP
test=develop

* Enable INT8 Conv with residual fusion OP
test=develop

* Modify code.
test=develop

* Modify basic INT8 Conv
test=develop

* Modify Conv.
test=develop

* fix style
test=develop

* Fix style
test=develop

* Fix test
test=develop

* Modify code.
test=develop

* Fix test
test=develop

8f17c714

07 1月, 2019 1 次提交

Conv int8 relu (#15130) · c8f101e5

由 xiaolil1 提交于 1月 07, 2019

* Enable basic MKL-DNN INT8 Conv OP
test=develop

* Modify test case
test=develop

* Clean unittest code
test=develop

* Fix test
test=develop

* Modify test
test=develop

* Enable MKL-DNN INT8 Conv with Relu Fusion OP
test=develop

* Modify basic INT8 Conv
test=develop

* fix type
test=develop

* Modify test
test=develop

c8f101e5

04 1月, 2019 1 次提交

Enable basic MKL-DNN INT8 Conv OP (#15124) · bbc93368

由 xiaolil1 提交于 1月 04, 2019

* Enable basic MKL-DNN INT8 Conv OP
test=develop

* Modify test case
test=develop

* Clean unittest code
test=develop

* Fix test
test=develop

* Modify test
test=develop

* Modify basic INT8 Conv
test=develop

bbc93368

20 12月, 2018 1 次提交
- Y
  Fix the regression issue and add the group unitest for conv2d (#14932) · 3babc801
  由 Yihua Xu 提交于 12月 20, 2018
```
* Add test items for mkldnn conv2d

* Fix the regression issue and pass the unit test for conv2d and conv3d

test=develop
```
  3babc801
07 12月, 2018 1 次提交
- Y
  Clean Code · 155328a4
  由 Yihua Xu 提交于 12月 07, 2018
```
test=develop
```
  155328a4
05 12月, 2018 1 次提交
- X
  allow customize kernel selection · 41c28d54
  由 Xin Pan 提交于 12月 05, 2018
```
test=develop
```
  41c28d54
03 12月, 2018 1 次提交
- Y
  
  Implement conv3d with mkldnn library (test=develop) · 669191c9
  由 Yihua Xu 提交于 12月 03, 2018
  
  669191c9
27 11月, 2018 1 次提交

- conv2d transpose MKL-DNN · fb24690a

由 Jacek Czaja 提交于 11月 20, 2018

test=develop

- Added new header for MKLDNN reuse functionality

- Extended conv2d_transpose GetExpectedKernelType for MKL-DNN supporrt

- Buildable conv transpose mkldnn and conv mkldnn using conv template

- Conv2d transpose roughlt implemented and buildable

- Added modifications conv2d transpose MKLDNN unit tests

- Fix to UT of conv2d transpose mkldnn op

- Wrong type of MKLDNN primitive was chosen for conv2d transpose

- HAcks for conv2d transpose

- UT enalbed

- Replaced copying loop with memcpy

- Draft of passing lambda into AcquireMemory

- Made reorder (IOHW->OIHW) to be called only once

fb24690a

15 11月, 2018 1 次提交

add mkldnn prop_kind phase for inference-only case to pooling and activations (#14278) · 8a1eeec5

由 Sylwester Fraczek 提交于 11月 15, 2018

* add is_test to pooling and activations

add prop_kind support for layers activation. conv and pooling

add a pass that sets is_test to true

add transpiler version of is_test pass

test=develop

* patch test and pass

test=develop

* add pass to analyzer.h

test=develop

* add is_test attr description & pass only on mkldnn

in:
activation_op.cc
batch_norm_op.cc
conv_op.cc
dropout_op.cc
lrn_op.cc
pool_op.cc
sequence_pool_op.cc
softmax_op.cc

* fix is_test handling for activation pool and conv

* change description of is_test for all layers again

* remove GetAttr(use_mkldnn) from pass

* rename correct_mkldnn_test_phase to is_test

and remove dependency on MKLDNN
test=develop

* review fix magic number

* two if(..)s into one

* Check is_test once and pass mkldnn forward prop kind

* dereference shared_ptr with * (without get())

test=develop

* add is_test_pass back

test=develop

8a1eeec5

09 11月, 2018 1 次提交
- Y
  
  Merge develop tiny fix · 26fb34c3
  由 Yu Yang 提交于 11月 09, 2018
  
  26fb34c3
08 11月, 2018 1 次提交
- K
  Changed hardcoded format to any in convolution and bumped MKL-DNN version to 0.17-rc · f1c1acf1
  由 Krzysztof Binias 提交于 11月 08, 2018
```
test=develop
```
  f1c1acf1
01 11月, 2018 1 次提交
- T
  MKLDNN conv residual data: primitive reuse interface used. Reorder done when formats are different · 8899d422
  由 Tomasz Patejko 提交于 10月 31, 2018
```
test=develop
```
  8899d422
31 10月, 2018 1 次提交
- T
  
  MKLDNN conv residual data: residual data is reorder when formats are incorrect · f11934cb
  由 Tomasz Patejko 提交于 10月 30, 2018
  
  f11934cb
21 10月, 2018 5 次提交
- T
  MKLDNN conv + elementwise_add fusion: skip connection attribute renamed.... · 4be45af1
  由 Tomasz Patejko 提交于 9月 27, 2018
```
MKLDNN conv + elementwise_add fusion: skip connection attribute renamed. Comments about patterns added.

test=develop
```
  4be45af1
- M
  MKLDNN conv + elementwise_add fusion: Fix output_data to point to the right... · f6881971
  由 Michal Gallus 提交于 9月 25, 2018
```
MKLDNN conv + elementwise_add fusion: Fix output_data to point to the right tensor, also fix transpiler integration
```
  f6881971
- T
  
  MKLDNN conv + elementwise_add fusion: further reformatting · bf95ac36
  由 Tomasz Patejko 提交于 9月 19, 2018
  
  bf95ac36
- T
  
  MKLDNN conv + elementwise_add fusion: parameter name changed to ResidualData · b8e54ab5
  由 Tomasz Patejko 提交于 9月 18, 2018
  
  b8e54ab5
- T
  MKLDNN conv + elementwise_add fusion: output and elemwise param share data in... · 41f3d78f
  由 Tomasz Patejko 提交于 9月 17, 2018
```
MKLDNN conv + elementwise_add fusion: output and elemwise param share data in conv primitive. Output is properly allocated
```
  41f3d78f

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致