提交 · c548e370f1ebbfef249d788a50ba0e12410f8809 · 机器未来 / Paddle

13 8月, 2019 1 次提交
- L
  UT coverage for guassian_mkldnn_op and batch_norm_mkldnn_op (#19011) · c548e370
  由 lidanqing 提交于 8月 13, 2019
```
* integrations problem
test=develop

* add batch_norm_mkldnn_op backward-reuse test and guassian seed=0 test
test=develop
```
  c548e370
26 7月, 2019 1 次提交
- A
  
  Add LeakyReLU MKLDNN support (#18762) · ee022279
  由 Adam 提交于 7月 26, 2019
  
  ee022279
22 7月, 2019 1 次提交
- T
  Revert "Add LeakyRelu MKLDNN support (#18656)" (#18723) · bd22453f
  由 Tao Luo 提交于 7月 22, 2019
```
test=develop
```
  bd22453f
19 7月, 2019 1 次提交
- A
  Add LeakyRelu MKLDNN support (#18656) · d6b6a337
  由 Adam 提交于 7月 19, 2019
```
test=develop
```
  d6b6a337
09 7月, 2019 1 次提交
- P
  
  Add mkldnn int8 mul-op kernel (#17834) · 0caa08ea
  由 Physher 提交于 7月 09, 2019
  
  0caa08ea
24 5月, 2019 1 次提交

[MKL-DNN] Add Fully Connected Op for inference only(#15226) · 0c39b97b

由 Michał Gallus 提交于 5月 24, 2019

* fuse mul and elementwise add to fc

* Reimplement the FC forward operator

* Fix FC MKLDNN integration by transposing weights

* Add FC MKLDNN Pass

test=develop

* FC MKLDNN Pass: change memcpy to std::copy

* Fix MKLDNN FC handling of mismatch input and weights dims

* Lower tolerance for MKL-DNN in resnet50 test

test=develop

* Adjust FC to support MKLDNN Op placement

test=develop

* Adjust Placement Op to set use_mkldnn attribute for graph

test=develop

* MKLDNN FC: fix weights format so that gemm version is called

test=develop

* FC MKLDNN: Remove tolerance decrease from tester_helper

* FC MKL-DNN: Refactor the code, change input reorder to weight reorder

* MKL-DNN FC: Introduce operator caching

test=develop

* FC MKL-DNN: Fix the tensor type in ExpectedKernelType

test=develop

* FC MKL-DNN: fix style changes

test=develop

* FC MKL-DNN: fallback to native on non-supported dim sizes

test=develop

* FC MKLDNN: fix CMake paths

test=develop

* FC MKLDNN: Refine placement pass graph mkldnn attribute

test=develop

* Fix Transpiler error for fuse_conv_eltwise

test=develop

* Fix missing STL includes in files

test=develop

* FC MKL-DNN: Enable new output size computation

Also, refine pass to comply with newest interface.
test=develop

* FC MKL-DNN: enable only when fc_mkldnn_pass is enabled

* FC MKL-DNN: Allow Weights to use oi or io format

* FC MKL-DNN: Adjust UT to work with correct dims

test=develop

* Enable MKL DEBUG for resnet50 analyzer

test=develop

* FC MKL-DNN: Improve Hashing function

test=develop

* FC MKL-DNN: Fix shape for fc weights in transpiler

* FC MKL-DNN: Update input pointer in re-used fc primitive

* Add log for not handling fc fuse for unsupported dims

test=develop

* FC MKL-DNN: Move transpose from pass to Op Kernel

test=develop

* FC MKL-DNN: Disable transpose in unit test

test=develop

* FC MKL-DNN: Remove fc_mkldnn_pass from default list

* Correct Flag for fake data analyzer tests

test=develop

* FC MKL-DNN: Add comment about fc mkldnn pass disablement

test=develop

* FC MKL-DNN: Disable fc in int8 tests

test=develop

0c39b97b

22 5月, 2019 1 次提交

Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130) · 2281ebf0

由 guomingz 提交于 5月 22, 2019

* Relu6 is the bottleneck op for Mobilenet-v2. As the mkldnn supports the conv/relu6 fusion, we implement it fusion via cpass way. Due to the int8 enabling for this fusion will be supported in MKLDNN v0.20, so this PR is focused on the fp32 optimization.

Below table shows the benchmark(FPS) which measured on skx-8180(28 cores)
Batch size | with fusion | without fusion
-- | -- | --
1 | 214.7 | 53.4
50 | 1219.727 | 137.280

test=develop

* Fix the format issue

test=develop

* Add the missing nolint comments.

test=develop

* Fix the typos.

test=develop

* Register the conv_brelu_mkldnn_fuse_pass for the MKLDNN engine.

test=develop

* Adjust the indentation.

test=develop

* Add the test_conv_brelu_mkldnn_fuse_pass case.

test=develop

* Slightly update the code per Baidu comments.
Let the parameter definition embedded into the code.
That's will make the code easy to understand.

test=develop

2281ebf0

24 4月, 2019 1 次提交

Fix the bug of test_conv2d_int8_mkldnn case which raised by improper parameter passing (#17058) · 2deac4e4

由 guomingz 提交于 4月 24, 2019

* resolve #17057

Fixed the bug that fuse_relu/fuse_residual option couldn't be passed to class TestConv2dInt8Op.

test=develop

* Fix the bug of test_conv2d_int8_mkldnn case which raised by improper parameter passing.

test=develop

2deac4e4

16 4月, 2019 2 次提交
- L
  remove unnecessary new line · 1edcd731
  由 Leo Zhao 提交于 4月 16, 2019
```
test = develop
resolve #16764
```
  1edcd731
- L
  
  disable test_elementwise_mul_mkldnn_op case · 61cc842a
  由 Leo Zhao 提交于 4月 16, 2019
  
  61cc842a
12 4月, 2019 1 次提交
- L
  convert output to nchw format to align with native version in avx512 mode · a9694bd3
  由 Leo Zhao 提交于 4月 12, 2019
```
test = develop
resolve #16764
```
  a9694bd3
26 3月, 2019 1 次提交
- D
  
  revert test_softmax_cudnn. test=develop · 7920e3be
  由 dengkaipeng 提交于 3月 26, 2019
  
  7920e3be
22 3月, 2019 1 次提交

Enable MKL-DNN INT8 Concat Kernel. (#16156) · e235882c

由 xiaolil1 提交于 3月 22, 2019

* Enable INT8 Concat Kernel to improve the performance of MobileNet-SSD.
test=develop

* Optimize UT format.
test=develop

* Fix UT file address issue.
test=develop

* Refine the license year.
test=develop

* Optimize code for new API.
test=develop

* Restructure INT8 Concat kernel.
test=develop

e235882c

19 3月, 2019 1 次提交
- Z
  add allocator flags · 22715487
  由 zhhsplendid 提交于 3月 19, 2019
```
test=develop
```
  22715487
18 3月, 2019 2 次提交
- D
  
  add mkldnn support. test=develop · 365e6cfd
  由 dengkaipeng 提交于 3月 05, 2019
  
  365e6cfd
- X
  Enable INT8 transpose kernel for MobileNet-SSD improvement. (#16159) · e818fa10
  由 xiaolil1 提交于 3月 18, 2019
```
* Enable INT8 transpose kernel for MobileNet-SSD improvement.
test=develop

* Refine the license year.
test=develop

* Delete redundant code.
test=develop

* Add axis check.
test=develop
```
  e818fa10
06 3月, 2019 2 次提交

Add Requantize OP (#15318) · a177d482

由 xiaolil1 提交于 3月 06, 2019

* Enable INT8 ReQuantize OP
test=develop

* Clean code
test=develop

* Add comments
test=develop

* Revert "Clean code"
test=develop

This reverts commit a7a49b8a.

* Modify requantize op test
test=develop

* fix requantize UT by moving public function to public test file.
test=develop

* Fix test fail due to file address change.
test=develop

* Change file address for requantize op.
test=develop

a177d482

MKLDNN: Add UT for conv_transpose_mkldnn op. (#16030) · 21156b8d

由 lidanqing 提交于 3月 05, 2019

* MKLDNN: Add UT for conv_transpose_mkldnn op.
test=develop

* MKLDNN: Add fuse_bias check UT for conv_transpose_mkldnn op.
test=develop

21156b8d

05 3月, 2019 1 次提交

MKLDNN: Add UT for conv_transpose_mkldnn op. (#16030) · 02c106c7

由 lidanqing 提交于 3月 05, 2019

* MKLDNN: Add UT for conv_transpose_mkldnn op.
test=develop

* MKLDNN: Add fuse_bias check UT for conv_transpose_mkldnn op.
test=develop

02c106c7

04 3月, 2019 3 次提交
- L
  UT for conv2d_mkldnn_op with fuse_bias and fuse_residual (#16016) · 667bc256
  由 lidanqing 提交于 3月 04, 2019
```
test=develop
```
  667bc256
- K
  Add test for ceil mode · ea9d6731
  由 Krzysztof Binias 提交于 3月 01, 2019
```
test=develop
```
  ea9d6731
- L
  UT for conv2d_mkldnn_op with fuse_bias and fuse_residual (#16016) · dd1c7ee6
  由 lidanqing 提交于 3月 04, 2019
```
test=develop
```
  dd1c7ee6
01 3月, 2019 1 次提交
- K
  Add test for ceil mode · 54f21a5c
  由 Krzysztof Binias 提交于 3月 01, 2019
```
test=develop
```
  54f21a5c
25 2月, 2019 1 次提交
- K
  Add UTs to check whether primitives for activations and softmax already exist in backward · 851ea04d
  由 Krzysztof Binias 提交于 2月 25, 2019
```
test=develop
```
  851ea04d
21 2月, 2019 2 次提交
- K
  Fix for pylint Failed · 309ea6f2
  由 Krzysztof Binias 提交于 2月 21, 2019
```
test=develop
```
  309ea6f2
- K
  Add new ut and remove unnecessary code · 1578c60b
  由 Krzysztof Binias 提交于 2月 21, 2019
```
test=develop
```
  1578c60b
29 1月, 2019 1 次提交
- K
  Make separate folders for mkldnn codes · b1bdcd4d
  由 Krzysztof Binias 提交于 1月 28, 2019
```
test=develop
```
  b1bdcd4d

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致