提交 · 8fa3d367edcff1f7760b1fb67bae1c539d03f418 · BaiXuePrincess / Paddle

02 9月, 2020 1 次提交
- J
  - Cosmetic fixes to align with PADDLE_ENFORCE guidelines (#26891) · 5e874cc3
  由 Jacek Czaja 提交于 9月 02, 2020
```
test=develop
```
  5e874cc3
26 8月, 2020 1 次提交
- J
  
  Small change in conv2d and quantize pass (#26671) · 559e43ee
  由 joanna.wozna.intel 提交于 8月 26, 2020
  
  559e43ee
17 7月, 2020 1 次提交
- J
  
  [oneDNN] cache cosmetics improvement (#25576) · 7dbc441e
  由 Jacek Czaja 提交于 7月 17, 2020
  
  7dbc441e
23 6月, 2020 1 次提交

Refactor of conv fp32 oneDNN operator (#25137) · bd0b38e6

由 Adam 提交于 6月 23, 2020

* Refactor of conv fp32 oneDNN operator
test=develop

* Formatting fix
test=develop

* Return Enforces
test=develop

* GetWeights improvements
test=develop

bd0b38e6

26 5月, 2020 1 次提交

Update PADDLE_ENFORCE in DNNL related ops (#24333) · c3c61d34

由 lidanqing 提交于 5月 26, 2020

* Update PADDLE_ENFORCE in DNNL related ops
test=develop

* Abstract macro of OP_GET_PLACE_CHECK
test=develop

* update according to reviews

* update GET_PLACE_CPU_CHECK

* fix typo
test=develop

* revert macro
test=develop

c3c61d34

14 5月, 2020 2 次提交
- P
  Hide globals & redesign restore PR (#24279) · db2b6b65
  由 pawelpiotrowicz 提交于 5月 14, 2020
```
test=develop
```
  db2b6b65
- F
  update conv error info (#24430) · 526a2117
  由 FDInSky 提交于 5月 14, 2020
```
* test=develop update conv error info

* test=develop update iou_similarity error info

* test=develop update some error info based review
```
  526a2117
17 3月, 2020 1 次提交
- A
  
  Revert "Change ShareDataWith() to TensorCopy() in conv_mkldnn (#22695)" (#22985) · 5842ae67
  由 Adam 提交于 3月 17, 2020
  
  5842ae67
11 3月, 2020 1 次提交
- A
  
  Change ShareDataWith() to TensorCopy() in conv_mkldnn (#22695) · 056edf39
  由 Adam 提交于 3月 11, 2020
  
  056edf39
10 12月, 2019 1 次提交

MKL-DNN 1.0 Update (#20162) · e81f0228

由 Adam 提交于 12月 10, 2019

* MKLDNN v1.0 rebase to Paddle 1.6
test=develop

* Add hacky paddle::string::to_string() implementation

* vectorize<int64-t>() -> vectorize() cleanup
test=develop

* PADDLE_ENFORCE and void_cast fixes
test=develop

* Rebase changes
test=develop

* Cosmetics
test=develop

* Delete MKL from mkldnn.cmake
test=develop

* CMake debug commands
test=develop

* Delete MKLDNN_VERBOSE and rebase fixes
test=develop

* Rebase fixes
test=develop

* Temporarily disable int8 resnet101 vgg16 and vgg19 tests
test=develop

* Add libmkldnn.so.1 to python setup
test=develop

* Add libmkldnn.so.1 to inference_lib cmake after rebase
test=develop

* Post rebase fixes + FC int8 changes
test=develop

* Fix LRN NHWC
test=develop

* Fix NHWC conv3d
test=develop

* Windows build fix + next conv3d fix
test=develop

* Fix conv2d on AVX2 machines
test=develop

e81f0228

06 12月, 2019 1 次提交
- J
  - Fix to regression in performance of ResNet-50 training (#21588) · 8f5a93a0
  由 Jacek Czaja 提交于 12月 06, 2019
```
test=develop
```
  8f5a93a0
03 12月, 2019 1 次提交
- J
  
  [MKL-DNN] Conv2d and Conv2d transpose MKL-DNN NHWC support (#21466) · 18a5d307
  由 Jacek Czaja 提交于 12月 03, 2019
  
  18a5d307
29 11月, 2019 1 次提交

Add dygraph execution context (#20157) · ac854670

由 hong 提交于 11月 29, 2019

* add_dygraph_execution_context

* add dygraph infershape context and execution context; test=develop

* fix imperative bug; test=develop

* remove inputs outputs interface from execution context,
because it have same function with inputNames;
test=develop

* remove tracer_test ctest; test=develop

* fix split op bug; test=develop

* fix unitests bug; test=develop

* fix distribute test bug; test=develop

* fix ngraph compile bug; test=develop

* fix grad maker bug; test=develop

* fix load op bugs; test=develop

* fix operator.cc construct bug; test=develop

* remove useless name find in operator; test=develop

* add tracer_test; test=develop

* fix concat, split bug; test=develop

* remove tracer_test unitest; test=develop

* fix attribute check bug; test=develop

* add test code to fix converage; test=develop

* remove useless code, change check backward input in engin; test=develop

* unlock var type infer shape;test=develop

* add ShareAllLoD api; test=develop

* add dygraph infershape context unitest; test=develop

* remove increase and decrease lod in dygraph; test=develop

* addd override; test=develop

* fix increase descrease lod; test=develop

* fix paddle_enforce; test=develop

* disable lod op dygraph check; test=develop

* fix paddle enforce error; test=develop

* add comment for op_registry and OperatorBase; test=develop

* optimize the comment of op_registry; test=develop

* fix format of comment; test=develop

* fix format of comment; test=develop

* optimize the format of comment; test=develop

* optimize the format of the comment; test=develop

* optimize comment of op_registry; test=develop

ac854670

07 11月, 2019 1 次提交

Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21062) · 3fda695b

由 Adam 提交于 11月 07, 2019

* Add asymetric padding support for mkldnn pooling
test=develop

* Add asymetric padding support for mkldnn conv
test=develop

* Add asymetric padding support for mkldnn conv_transpose
test=develop

3fda695b

22 10月, 2019 1 次提交
- A
  Minor MKL-DNN conv int8 performance fixes (#20753) · 67b59ddb
  由 Adam 提交于 10月 22, 2019
```
test=develop
```
  67b59ddb
18 10月, 2019 1 次提交

Revert "Refactor conv computeINT8" (#20640) · 46e93f7c

由 lidanqing 提交于 10月 18, 2019

* Revert "Refactor conv computeINT8 (#19574)"

This reverts commit 2c32c2d6.

test=develop

* replace PADDLE_ENFORCE
test=develop

46e93f7c

17 10月, 2019 1 次提交

[MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241) · a1cd27f1

由 Jacek Czaja 提交于 10月 17, 2019

* - Flushing mkl-dnn cache

test=develop

- Disabled clearing cache for LoadModel

- Added clearing of mkl-dnn cache when Executor is created

test=develop

- Do not clear for GPU places

test=develop

- compilation fix

test=develop

* - Moved clearing of mkl-dnn cache in destructor of executor

test=develop

* - Compilation fix

test=develop

- Reverted conditional clearing of mkl-dnn cache in Executors's
  destructor

test=develop

- compilation fix

a1cd27f1

19 9月, 2019 1 次提交

Refactor conv computeINT8 (#19574) · 2c32c2d6

由 lidanqing 提交于 9月 19, 2019

* fix conflicts
test=develop

* change mask_bias_reorder
test=develop

* add ComputeMask function to make code clear
test=develop

* change according to reviews
test=develop

* change according to reviews
test=develop

2c32c2d6

14 9月, 2019 1 次提交
- A
  Add common CreateKey for mkldnn handlers (#19767) · d4413a54
  由 Adam 提交于 9月 14, 2019
```
test=develop
```
  d4413a54
10 9月, 2019 1 次提交
- A
  MKLDNN handler cleanup (#19713) · 428b2b9e
  由 Adam 提交于 9月 10, 2019
```
* MKLDNN handler cleanup

* MKLDNN handler cleanup
test=develop
```
  428b2b9e
04 9月, 2019 1 次提交
- A
  paddle::framework::vectorize() templatization (#19611) · 8d6d95cc
  由 Adam 提交于 9月 04, 2019
```
test=develop
```
  8d6d95cc
03 9月, 2019 1 次提交
- A
  using MKLDNNMemoryFormat = mkldnn::memory::format changes (#19568) · e94b26da
  由 Adam 提交于 9月 03, 2019
```
* using MKLDNNMemoryFormat = mkldnn::memory::format changes
test=develop

* PADDLE_ENFORCE update
test=develop
```
  e94b26da
29 8月, 2019 1 次提交
- L
  clean up intel labeled TODOs (#19476) · ba368bf6
  由 lidanqing 提交于 8月 29, 2019
```
test=develop
```
  ba368bf6
21 8月, 2019 1 次提交

Add generalized Conv+Activation MKLDNN fuse pass creation Part2 (#19237) · 97d1db18

由 Adam 提交于 8月 21, 2019

* Add generalized Conv+Activation MKLDNN fuse pass creation Part2
test=develop

* Undefined behaviour of GetAttrIfExists<> FIX
test=develop

97d1db18

15 8月, 2019 1 次提交
- A
  Add generalized Conv+Activation MKLDNN fuse pass creation (#19072) · b837689e
  由 Adam 提交于 8月 15, 2019
```
test=develop
```
  b837689e
12 8月, 2019 1 次提交
- J
  Replace Relu with bounded Relu in MobileNetV2 quantization (#18988) · bce72c7f
  由 joanna.wozna.intel 提交于 8月 12, 2019
```
test=develop
```
  bce72c7f
30 7月, 2019 1 次提交
- J
  [MKL-DNN] Fix int8 performance regression (#18758) · cfcb96d2
  由 Jacek Czaja 提交于 7月 30, 2019
```
test=develop

- optimization of TID to string

test=develop
```
  cfcb96d2
25 7月, 2019 1 次提交

change ComputeINT8 to template version to remove checking dst_datatype code (#18756) · 9ecd8ee7

由 lidanqing 提交于 7月 25, 2019

* change INT8 to template so that checking dst_dt with if-else could be removed. CI will be enabled after fixing reviews

* reverse user_residual_memory_p and user_bias_memory_p declaration scope
test=develop

9ecd8ee7

09 7月, 2019 1 次提交

Fix/gcc 4.8 ubt link error (#18558) · 667f88f9

由 Jiabin Yang 提交于 7月 09, 2019

* test=develop, fix docker with paddle nccl problem

* test=develop, fix/gcc_4.8_ubt_link_error

* test=develop, fix code format

667f88f9

28 6月, 2019 1 次提交

Fix potential mkldnn concat/pool/conv kernel issues (#18393) · 681d3553

由 Leo Zhao 提交于 6月 28, 2019

1. some key generation method is not aligned with PR#17965
2. enlarge ptr lifetime to avoid memory release if SetBlob fails
   otherwise it will get core dump.

test=develop

681d3553

13 6月, 2019 1 次提交

refactor the function ConvFwdPrimitiveDesc (#17897) · f8ecc3de

由 lidanqing 提交于 6月 13, 2019

* refractor the function ConvFwdPrimitiveDesc
test=develop

* change according to review
test=develop

* use pointer way without boost::optional
test=develop

* pass vector to function by reference instead of raw vector
test=develop

* change pointer to shared_ptr
test=develop

f8ecc3de

10 6月, 2019 1 次提交
- Z
  Remove attribute in Allocator::Allocate (#17878) · 3ece61f7
  由 Zeng Jinle 提交于 6月 10, 2019
```
* remove attribute in Allocator::Allocate, test=develop

* fix travis ci error, test=develop
```
  3ece61f7
07 6月, 2019 1 次提交
- Y
  Fix the accuracy issue while using float precision to get the scale. (#17884) · 14a32bf0
  由 Yihua Xu 提交于 6月 07, 2019
```
test=develop
```
  14a32bf0
28 5月, 2019 1 次提交

Improve mobilenetv2 INT8 performance by using INT8 relu as post-op (#17570) · 04b6c29e

由 lidanqing 提交于 5月 28, 2019

* add INT8 conv+relu6 fuse and enbale mobilentv2 INT8 test
test=develop

* change fasle and 0.0 to fuse_brelu and brelu_threshold
test=develop

change the "fuse_relu||fuse_brelu" to "unsigned_output"
test=develop

* Use relu instead of brelu as INT8 post-op because INT8 brelu is not enabled in mkldnn v0.18
test=develop

* continuous-integration fix
test=develop

04b6c29e

22 5月, 2019 1 次提交

Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130) · 2281ebf0

由 guomingz 提交于 5月 22, 2019

* Relu6 is the bottleneck op for Mobilenet-v2. As the mkldnn supports the conv/relu6 fusion, we implement it fusion via cpass way. Due to the int8 enabling for this fusion will be supported in MKLDNN v0.20, so this PR is focused on the fp32 optimization.

Below table shows the benchmark(FPS) which measured on skx-8180(28 cores)
Batch size | with fusion | without fusion
-- | -- | --
1 | 214.7 | 53.4
50 | 1219.727 | 137.280

test=develop

* Fix the format issue

test=develop

* Add the missing nolint comments.

test=develop

* Fix the typos.

test=develop

* Register the conv_brelu_mkldnn_fuse_pass for the MKLDNN engine.

test=develop

* Adjust the indentation.

test=develop

* Add the test_conv_brelu_mkldnn_fuse_pass case.

test=develop

* Slightly update the code per Baidu comments.
Let the parameter definition embedded into the code.
That's will make the code easy to understand.

test=develop

2281ebf0

16 4月, 2019 1 次提交

[MKL-DNN] Added reusing of primitive descriptors (fp32) (#16667) · 87a44b11

由 Jacek Czaja 提交于 4月 15, 2019

* - Reuse of conv PD

- conv transpose pd reused

- Added PD reusing of softmax and Batch Norm

- Refactoring and removal of not needed routines of mkl-dnn ops

test=develop

- Fix to reusing conv

test=develop

- Lint fixes

test=develop

- Further lint fixes

test=develop

- Lint  fixes

test=develop

- lint fixes

test=develop

- Lint workaround

test=develop

* - Fix after review on including boost as third party header

test=develop

* - Fix after review. Name change to something more descriptive

test=develop

87a44b11

28 3月, 2019 1 次提交

[MKL-DNN] Tensor modifications revert (#16462) · 26323274

由 Jacek Czaja 提交于 3月 28, 2019

* Revert "[MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233)"

This reverts commit 13816dd4.
Apart from enabling transformer for MKL-DNN

* Revert "- MKL-DNN pooling updated to set_prim_desc"

This reverts commit c63f6b20.

Conflicts:
	paddle/fluid/operators/mkldnn/concat_mkldnn_op.cc

* Revert "[MKL-DNN] MKL-DNN specific Tensor modification (#15429)"

test=develop

This reverts commit dec9cf53.

* - concat compilation fix

- lint

test=develop

- Lint fixes

test=develop

- Lint fixes

test=develop

- Fix Transpose MKLDNN op

test=develop

26323274

19 3月, 2019 1 次提交
- Z
  add allocator flags · 22715487
  由 zhhsplendid 提交于 3月 19, 2019
```
test=develop
```
  22715487
18 3月, 2019 1 次提交

Add cpu_quantize_pass for C-API quantization (#16127) · 2579ade4

由 Wojciech Uss 提交于 3月 18, 2019

* Add cpu_quantize_pass for C-API quantization

test=develop

* add cpu_quantize_pass test

* fix lint: add include memory unorderd_map and unordered_set

test=develop

* fuse_relu 1

test=develop

* tuned 2 without squash

* fixes

test=develop

* remove unused vars

test=develop

* refactored

test=develop

* fix lint c-style cast -> C++ style cast

test=develop

* remove QuantMax and c style casts

test=develop

* last usage of QuantMax removed

test=develop

* Fix Analysis Predictor UT

Check if memory_optimize_pass has already been added
to the analysis config before adding a new one, so
that it is not added multiple times.
test=develop

* change map to unordered_map

fix the forgotten part of cpu_quantize_pass_tester.cc

test=develop

* removed quantized attribute

* fixed cpu_quantize_pass_tester and op attr comments

test=develop

* removed redundant line

test=debug

* removed gmock

test=develop

* fix after merge

2579ade4

26 2月, 2019 1 次提交

- MKL-DNN pooling updated to set_prim_desc · c63f6b20

由 Jacek Czaja 提交于 2月 04, 2019

- MKLDNN ops revisited

- disabled softmax modifications

- disabled elementwise_add

- reverted LRN modifications

- reverted SUM primitive

- Partial reviing of softmax

- Enable softmax

- Softmax changes

- LRN is back

- LRN partially disabled

- LRN is back

- LRN fix

- compilation fixes

- Sum fixed(hopefully)

- Enabling (partially) elementwise_add

- Fixes to elemenwise_add

- Lint fixes

quantize fix

- compilation fix

test=develop

Disabling pooling

- Disabled quantize op

test=develop

c63f6b20

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致