提交 · 50b1cab1225405583514531d6ea3346b1f97ab8a · Crayon鑫 / Paddle

09 7月, 2019 1 次提交

Fix/gcc 4.8 ubt link error (#18558) · 667f88f9

由 Jiabin Yang 提交于 7月 09, 2019

* test=develop, fix docker with paddle nccl problem

* test=develop, fix/gcc_4.8_ubt_link_error

* test=develop, fix code format

667f88f9

19 6月, 2019 1 次提交

翟

fix spelling errors (#17941) · 802ea509

由翟飞跃提交于 6月 19, 2019

* fix spelling errors; test=develop

* Update API.spec

update md5

* Update API.spec

* change the order of api;test=develop

802ea509

16 6月, 2019 1 次提交

Update backward appending stragety to support double backward and fix some bug. (#18104) · 80d2e66f

由 qingqing01 提交于 6月 16, 2019

* Update backward.py:
     - If there is no input grad var in all outputs of previous ops, do not append this op into graph.
     - Only apply this stragety when double backward.
* Update some double backward op.
* Update sum_op to judge whether a tensor is empty by numel or IsInitialized().

80d2e66f

22 5月, 2019 1 次提交

Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130) · 2281ebf0

由 guomingz 提交于 5月 22, 2019

* Relu6 is the bottleneck op for Mobilenet-v2. As the mkldnn supports the conv/relu6 fusion, we implement it fusion via cpass way. Due to the int8 enabling for this fusion will be supported in MKLDNN v0.20, so this PR is focused on the fp32 optimization.

Below table shows the benchmark(FPS) which measured on skx-8180(28 cores)
Batch size | with fusion | without fusion
-- | -- | --
1 | 214.7 | 53.4
50 | 1219.727 | 137.280

test=develop

* Fix the format issue

test=develop

* Add the missing nolint comments.

test=develop

* Fix the typos.

test=develop

* Register the conv_brelu_mkldnn_fuse_pass for the MKLDNN engine.

test=develop

* Adjust the indentation.

test=develop

* Add the test_conv_brelu_mkldnn_fuse_pass case.

test=develop

* Slightly update the code per Baidu comments.
Let the parameter definition embedded into the code.
That's will make the code easy to understand.

test=develop

2281ebf0

10 5月, 2019 1 次提交

Double backward of conv2d. (#17211) · e32c9888

由 qingqing01 提交于 5月 10, 2019

* Add conv2d_grad_grad_op
* Extracte the cuDNN conv algo searching code in conv_cudnn_helper.h.
    - Now use it in conv2d_grad_grad.
    - Will simply the searching code in conv2d and conv2d_grad in next PR.
* Enhance and fix bug in unit testing of gradient_checker.
* Support to fetch empty variables，return None in Python.

e32c9888

23 4月, 2019 1 次提交
- Z
  Make conv cudnn workspace size configurable (#17036) · 0c335dcd
  由 Zeng Jinle 提交于 4月 23, 2019
```
* make_conv_cudnn_ws_size_configurable, test=develop

* change std::max to std::min
test=develop
```
  0c335dcd
15 4月, 2019 2 次提交
- T
  polish the code · e0f7bf4f
  由 tink2123 提交于 4月 15, 2019
```
test=develop
```
  e0f7bf4f
- T
  modified infer shape · ffe81af0
  由 tink2123 提交于 4月 15, 2019
```
test=develop
```
  ffe81af0
26 3月, 2019 1 次提交
- S
  fix some op grad maker · 7000ec85
  由 sneaxiy 提交于 3月 25, 2019
```
fix ctest eager deletion disable bug
test=develop
```
  7000ec85
19 3月, 2019 1 次提交
- Z
  add allocator flags · 22715487
  由 zhhsplendid 提交于 3月 19, 2019
```
test=develop
```
  22715487
18 3月, 2019 1 次提交

Add cpu_quantize_pass for C-API quantization (#16127) · 2579ade4

由 Wojciech Uss 提交于 3月 18, 2019

* Add cpu_quantize_pass for C-API quantization

test=develop

* add cpu_quantize_pass test

* fix lint: add include memory unorderd_map and unordered_set

test=develop

* fuse_relu 1

test=develop

* tuned 2 without squash

* fixes

test=develop

* remove unused vars

test=develop

* refactored

test=develop

* fix lint c-style cast -> C++ style cast

test=develop

* remove QuantMax and c style casts

test=develop

* last usage of QuantMax removed

test=develop

* Fix Analysis Predictor UT

Check if memory_optimize_pass has already been added
to the analysis config before adding a new one, so
that it is not added multiple times.
test=develop

* change map to unordered_map

fix the forgotten part of cpu_quantize_pass_tester.cc

test=develop

* removed quantized attribute

* fixed cpu_quantize_pass_tester and op attr comments

test=develop

* removed redundant line

test=debug

* removed gmock

test=develop

* fix after merge

2579ade4

25 2月, 2019 1 次提交
- L
  Enable function coverage for U8/S8 ConvMKLDNNOpKernel · 4acc5220
  由 liangan1 提交于 2月 25, 2019
```
test=develop
```
  4acc5220
21 2月, 2019 1 次提交
- X
  add per kernel config and remove const_cast. · 5eb87506
  由 Xin Pan 提交于 2月 21, 2019
```
test=develop
```
  5eb87506
13 2月, 2019 1 次提交
- C
  fix potential bug (#15688) · ad61e1b2
  由 chengduo 提交于 2月 13, 2019
```
test=develop
```
  ad61e1b2
21 1月, 2019 1 次提交

Memory optimization of depthwise conv op and group norm op (#15313) · 9f8f0fc2

由 Dun 提交于 1月 21, 2019

* mem opt

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* refine code  test=develop

* refine code  test=develop

* refine code  test=develop

* refine code  test=develop

* refine with cub test=develop

* fix mkldnn test && remove comments && test=develop

* polish code && test=develop

* add only_forward test && test=develop

9f8f0fc2

04 1月, 2019 1 次提交

Enable basic MKL-DNN INT8 Conv OP (#15124) · bbc93368

由 xiaolil1 提交于 1月 04, 2019

* Enable basic MKL-DNN INT8 Conv OP
test=develop

* Modify test case
test=develop

* Clean unittest code
test=develop

* Fix test
test=develop

* Modify test
test=develop

* Modify basic INT8 Conv
test=develop

bbc93368

25 12月, 2018 1 次提交
- S
  polish code · 3a2afbf0
  由 sneaxiy 提交于 12月 25, 2018
```
test=develop
```
  3a2afbf0
19 12月, 2018 1 次提交
- S
  rewrite variable type · ae6f46a1
  由 sneaxiy 提交于 12月 19, 2018
```
test=develop
```
  ae6f46a1
14 12月, 2018 1 次提交
- Y
  
  Fea/fuse conv elementwise add fuse (#14669) · a985949b
  由 Yan Chunwei 提交于 12月 14, 2018
  
  a985949b
12 12月, 2018 1 次提交
- Y
  Change tensor uses proto::VarType::type · 9bd70a1e
  由 Yu Yang 提交于 12月 11, 2018
```
test=develop
```
  9bd70a1e
07 12月, 2018 1 次提交
- Y
  Clean Code · 155328a4
  由 Yihua Xu 提交于 12月 07, 2018
```
test=develop
```
  155328a4
05 12月, 2018 2 次提交
- X
  follow comments · 82d68281
  由 Xin Pan 提交于 12月 05, 2018
```
test=develop
```
  82d68281
- X
  allow customize kernel selection · 41c28d54
  由 Xin Pan 提交于 12月 05, 2018
```
test=develop
```
  41c28d54
03 12月, 2018 1 次提交
- Y
  
  Implement conv3d with mkldnn library (test=develop) · 669191c9
  由 Yihua Xu 提交于 12月 03, 2018
  
  669191c9
19 11月, 2018 1 次提交
- Q
  Convolution fusion operator. (#14449) · fd7e6431
  由 qingqing01 提交于 11月 19, 2018
```
* Convolution fusion operator.
* Clean code
test=develop
```
  fd7e6431
15 11月, 2018 1 次提交

add mkldnn prop_kind phase for inference-only case to pooling and activations (#14278) · 8a1eeec5

由 Sylwester Fraczek 提交于 11月 15, 2018

* add is_test to pooling and activations

add prop_kind support for layers activation. conv and pooling

add a pass that sets is_test to true

add transpiler version of is_test pass

test=develop

* patch test and pass

test=develop

* add pass to analyzer.h

test=develop

* add is_test attr description & pass only on mkldnn

in:
activation_op.cc
batch_norm_op.cc
conv_op.cc
dropout_op.cc
lrn_op.cc
pool_op.cc
sequence_pool_op.cc
softmax_op.cc

* fix is_test handling for activation pool and conv

* change description of is_test for all layers again

* remove GetAttr(use_mkldnn) from pass

* rename correct_mkldnn_test_phase to is_test

and remove dependency on MKLDNN
test=develop

* review fix magic number

* two if(..)s into one

* Check is_test once and pass mkldnn forward prop kind

* dereference shared_ptr with * (without get())

test=develop

* add is_test_pass back

test=develop

8a1eeec5

09 11月, 2018 2 次提交

Add InferVarType for some op (#14201) · 6c6e6385

由 chengduo 提交于 11月 09, 2018

* add_infer_var_type
test=develop

* InferVarTypeHelper-> VarTypeInferenceHelper
test=develop

* PassInputTypeAndDTypeOnOutput
 test=develop

* follow comment
test=develop

6c6e6385

Exhaustive search for cuDNN conv. (#14286) · abe20923

由 qingqing01 提交于 11月 09, 2018

* exhaustive search for cuDNN conv.
* Refine code and add unit testing.
* Fix model load in fluid/inference and unit testing in conv2d
* Follow comments.
* Fix compiling test=develop

abe20923

07 11月, 2018 2 次提交
- Q
  Revert " Exhaustive search for cuDNN conv. (#14043)" · db8c52da
  由 qingqing01 提交于 11月 07, 2018
```
This reverts commit ce7d9b07.
```
  db8c52da
- Q
  Exhaustive search for cuDNN conv. (#14043) · ce7d9b07
  由 qingqing01 提交于 11月 07, 2018
```
* exhaustive search for cuDNN conv.
* Refine code and add unit testing.
* Clean code
* Fix model load in fluid/inference and unit testing in conv2d
* Follow comments.
```
  ce7d9b07
02 11月, 2018 1 次提交
- D
  
  refine tests. test=develop · eb2f7ed2
  由 dzhwinter 提交于 11月 02, 2018
  
  eb2f7ed2
22 10月, 2018 1 次提交
- X
  clean up after the changes have been stopped for so long. · 8f2116d8
  由 Xin Pan 提交于 10月 18, 2018
```
test=develop
```
  8f2116d8
21 10月, 2018 4 次提交
- T
  MKLDNN conv + elementwise_add fusion: skip connection attribute renamed.... · 4be45af1
  由 Tomasz Patejko 提交于 9月 27, 2018
```
MKLDNN conv + elementwise_add fusion: skip connection attribute renamed. Comments about patterns added.

test=develop
```
  4be45af1
- T
  
  MKLDNN conv + elementwise_add fusion: parameter name changed to ResidualData · b8e54ab5
  由 Tomasz Patejko 提交于 9月 18, 2018
  
  b8e54ab5
- T
  MKLDNN conv + elementwise_add fusion: output and elemwise param share data in... · 41f3d78f
  由 Tomasz Patejko 提交于 9月 17, 2018
```
MKLDNN conv + elementwise_add fusion: output and elemwise param share data in conv primitive. Output is properly allocated
```
  41f3d78f
- T
  
  MKLDNN conv + elementwis_add fusion: initial work on passing eltwise data to conv primitive · 56528531
  由 Tomasz Patejko 提交于 9月 17, 2018
  
  56528531
15 9月, 2018 1 次提交
- D
  
  debug version · 85f8dd1c
  由 dzhwinter 提交于 9月 15, 2018
  
  85f8dd1c
14 9月, 2018 1 次提交
- M
  
  Fuse Conv+BN+SkipConnectionAdd+ReLU with transpiler temporarily (#13350) · 8cbefd1a
  由 Michał Gallus 提交于 9月 14, 2018
  
  8cbefd1a
11 9月, 2018 1 次提交
- M
  
  Fuse MKLDNN's Conv + ReLU · 5d34ef61
  由 Michal Gallus 提交于 9月 04, 2018
  
  5d34ef61
10 9月, 2018 1 次提交
- K
  
  Reusing converted weights · 1658958f
  由 Krzysztof Binias 提交于 9月 10, 2018
  
  1658958f

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致