1. 27 Mar 2019, 1 commit
    • Memory optimize (#16410) · 8d22bc17
      Committed by liuwei1031
      * fix cdn issue, test=develop
      
      * fix memory optimize bugs, test=develop
      
      * fix memory optimize bugs, test=develop
      
      * remove add/sub_2 op, test=develop
      
      * disable memory_optimize by default, test=develop
      
      * disable inplace activation in python, test=develop
      
      * fix unittests, test=develop
      
      * fix unittests, test=develop
      
      * bug-fix, test=develop
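      As a rough illustration of the last two items (memory_optimize off by
      default, inplace activation disabled in Python), here is a minimal
      sketch of opting back in through BuildStrategy; the flags, layers, and
      shapes below are assumptions based on the fluid 1.x API of that time,
      not part of the commit itself.

      import paddle.fluid as fluid

      # A tiny hypothetical network, just to have a loss to compile against.
      x = fluid.layers.data(name='x', shape=[13], dtype='float32')
      y = fluid.layers.data(name='y', shape=[1], dtype='float32')
      pred = fluid.layers.fc(input=x, size=1)
      loss = fluid.layers.mean(fluid.layers.square_error_cost(input=pred, label=y))

      build_strategy = fluid.BuildStrategy()
      # After this change the optimizations are opt-in rather than on by default.
      build_strategy.memory_optimize = True   # cross-op memory reuse
      build_strategy.enable_inplace = True    # in-place execution where safe

      compiled_prog = fluid.compiler.CompiledProgram(
          fluid.default_main_program()).with_data_parallel(
              loss_name=loss.name, build_strategy=build_strategy)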
  2. 26 Mar 2019, 2 commits
  3. 15 Mar 2019, 1 commit
    • Support sync batch norm. (#16121) · 8ad672a2
      Committed by qingqing01
      * Support Sync Batch Norm.
      * Note: do not enable it when running on a single device.
      
      Usage:
      
      import paddle.fluid as fluid

      # `tp` is the training Program being compiled.
      build_strategy = fluid.BuildStrategy()
      build_strategy.sync_batch_norm = True
      binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
              loss_name=loss_mean.name,
              build_strategy=build_strategy)
  4. 31 Jan 2019, 1 commit
  5. 21 Jan 2019, 1 commit
  6. 12 Dec 2018, 1 commit
  7. 29 Nov 2018, 1 commit
  8. 15 Nov 2018, 1 commit
    • add mkldnn prop_kind phase for inference-only case to pooling and activations (#14278) · 8a1eeec5
      Committed by Sylwester Fraczek
      * add is_test to pooling and activations
      
      add prop_kind support for activation, conv, and pooling layers
      
      add a pass that sets is_test to true
      
      add transpiler version of is_test pass
      
      test=develop
      
      * patch test and pass
      
      test=develop
      
      * add pass to analyzer.h
      
      test=develop
      
      * add is_test attr description & pass only on mkldnn
      
      in:
      activation_op.cc
      batch_norm_op.cc
      conv_op.cc
      dropout_op.cc
      lrn_op.cc
      pool_op.cc
      sequence_pool_op.cc
      softmax_op.cc
      
      * fix is_test handling for activation pool and conv
      
      * change description of is_test for all layers again
      
      * remove GetAttr(use_mkldnn) from pass
      
      * rename correct_mkldnn_test_phase to is_test
      
      and remove dependency on MKLDNN
      test=develop
      
      * review fix magic number
      
      * two if(..)s into one
      
      * Check is_test once and pass mkldnn forward prop kind
      
      * dereference shared_ptr with * (without get())
      
      test=develop
      
      * add is_test_pass back
      
      test=develop
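      A rough sketch of the is_test attribute this pass and transpiler set,
      assuming the fluid 1.x layer APIs; the concrete layers and shapes are
      illustrative only.

      import paddle.fluid as fluid

      x = fluid.layers.data(name='x', shape=[3, 224, 224], dtype='float32')

      # For inference-only graphs, is_test=True picks the inference code path
      # of ops whose behaviour differs between training and testing; the pass
      # above sets the same attribute automatically on every op in the graph.
      conv = fluid.layers.conv2d(input=x, num_filters=8, filter_size=3, act='relu')
      bn = fluid.layers.batch_norm(input=conv, is_test=True)
      out = fluid.layers.dropout(bn, dropout_prob=0.5, is_test=True)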
  9. 09 Nov 2018, 1 commit
    • Add InferVarType for some op (#14201) · 6c6e6385
      Committed by chengduo
      * add_infer_var_type
      test=develop
      
      * InferVarTypeHelper-> VarTypeInferenceHelper
      test=develop
      
      * PassInputTypeAndDTypeOnOutput
       test=develop
      
      * follow comment
      test=develop
  10. 22 Oct 2018, 1 commit
  11. 23 Aug 2018, 2 commits
  12. 22 Aug 2018, 1 commit
  13. 10 Jul 2018, 1 commit
  14. 27 Jun 2018, 1 commit
    • bnorm+relu fuse for mkldnn (inference) (#11434) · 9a15c923
      Committed by pzelazko-intel
      * bnorm+relu fuse for mkldnn
      
      * separate fuse_relu function
      
      * bug fix
      
      * proper while range in inference_transpiler
      
      * description fix
      
      * review fix
      
      * review fix
      
      * unit test for fwd batch norm+relu MKLDNN fuse
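      A hedged sketch of how this fuse is typically reached from Python,
      assuming the fluid 1.x InferenceTranspiler API and a hypothetical saved
      model directory; the fuse only applies in MKLDNN-enabled CPU builds.

      import paddle.fluid as fluid

      place = fluid.CPUPlace()  # the MKLDNN fuses only apply on CPU
      exe = fluid.Executor(place)

      # Hypothetical model saved earlier with fluid.io.save_inference_model.
      inference_program, feed_names, fetch_targets = fluid.io.load_inference_model(
          dirname='bn_relu_model', executor=exe)

      # The inference transpiler rewrites the program for deployment; with an
      # MKLDNN build, a batch_norm followed by relu is fused into one kernel.
      t = fluid.transpiler.InferenceTranspiler()
      t.transpile(inference_program, place)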
  15. 20 Jun 2018, 1 commit
  16. 11 Jun 2018, 2 commits
  17. 07 Jun 2018, 1 commit
    • Mkldnn layout (#11040) · 3ff9ba0e
      Committed by mozga-intel
      * Add MKLDNN layout support in Paddle
      
      Add an MKLDNN layout to Paddle so that an MKLDNN-friendly memory layout
      can be used in MKLDNN-enabled OP kernels. Before this commit, NCHW
      was hardcoded in all MKLDNN op kernels. As a result, a non-optimized
      execution path was selected inside the MKLDNN primitives, which hurt
      performance.
      Besides the framework change, three MKLDNN OP kernels were updated
      to use the new MKLDNN layout: conv, pool2d, and batch_norm.
      Other MKLDNN OP kernels need to be updated in the same way to
      achieve the best performance.
      
      * Add MKLDNN layout support in activation OP
      
      * Don't populate layout from input to output when kMKLDNN in
      
      * Refine pool mkldnn op kernel
      
      * MKLDNN layout
      
      * Remove the inheritance from the tensor file
      
      * MKLDNN layout: refactoring
      
      * Remove additional #define to register new operator
      
      * Prepare mkldnn tests to work with layout
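      A sketch of exercising the three updated kernels, under the assumption
      that the build has MKLDNN and exposes the FLAGS_use_mkldnn switch; the
      env-var mechanism and the layer parameters are assumptions, not part of
      the commit.

      import os
      # Assumed switch: in an MKLDNN build, FLAGS_use_mkldnn turns the MKLDNN
      # kernels on; it has to be set before paddle.fluid is imported.
      os.environ['FLAGS_use_mkldnn'] = '1'

      import paddle.fluid as fluid

      # The three kernels updated for the new layout: conv2d, pool2d, batch_norm.
      x = fluid.layers.data(name='x', shape=[3, 32, 32], dtype='float32')
      conv = fluid.layers.conv2d(input=x, num_filters=8, filter_size=3)
      pool = fluid.layers.pool2d(input=conv, pool_size=2, pool_type='max')
      bn = fluid.layers.batch_norm(input=pool)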
  18. 08 May 2018, 1 commit
    • Clean OpProtoAndCheckerMaker · 0e78cb69
      Committed by Yu Yang
      Do not use the ctor.

      * Reduce the lines of code.
      * We can use a virtual function for Maker now.
      * The implementation does not care what the maker holds, so it is
      easier to refactor later.
  19. 03 May 2018, 1 commit
    • MKLDNN implementation of batch normalization (#9904) · 4a497b82
      Committed by Tomasz Patejko
      * Initial implementation of forward pass for MKLDNN batch norm
      
      * Added attributes for MKLDNN batch norm
      
      * MKLDNN batch norm forward pass passes unittest. Started working on backward
      
      * Backward pass for MKLDNN batch norm added
      
      * MKLDNN batch norm: scoring added to forward pass
      
      * MKLDNN batch norm: bias as input added; handling AnyLayout when kernel is looked up
      
      * MKLDNN batch norm: python unit tests added; mkldnn tests removed
      
      * MKLDNN batch norm: changes required by cpplint
      
      * MKLDNN batch norm: refactoring the operator
      
      * MKLDNN batch norm: saved variance inversed in backward pass for correct execution of MKLDNN unit tests
      
      * MKLDNN batch norm: refactoring, function for static/const cast to void* added
      
      * MKLDNN batch norm: remove AnyLayout from batch norm
      
      *  MKLDNN batch norm: only NCHW format is supported. Unittests refactored
      
      * MKLDNN batch norm: use_mkldnn added to attributes
      
      * MKLDNN batch norm: AnyLayout removed from unittest
      
      * MKLDNN batch norm: added CUDNN defines to batch norm
      
      * MKLDNN batch norm: undefined data_format variable corrected
      
      * MKLDNN batch norm: use_cudnn added, use of setUp method for configuring attributes
      
      * MKLDNN batch norm: added use_cudnn attribute to batch norm operator
      
      * MKLDNN batch norm: correcting batch norm unit tests for MKLDNN
      
      * MKLDNN batch norm: MKLDNN tests moved to another file; reverting changes for saved variance not being inverted
      
      * Change default layout to NCHW
      
      * MKLDNN batch norm: init_kernel_type method added to unit tests
      
      * MKLDNN batch norm: style changes
      
      * MKLDNN batch norm: unit tests refactored
      
      * MKLDNN batch norm: added use_mkldnn attribute to batch norm python interface
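      Since the last item exposes a use_mkldnn flag on the Python side, a
      minimal sketch of what using it looked like in the fluid API of that
      era; the surrounding layer call and shape are assumptions.

      import paddle.fluid as fluid

      x = fluid.layers.data(name='x', shape=[3, 32, 32], dtype='float32')

      # Per the commit, the MKLDNN batch_norm kernel supports only the NCHW
      # layout; use_mkldnn=True asks for that kernel on CPU.
      bn = fluid.layers.batch_norm(input=x, use_mkldnn=True)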
  20. 02 May 2018, 1 commit
  21. 11 Apr 2018, 1 commit
  22. 21 Mar 2018, 1 commit
  23. 19 Mar 2018, 2 commits
  24. 12 Feb 2018, 1 commit
  25. 10 Feb 2018, 2 commits
  26. 08 Jan 2018, 1 commit
    • cpu gpu transform function (#7191) · 0f353ab4
      Committed by Qiao Longfei
      * add rename guard
      
      * add device_data_transform
      
      * add device_data_transform_test
      
      * modify GetExpectedKernelType
      
      * update operator.run
      
      * support test test_label_semantic_roles
      
      * optimize code
      
      * optimize code
      
      * rename GetActualKernelType to GetExpectedKernelType
      
      * fix chunk_eval_op and device_data_transform_test
      
      * add is_same_place to place
      
      * optimize code, refine rename_guard
      
      * refine rename guard, add GetKernelTypeForVar
      
      * optimize code
      
      * add some log
      
      * rename guard
      
      * use sub scope to create var
      
      * fix compile
      
      * add IsInitialized for Tensor
      
      * add VarIsTensor
      
      * fix op_registry_test
      
      * test
      
      * tmp disable priority
      
      * restore switch_kernel.md
      
      * code clean
  27. 04 Jan 2018, 1 commit
  28. 26 Dec 2017, 1 commit
  29. 25 Dec 2017, 1 commit
    • Impl kernel hint (#6883) · af0c4c45
      Committed by Qiao Longfei
      * init kernel hint
      
      * fix typo
      
      * rm unused code
      
      * add include in op_kernel.h
      
      * restore op_kernel since it will be moved to op_kernel_type
      
      * change force_cpu to use_cpu
      
      * fix compilation
  30. 22 Dec 2017, 1 commit
  31. 20 Dec 2017, 1 commit
  32. 12 Dec 2017, 1 commit
    • Refine device context (#6433) · 61ec0b95
      Committed by QI JUN
      The main fixes are as follows:
      
      - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
      - remove `eigen_device` interface in base class  `DeviceContext`
      - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
      - remove unused `platform::EigenDeviceConverter`
      - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
      - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
  33. 28 Nov 2017, 1 commit
  34. 08 Nov 2017, 2 commits