Commits · ddb120357c89c9fe54b7e7b23e11ec52f0aa7ab0 · PaddlePaddle / PaddleDetection

13 Nov, 2018 · 1 commit
- T
  refine lrn_op cpu forward and speedup · b4dfba17
  Committed by tensor-tang on Nov 13, 2018
```
test=develop
```
  b4dfba17
07 Jun, 2018 · 1 commit

Committed by mozga-intel on Jun 07, 2018

* Add MKLDNN layout support in Paddle

Add MKLDNN layout in Paddle so that an MKLDNN-friendly memory layout
can be used in MKLDNN-enabled OP kernels. Before this commit, NCHW
was hardcoded in all MKLDNN op kernels. As a result, a
non-optimized execution path was selected in the MKLDNN primitives, which
degraded performance.
Besides the framework change, three MKLDNN OP kernels were updated
to use the new MKLDNN layout: conv, pool2d, and batch_norm.
Other MKLDNN OP kernels also need to be updated in a similar way to
achieve the best performance.

* Add MKLDNN layout support in activation OP

* Don't populate layout from input to output when kMKLDNN in

* Refine pool mkldnn op kernel

* MKLDNN layout

* Remove the inheritance from tensor file

* MKLDNN layout: refactoring

* Remove additional #define to register new operator

* Prepare mkldnn tests to work with layout

3ff9ba0e

08 May, 2018 · 1 commit

Clean OpProtoAndCheckerMaker · 0e78cb69

Committed by Yu Yang on May 08, 2018

Do not use ctor

* Reduce lines of code.
* We can use virtual functions for Maker now.
* The implementation does not care what the maker holds, so it is easier to
refactor later.

0e78cb69

19 Apr, 2018 · 1 commit
- Y
  add semicolon to op registry (#10034) · e04c43d5
  Committed by Yang Yang(Tony) on Apr 18, 2018
```
* script to add semicolon

* fix typo
```
  e04c43d5
17 Apr, 2018 · 1 commit
- Y
  
  script to fix all · ce7c2e86
  Committed by Yang Yang on Apr 16, 2018
  
  ce7c2e86
12 Apr, 2018 · 1 commit
- S
  Fix cpplint errors for a set of operators (#9837) · 8d3ce01f
  Committed by Siddharth Goyal on Apr 11, 2018
```
* Fix cpplint errors, round2

* Fix pointer issue
```
  8d3ce01f
30 Mar, 2018 · 1 commit
- T
  
  Plain LRN op throws an exception when is_test is set in backward pass · b9874251
  Committed by Tomasz Patejko on Mar 30, 2018
  
  b9874251
22 Mar, 2018 · 1 commit
- T
  
  Function for running MKLDNN primitive added. Unittest added for is_test attribute · 14ba67c0
  Committed by Tomasz Patejko on Mar 22, 2018
  
  14ba67c0
21 Mar, 2018 · 1 commit
- T
  
  Device blobs are created only in training. Added testing attribute · 72cc64e4
  Committed by Tomasz Patejko on Mar 21, 2018
  
  72cc64e4
19 Mar, 2018 · 3 commits
- T
  
  Removing WITHIN_CHANNEL algorithm for lrn. CPU lrn operator works only with ACROSS_CHANNELS · 2d955275
  Committed by Tomasz Patejko on Mar 19, 2018
  
  2d955275
- T
  
  Content of GetExpectedKernelType moved to standalone function · c51c4462
  Committed by Tomasz Patejko on Mar 16, 2018
  
  c51c4462
- T
  
  Implementation of MKLDNN LRN · 192cc5dd
  Committed by Tomasz Patejko on Mar 13, 2018
  
  192cc5dd
15 Mar, 2018 · 1 commit
- Q
  
  Fix bug in LRN operator. (#9124) · 1cd700d8
  Committed by qingqing01 on Mar 15, 2018
  
  1cd700d8
12 Feb, 2018 · 1 commit
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  Committed by qingqing01 on Feb 12, 2018
  
  24509f4a
10 Feb, 2018 · 2 commits
- Y
  
  Correct #include path · fc374821
  Committed by Yi Wang on Feb 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  Committed by Yi Wang on Feb 09, 2018
  
  90648f33
26 Dec, 2017 · 1 commit
- L
  
  unify the indentation of license · 761b3297
  Committed by Luo Tao on Dec 26, 2017
  
  761b3297
20 Dec, 2017 · 1 commit
- Y
  Move framework.proto to proto namespace (#6718) · e445b3ff
  Committed by Yu Yang on Dec 20, 2017
```
* Move framework.proto to proto namespace

* Fix compile

* Fix compile

* Fix Compile
```
  e445b3ff
12 Dec, 2017 · 1 commit

Refine device context (#6433) · 61ec0b95

Committed by QI JUN on Dec 12, 2017

The main changes are as follows:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
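
The first item in the list above can be illustrated with a minimal sketch. The names `CPUDeviceContext`, `ScaleFunctor`, and `ScaleKernel` below are hypothetical stand-ins chosen for illustration; they are not Paddle's actual classes or signatures. The sketch only shows the general pattern of templating a functor and a kernel on the device context type, so that the context object is passed straight through instead of being looked up from a place.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-in for a CPU device context; NOT Paddle's real class.
struct CPUDeviceContext {};

// Math functor parameterized on the device context type (assumed name).
// Only specializations are defined, one per supported context.
template <typename DeviceContext, typename T>
struct ScaleFunctor;

// CPU specialization: multiplies every element by `factor` in place.
template <typename T>
struct ScaleFunctor<CPUDeviceContext, T> {
  void operator()(const CPUDeviceContext& /*ctx*/, T factor,
                  std::vector<T>* data) const {
    assert(data != nullptr);
    for (T& v : *data) v *= factor;
  }
};

// Kernel templated on the same DeviceContext type (assumed name). It forwards
// its context to the functor, so no place-to-device conversion is needed.
template <typename DeviceContext, typename T>
struct ScaleKernel {
  void Compute(const DeviceContext& ctx, T factor, std::vector<T>* data) const {
    ScaleFunctor<DeviceContext, T>()(ctx, factor, data);
  }
};
```

Under this pattern, supporting another device would mean adding one more context type and one more functor specialization, while the kernel template stays unchanged.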

61ec0b95

06 Dec, 2017 · 1 commit
- G
  Add LRN efficient GPU implement. (#5894) · c7e739f5
  Committed by gongweibao on Dec 06, 2017
```
Add LRN efficient GPU implement
```
  c7e739f5
04 Nov, 2017 · 1 commit
- K
  
  polish_g_to_l (#5367) · c0d2ca54
  Committed by kexinzhao on Nov 03, 2017
  
  c0d2ca54
26 Oct, 2017 · 1 commit
- G
  Local response normalize. (#4426) · 9d142d50
  Committed by gongweibao on Oct 26, 2017
```
Add local response normalize
```
  9d142d50
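
For readers unfamiliar with the operator this history tracks, the following is a minimal sketch of local response normalization across channels, the mode that commit 2d955275 says the CPU operator is restricted to. It is not the code from any commit listed here. The function name `LrnAcrossChannels`, the C x H x W layout, and the window convention (`n / 2` channels on each side, clipped at the edges) are assumptions, and frameworks differ on whether `alpha` is additionally divided by the window size.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Across-channel LRN for one image stored as C x H x W (row-major).
// out[c,h,w] = in[c,h,w] / (k + alpha * sum_j in[j,h,w]^2)^beta,
// where j runs over channels c - n/2 .. c + n/2, clipped to [0, C-1].
// Hypothetical helper for illustration only.
std::vector<float> LrnAcrossChannels(const std::vector<float>& in, int C,
                                     int H, int W, int n, float k,
                                     float alpha, float beta) {
  assert(C > 0 && H > 0 && W > 0 && n > 0);
  assert(in.size() == static_cast<std::size_t>(C) * H * W);
  const int half = n / 2;
  const int plane = H * W;  // number of spatial positions per channel
  std::vector<float> out(in.size());
  for (int c = 0; c < C; ++c) {
    const int lo = std::max(0, c - half);
    const int hi = std::min(C - 1, c + half);
    for (int p = 0; p < plane; ++p) {
      // Sum of squares over the neighbouring channels at this position.
      float sum_sq = 0.0f;
      for (int j = lo; j <= hi; ++j) {
        const float v = in[j * plane + p];
        sum_sq += v * v;
      }
      const float scale = k + alpha * sum_sq;
      out[c * plane + p] = in[c * plane + p] * std::pow(scale, -beta);
    }
  }
  return out;
}
```

This direct version recomputes the window sum for every channel. An optimized implementation would typically reuse partial sums between adjacent channels, but that is beyond what this sketch tries to show.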

PaddlePaddle / PaddleDetection · last synced successfully about 1 year ago