提交 · e26f51ce74e998c4119fc2c3145aa7c1224c2170 · s920243400 / PaddleDetection

26 6月, 2018 1 次提交

MKLDNN elementwis_add with default broadcast operations (#11544) · e26f51ce

由 Tomasz Patejko 提交于 6月 26, 2018

* elementwise_add with bcast: Brian's implementation by Brian added, with default bcasts

* elementwise_add with bcast: GetExpectedKernelType added to elementwise_op

* elementwise_add with bcast: use_mkldnn attribute added

* elementwise_add with bcast: changes after review and some formatting

* elementwise_add with bcast: changes after style check

* elementwise_add with bcast: changes after style check cont.

* elementwise_add with bcast: MKLDNN unittests added

* elementwise_add with bcast: original unittests with use_mkldnn flag

* elementwise_add with bcast: handling of MKLDNN format corrected

* elementwise_add with bcast: setting MKLDNN format turned into lambda

* elementwise_add with bcast: MKDNN format setting turned into separate function

* elementwise_add with bcast: condition for choosing MKLDNN simplified

* elementwise_add with bcast: fix for MKLDNN format set incorrectly in bcasts

* elementwise_add with bcast: changes in unittests for broadcasts

* elementwise_add with bcast: fixes in unittests regarding dimensions

* elementwise_add with bcast: bring back correct format setting in mklml grad path

* elementwise_add with bcast: fixed compilation error

e26f51ce

07 6月, 2018 1 次提交

Mkldnn layout (#11040) · 3ff9ba0e

由 mozga-intel 提交于 6月 07, 2018

* Add MKLDNN layout support in Paddle

Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout
can be used in MKLDNN enabled OP kernel. Before this commit, NCHW
is hardcode to be used in all MKLDNN op kernels. As a result,
non-optimized execution path is selected in MKLDNN primitive which
bring worse performance.
Besides framework change, three MKLDNN OP kernels were updated
for using new MKLDNN layout. They are conv/pool2d/batch_norm.
Other MKLDNN OP kernels need be also updated in similar way to
achieve best performance.

* Add MKLDNN layout support in activation OP

* Don't populate layout from input to output when kMKLDNN in

* Refine pool mkldnn op kernel

* MKLDNN layout

* Remove the inferitance from tensor file

* MKLDNN layout: refactoring

* Remove additional #define to register new operator

* Prepare mkldnn tests to work with layout

3ff9ba0e

19 4月, 2018 1 次提交
- A
  
  Fix CPPLint errors in some framework files · cbbf08ae
  由 Abhinav Arora 提交于 4月 18, 2018
  
  cbbf08ae
12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
21 1月, 2018 1 次提交

"fix decode bug" (#7711) · e983cc90

由 dzhwinter 提交于 1月 21, 2018

* "fix decode bug"

* "follow commnet"

* "fix error"

* "fix hook bug"

* fix based comment

* fix copyright

* fix based on comment

e983cc90

19 1月, 2018 1 次提交
- Q
  complete data layout transform (#7440) · 0071b5f7
  由 Qiao Longfei 提交于 1月 19, 2018
```
* add data layout transform and optimize the implementation of data_transform
```
  0071b5f7
10 1月, 2018 1 次提交

reorganize data transform related code (#7391) · 377424bf

由 Qiao Longfei 提交于 1月 10, 2018

* init data_type_transform

* split data_layout_transform

* tmp rm data_transform_test

* change device_data_transform to data_device_transform

* clean code

* clean code

377424bf

08 1月, 2018 1 次提交

cpu gpu transform function (#7191) · 0f353ab4

由 Qiao Longfei 提交于 1月 08, 2018

* add rename guard

* add device_data_transform

* add device_data_transform_test

* modify GetExpectedKernelType

* update operator.run

* support test test_label_semantic_roles

* optimize code

* optimize code

* rename GetActualKernelType to GetExpectedKernelType

* fix chunk_eval_op and device_data_transform_test

* add is_same_place to place

* optimize code, refine rename_guard

* refine rename guard, add GetKernelTypeForVar

* optimize code

* add some log

* rename guard

* use sub scope to create var

* fix compile

* add IsInitialized for Tensor

* add VarIsTensor

* fix op_registry_test

* test

* tmp disable priority

* restore switch_kernel.md

* code clean

0f353ab4

26 12月, 2017 1 次提交
- L
  
  unify the indentation of license · 761b3297
  由 Luo Tao 提交于 12月 26, 2017
  
  761b3297
17 10月, 2017 1 次提交

Rewrite feed/fetch op (#4815) · 4df6cf4d

由 Yu Yang 提交于 10月 16, 2017

* Feed/Fetch op just plain operator, not a OpWithKernel
* Do not register OpInfoMaker since Feed/Fetch will never be
  configured by users
* Feed/Fetch op has empty gradient
* Feed/Fetch op do not hard code `feed_variable`, `fetch_variable` as
  its input and output, make it as a plain Operator input/output

4df6cf4d

28 9月, 2017 1 次提交
- C
  
  fix compile error · 920392e6
  由 chengduoZH 提交于 9月 28, 2017
  
  920392e6
27 9月, 2017 1 次提交
- Y
  
  Make PyBind support C++ exception · 67cdd5bc
  由 Yu Yang 提交于 9月 26, 2017
  
  67cdd5bc
25 7月, 2017 1 次提交
- L
  
  ENH: Refine Tensor and And CopyFrom · de8a8fee
  由 liaogang 提交于 7月 25, 2017
  
  de8a8fee
17 7月, 2017 2 次提交
- Y
  
  Refine CMake dependencies graph · 38310f93
  由 Yu Yang 提交于 7月 17, 2017
  
  38310f93
- Y
  Add enforce switch for convient develop (#2850) · cdec5634
  由 Yan Chunwei 提交于 7月 17, 2017
```
* add NDEBUG switch to PADDLE_ENFORCE
```
  cdec5634
11 7月, 2017 2 次提交
- D
  
  "support net_proto header" · 18e65b0c
  由 dongzhihong 提交于 7月 11, 2017
  
  18e65b0c
- D
  
  "move opContext to DeviceContext" · bc021d77
  由 dongzhihong 提交于 7月 11, 2017
  
  bc021d77
06 7月, 2017 2 次提交
- L
  
  FIX: explicit construct pool element · a669bf48
  由 liaogang 提交于 7月 06, 2017
  
  a669bf48
- L
  
  ENH: add memory unit test · 74691789
  由 liaogang 提交于 7月 06, 2017
  
  74691789
05 7月, 2017 1 次提交
- L
  
  FIX: Buddy Allocator Free with Merge feature · ada1c20b
  由 liaogang 提交于 7月 05, 2017
  
  ada1c20b
04 7月, 2017 4 次提交
- L
  
  ENH: Add paddle_memory for external usage · 4dc3c9e0
  由 liaogang 提交于 7月 04, 2017
  
  4dc3c9e0
- L
  
  ENH: Add buddy allocator Free · 0ba63475
  由 liaogang 提交于 7月 04, 2017
  
  0ba63475
- L
  
  ENH: code style · ff363894
  由 liaogang 提交于 7月 04, 2017
  
  ff363894
- L
  
  ENH: add buddy alloctor Free · 4e1617d0
  由 liaogang 提交于 7月 04, 2017
  
  4e1617d0
03 7月, 2017 1 次提交
- L
  ENH: Add Alloc for buddy Allocator · bbd3eab7
  由 liaogang 提交于 7月 03, 2017
```
* Free will be added soon
```
  bbd3eab7
28 6月, 2017 2 次提交
- L
  
  FIX: Pass CI · 3e9aa7fd
  由 liaogang 提交于 6月 28, 2017
  
  3e9aa7fd
- Y
  
  Add buddy_allocator.cc and system_allocator.cc · 3e087f76
  由 Yi Wang 提交于 6月 27, 2017
  
  3e087f76

s920243400 / PaddleDetection 与 Fork 源项目一致

s920243400 / PaddleDetection
与 Fork 源项目一致