提交 · 24649a780d055241625359ac492402689af9bd91 · s920243400 / PaddleDetection

07 6月, 2018 6 次提交

split reduce op into multiple libraries, accelerate the compiling (#11029) · d48172f2

由 dzhwinter 提交于 6月 07, 2018

* "split into multiple .ccl"

* "refine file structure"

* "refine files"

* "remove the cmakelist"

* "fix typo"

* "fix typo"

* fix ci

d48172f2

F

Make crop op supporting taking offsets as one of its inputs · 9c61409a
由 fengjiayi 提交于 6月 07, 2018

9c61409a
F

stash · 4f46a98f
由 fengjiayi 提交于 6月 07, 2018

4f46a98f

Mkldnn layout (#11040) · 3ff9ba0e

由 mozga-intel 提交于 6月 07, 2018

* Add MKLDNN layout support in Paddle

Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout
can be used in MKLDNN enabled OP kernel. Before this commit, NCHW
is hardcode to be used in all MKLDNN op kernels. As a result,
non-optimized execution path is selected in MKLDNN primitive which
bring worse performance.
Besides framework change, three MKLDNN OP kernels were updated
for using new MKLDNN layout. They are conv/pool2d/batch_norm.
Other MKLDNN OP kernels need be also updated in similar way to
achieve best performance.

* Add MKLDNN layout support in activation OP

* Don't populate layout from input to output when kMKLDNN in

* Refine pool mkldnn op kernel

* MKLDNN layout

* Remove the inferitance from tensor file

* MKLDNN layout: refactoring

* Remove additional #define to register new operator

* Prepare mkldnn tests to work with layout

3ff9ba0e

G

Add rpc_client interface. (#11154) · 2028a8ef
由 gongweibao 提交于 6月 06, 2018

2028a8ef
Y

feature/trt engine op test (#11182) · 4f95bc94
由 Yan Chunwei 提交于 6月 07, 2018

4f95bc94

06 6月, 2018 5 次提交
- F
  
  fix a bug · 12d17941
  由 fengjiayi 提交于 6月 06, 2018
  
  12d17941
- F
  
  Refine code · 41ced8e2
  由 fengjiayi 提交于 6月 06, 2018
  
  41ced8e2
- F
  
  Add unit tests · aa9383f3
  由 fengjiayi 提交于 6月 06, 2018
  
  aa9383f3
- F
  
  complete C++ part · e2bb4d07
  由 fengjiayi 提交于 6月 06, 2018
  
  e2bb4d07
- D
  Feature/deterministic (#11205) · 7971d4a3
  由 dzhwinter 提交于 6月 06, 2018
```
* "fix deterministic"

* "fix ci"

* "fix init"
```
  7971d4a3
05 6月, 2018 4 次提交
- Y
  Add default prior box var for box_coder_op (#11164) · 666c94e3
  由 Yuan Gao 提交于 6月 05, 2018
```
* add normalize switch to box_coder_op

* add default prior box var

* update according to the review
```
  666c94e3
- W
  
  Refine rpc client wait sync (#11132) · 036a90f1
  由 Wu Yi 提交于 6月 05, 2018
  
  036a90f1
- W
  
  Prune dims supported by reduce op. (#11113) · d74838bd
  由 whs 提交于 6月 05, 2018
  
  d74838bd
- S
  
  Fix signed-unsigned comparison warning (#11167) · 71b6bdb5
  由 Siddharth Goyal 提交于 6月 04, 2018
  
  71b6bdb5
04 6月, 2018 6 次提交
- Q
  
  delete unused code · 1766406f
  由 qiaolongfei 提交于 6月 04, 2018
  
  1766406f
- F
  
  refine code · 3526ac11
  由 fengjiayi 提交于 6月 04, 2018
  
  3526ac11
- T
  
  rename Mkldnn to MKLDNN · 6ac47a3d
  由 tensor-tang 提交于 6月 04, 2018
  
  6ac47a3d
- F
  
  fix a bug · 744cc412
  由 fengjiayi 提交于 6月 04, 2018
  
  744cc412
- F
  
  Creating readers before training begining · ee4e567d
  由 fengjiayi 提交于 6月 04, 2018
  
  ee4e567d
- Y
  
  add normalize switch to box_coder_op (#11129) · d3e99aee
  由 Yuan Gao 提交于 6月 04, 2018
  
  d3e99aee
03 6月, 2018 1 次提交
- Q
  
  fix build error on mac · 906334a6
  由 qiaolongfei 提交于 6月 03, 2018
  
  906334a6
01 6月, 2018 6 次提交
- Y
  
  feature/add TRT fc converter (#11043) · 0c0c5df4
  由 Yan Chunwei 提交于 6月 01, 2018
  
  0c0c5df4
- W
  Add python wrapper for gather op. (#11033) · 86d8659c
  由 whs 提交于 6月 01, 2018
```
* Add python wrapper for gather op.

* Add unitest for 'rank==1' and fix comments.

* Fix comments.
```
  86d8659c
- W
  Add shape op to get the shape of variable. (#11048) · 28dc9ba3
  由 whs 提交于 6月 01, 2018
```
* Add shape op to get the shape of variable.

* Rename get_shape to shape.

* Add checker for output and fix comments.
```
  28dc9ba3
- W
  Make bilinear_interp_op support attrs from input. (#11041) · 85c203b1
  由 whs 提交于 6月 01, 2018
```
* Make bilinear_interp_op support attrs from input.

* Fix python api.
```
  85c203b1
- Y
  
  use open_files reader to read multiple files · f9556dca
  由 Yancey1989 提交于 6月 01, 2018
  
  f9556dca
- G
  
  Move sync_mode device ctx from grpc server (#10881) · 4fb7cc7f
  由 gongweibao 提交于 5月 31, 2018
  
  4fb7cc7f
31 5月, 2018 1 次提交
- Y
  
  use recordio in dist train · e05abab6
  由 Yancey1989 提交于 5月 31, 2018
  
  e05abab6
30 5月, 2018 11 次提交
- L
  
  fix compiler error when do not have TensorRT library · aa4f685b
  由 Luo Tao 提交于 5月 30, 2018
  
  aa4f685b
- Y
  
  feature/tensorrt engine op (#11001) · 211e1315
  由 Yan Chunwei 提交于 5月 30, 2018
  
  211e1315
- F
  
  fix two bugs · 32c0e82c
  由 fengjiayi 提交于 5月 30, 2018
  
  32c0e82c
- X
  
  add back reduce_op · cb01c594
  由 Xin Pan 提交于 5月 30, 2018
  
  cb01c594
- Y
  
  Fix bug in CUDA · a6c11a5d
  由 yuyang18 提交于 5月 30, 2018
  
  a6c11a5d
- X
  
  better profiler and benchmark · 3cb63956
  由 Xin Pan 提交于 5月 30, 2018
  
  3cb63956
- M
  
  Withdraw mkldnn mul · 30d32035
  由 mozga-intel 提交于 5月 30, 2018
  
  30d32035
- Y
  
  Fix GPU compile · 45530c77
  由 yuyang18 提交于 5月 30, 2018
  
  45530c77
- F
  
  Polish RandomCropOp · 7c42e5de
  由 fengjiayi 提交于 5月 30, 2018
  
  7c42e5de
- Q
  Develop a fake dequantized op for fixed-point quantization training framework. (#10965) · 3a29821b
  由 qingqing01 提交于 5月 30, 2018
```
* Develop a fake dequantized op for fixed-point quantization training framework.

* Add the missing file.
```
  3a29821b
- F
  
  Add .cu · 56419caa
  由 fengjiayi 提交于 5月 30, 2018
  
  56419caa

s920243400 / PaddleDetection 与 Fork 源项目一致

s920243400 / PaddleDetection
与 Fork 源项目一致