提交 · efcff3d9e50ef75e854e8945c3b829e94ddffa50 · BaiXuePrincess / Paddle

08 6月, 2018 3 次提交
- Y
  
  polish api ref docs · efcff3d9
  由 yi.wu 提交于 6月 08, 2018
  
  efcff3d9
- Y
  
  polish docs · 5be454bf
  由 yi.wu 提交于 6月 08, 2018
  
  5be454bf
- Y
  fix dist train error (#11281) · 0aa9546e
  由 Yancey 提交于 6月 08, 2018
```
* fix dist train error

* update by comment
```
  0aa9546e
07 6月, 2018 8 次提交

split reduce op into multiple libraries, accelerate the compiling (#11029) · d48172f2

由 dzhwinter 提交于 6月 07, 2018

* "split into multiple .ccl"

* "refine file structure"

* "refine files"

* "remove the cmakelist"

* "fix typo"

* "fix typo"

* fix ci

d48172f2

Big data op_test benchmark, for checking output consistent in different runs. (#10646) · f7c96f07

由 dzhwinter 提交于 6月 07, 2018

* "init benchmark ops"

* "untrack outputs"

* "delete some usused code"

* "benchmark"

* "fix ci"

* "fix op test"

* "fix uint16 missing"

* "fix ci"

* "follow comments"

* "fix ci"

* "follow comments"

* "conficts. merge develop branch"

* repick

* "merge develop branch"

f7c96f07

F

fix bugs in the implementation of 'HasInput' and 'HasOutput' · dc8e0b49
由 fengjiayi 提交于 6月 07, 2018

dc8e0b49

Mkldnn layout (#11040) · 3ff9ba0e

由 mozga-intel 提交于 6月 07, 2018

* Add MKLDNN layout support in Paddle

Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout
can be used in MKLDNN enabled OP kernel. Before this commit, NCHW
is hardcode to be used in all MKLDNN op kernels. As a result,
non-optimized execution path is selected in MKLDNN primitive which
bring worse performance.
Besides framework change, three MKLDNN OP kernels were updated
for using new MKLDNN layout. They are conv/pool2d/batch_norm.
Other MKLDNN OP kernels need be also updated in similar way to
achieve best performance.

* Add MKLDNN layout support in activation OP

* Don't populate layout from input to output when kMKLDNN in

* Refine pool mkldnn op kernel

* MKLDNN layout

* Remove the inferitance from tensor file

* MKLDNN layout: refactoring

* Remove additional #define to register new operator

* Prepare mkldnn tests to work with layout

3ff9ba0e

F

fix a compile error · 2f5e3101
由 fengjiayi 提交于 6月 07, 2018

2f5e3101
L

add test_mode in trt/activation_op · f6fb51a1
由 Luo Tao 提交于 6月 07, 2018

f6fb51a1
G

Add rpc_client interface. (#11154) · 2028a8ef
由 gongweibao 提交于 6月 06, 2018

2028a8ef
Y

feature/trt engine op test (#11182) · 4f95bc94
由 Yan Chunwei 提交于 6月 07, 2018

4f95bc94

06 6月, 2018 18 次提交
- Q
  Fix PADDLE_ASSERT. (#10981) · e0a32074
  由 qingqing01 提交于 6月 06, 2018
```
* Enable assertions in CUDA.

* Fix PADDLE_ASSERT.
```
  e0a32074
- Y
  SSA Graph Builder Factory · d9af1532
  由 yuyang18 提交于 6月 06, 2018
```
* Use Builder Chain to decorate new builders. It is easy to extend
  builders.
* Make graphviz path as a build strategy, not a FLAGS.
```
  d9af1532
- C
  
  add fuse var op handle · a584bc86
  由 chengduoZH 提交于 6月 06, 2018
  
  a584bc86
- X
  
  small clean up and document pointer ownership. · 73aa5d23
  由 Xin Pan 提交于 6月 06, 2018
  
  73aa5d23
- T
  
  refine the lock in scope · 4ae935e2
  由 tensor-tang 提交于 6月 06, 2018
  
  4ae935e2
- T
  
  fix abort issue in cpu multi-threads · 9b34f8da
  由 tensor-tang 提交于 6月 06, 2018
  
  9b34f8da
- T
  
  refine nlp multi-threads · 68409533
  由 tensor-tang 提交于 6月 06, 2018
  
  68409533
- Y
  
  Extract method from tensor_impl.h to tensor.cc · fc9f2d28
  由 yuyang18 提交于 6月 06, 2018
  
  fc9f2d28
- D
  
  "fix" · 2b9ef7e2
  由 dzhwinter 提交于 6月 05, 2018
  
  2b9ef7e2
- D
  
  "fix compiled in manylinux" · 75d8e8ca
  由 dzhwinter 提交于 6月 05, 2018
  
  75d8e8ca
- F
  
  fix a bug · 12d17941
  由 fengjiayi 提交于 6月 06, 2018
  
  12d17941
- F
  
  Refine code · 41ced8e2
  由 fengjiayi 提交于 6月 06, 2018
  
  41ced8e2
- D
  
  "done" · 4777aec9
  由 dzhwinter 提交于 6月 05, 2018
  
  4777aec9
- F
  
  Add unit tests · aa9383f3
  由 fengjiayi 提交于 6月 06, 2018
  
  aa9383f3
- L
  
  rewrite unittest of trt_activation_op · e116129f
  由 Luo Tao 提交于 6月 06, 2018
  
  e116129f
- F
  
  complete C++ part · e2bb4d07
  由 fengjiayi 提交于 6月 06, 2018
  
  e2bb4d07
- Y
  
  add dfg graphviz pass (#11211) · df87e63b
  由 Yan Chunwei 提交于 6月 06, 2018
  
  df87e63b
- D
  Feature/deterministic (#11205) · 7971d4a3
  由 dzhwinter 提交于 6月 06, 2018
```
* "fix deterministic"

* "fix ci"

* "fix init"
```
  7971d4a3
05 6月, 2018 6 次提交
- Y
  Add default prior box var for box_coder_op (#11164) · 666c94e3
  由 Yuan Gao 提交于 6月 05, 2018
```
* add normalize switch to box_coder_op

* add default prior box var

* update according to the review
```
  666c94e3
- W
  
  Refine rpc client wait sync (#11132) · 036a90f1
  由 Wu Yi 提交于 6月 05, 2018
  
  036a90f1
- W
  
  Prune dims supported by reduce op. (#11113) · d74838bd
  由 whs 提交于 6月 05, 2018
  
  d74838bd
- Q
  fix protobuf memory leak (#11177) · 23812490
  由 Qiao Longfei 提交于 6月 05, 2018
```
fix protobuf memory leak
```
  23812490
- S
  
  Fix dangling pointer bug · 02cc80b3
  由 sneaxiy 提交于 6月 05, 2018
  
  02cc80b3
- S
  
  Fix signed-unsigned comparison warning (#11167) · 71b6bdb5
  由 Siddharth Goyal 提交于 6月 04, 2018
  
  71b6bdb5
04 6月, 2018 5 次提交
- Q
  
  delete unused code · 1766406f
  由 qiaolongfei 提交于 6月 04, 2018
  
  1766406f
- F
  
  refine code · 3526ac11
  由 fengjiayi 提交于 6月 04, 2018
  
  3526ac11
- T
  
  rename Mkldnn to MKLDNN · 6ac47a3d
  由 tensor-tang 提交于 6月 04, 2018
  
  6ac47a3d
- F
  
  fix a bug · 744cc412
  由 fengjiayi 提交于 6月 04, 2018
  
  744cc412
- T
  
  follow comments · 6ae7cbe2
  由 tensor-tang 提交于 6月 04, 2018
  
  6ae7cbe2

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致