提交 · 29bf727e2ad281604a3bc30e094be29a193a5d73 · 机器未来 / Paddle

13 6月, 2018 1 次提交
- Y
  
  fix nccl dist train bug · d76ebd78
  由 yi.wu 提交于 6月 13, 2018
  
  d76ebd78
12 6月, 2018 4 次提交

Trainer send term signal (#11220) · 34865f2d

由 Wu Yi 提交于 6月 12, 2018

* wip

* use executor.complete to end trainer

* fix build

* fix build with distribute off

* fix typo

* fix cmake typo

* fix build

34865f2d

L

update with comments · c4c78733
由 Luo Tao 提交于 6月 12, 2018

c4c78733
Q

fix the default value prefetch_var_name_to_block_id · 2b9ff39f
由 qiaolongfei 提交于 6月 12, 2018

2b9ff39f

Make the normalization operator more general and fix bug in l2_normalize. (#11348) · 19fd0717

由 qingqing01 提交于 6月 12, 2018

* Add normalization operator.
1. Refine the raw norm_op and let it more general to support to normalize Tensor along any axis.
2. There is a bug in l2_normalize API, which lacks sqrt after `reduce_sum`.
3. Use norm_op to refine the l2_normalize API.
4. Fix bug in test_normalization_wrapper.py.

19fd0717

11 6月, 2018 18 次提交
- W
  Add slice op. (#11052) · adc09087
  由 whs 提交于 6月 11, 2018
```
* Add slice op.

* Remove using from header file and fix doc.

* Fix doc

* Small fix.
```
  adc09087
- L
  
  update with comments · 7bdb573d
  由 Luo Tao 提交于 6月 11, 2018
  
  7bdb573d
- Q
  
  optimize code · 506fc8d9
  由 qiaolongfei 提交于 6月 11, 2018
  
  506fc8d9
- G
  
  Add brpc surpport. (#11263) · d9de6b86
  由 gongweibao 提交于 6月 11, 2018
  
  d9de6b86
- X
  Make status update thread-safe · 1509ae3a
  由 Xin Pan 提交于 6月 11, 2018
```
The status is updated in the Process() thread
and can be checked in another HandleRequest() thread.
```
  1509ae3a
- Q
  
  optimize comment and code · ea106c91
  由 qiaolongfei 提交于 6月 11, 2018
  
  ea106c91
- L
  
  refine docs of elementwise_op etc. · 76941990
  由 Luo Tao 提交于 6月 11, 2018
  
  76941990
- Q
  
  set status before Finish in prefetch process · 7f4b9656
  由 qiaolongfei 提交于 6月 11, 2018
  
  7f4b9656
- D
  add inplace attribute to op_proto_maker (#10665) · bfa3fd6f
  由 dzhwinter 提交于 6月 11, 2018
```
* "add inplace attribute"

* "register inplace attribute"

* "change se-next model for memory-reuse"

* "fix typo"

* repick

* fix merge conflict

* "fix stupid error"
```
  bfa3fd6f
- Q
  
  set the thread pool of prefetch to 1 to fix a bug · 5aba10b5
  由 qiaolongfei 提交于 6月 11, 2018
  
  5aba10b5
- Q
  
  fix grpc_server_test · 8fb78f6c
  由 qiaolongfei 提交于 6月 11, 2018
  
  8fb78f6c
- Q
  
  update prefetch logic in grpc_server · 4e36c0ec
  由 qiaolongfei 提交于 6月 11, 2018
  
  4e36c0ec
- G
  
  Clean `sendop` `recv` operator. (#11309) · 627d7a64
  由 gongweibao 提交于 6月 11, 2018
  
  627d7a64
- Q
  
  refine prefetch logic · 0d3d4ae7
  由 qiaolongfei 提交于 6月 11, 2018
  
  0d3d4ae7
- Y
  Polish arg_min_max_op · 9b43edea
  由 yuyang18 提交于 6月 11, 2018
```
* Remove unused arg_max/min_op.h
* Remove reference parameter. Use pointer insteaded.
* undef macro
* Always set OutT as int64_t.
```
  9b43edea
- M
  
  MKLDNN layout: Support for batch norm operator · 7d564356
  由 mozga-intel 提交于 6月 10, 2018
  
  7d564356
- M
  
  MKLDNN layout: Support for convolution operator · 9908d3cf
  由 mozga-intel 提交于 6月 10, 2018
  
  9908d3cf
- M
  
  MKLDNN layout: Support for pool operator · 36031cb5
  由 mozga-intel 提交于 6月 10, 2018
  
  36031cb5
08 6月, 2018 9 次提交
- S
  
  remove redundant comments · 6d32e960
  由 sneaxiy 提交于 6月 08, 2018
  
  6d32e960
- Y
  
  polish sparse update logic · 56964946
  由 Yancey1989 提交于 6月 08, 2018
  
  56964946
- G
  
  fix some bugs introduced by unfreed memory · 0fec9469
  由 guochaorong 提交于 6月 08, 2018
  
  0fec9469
- Y
  
  Refine LinearCRF · 8c9041f4
  由 yuyang18 提交于 6月 08, 2018
  
  8c9041f4
- S
  
  recommit using account sneaxiy · 568c4e5e
  由 sneaxiy 提交于 6月 08, 2018
  
  568c4e5e
- Y
  
  Add resize_bilinear · 0d29e659
  由 yuyang18 提交于 6月 08, 2018
  
  0d29e659
- Y
  
  Simplize API Reference Documentation · b000e0de
  由 yuyang18 提交于 6月 08, 2018
  
  b000e0de
- F
  
  Fix a GPU bug · c7bbfb33
  由 fengjiayi 提交于 6月 08, 2018
  
  c7bbfb33
- Y
  
  polish sparse update code · 1239fce7
  由 Yancey1989 提交于 6月 08, 2018
  
  1239fce7
07 6月, 2018 8 次提交

X

Refine API doc string · e80c6b3c
由 Xin Pan 提交于 6月 07, 2018

e80c6b3c

split reduce op into multiple libraries, accelerate the compiling (#11029) · d48172f2

由 dzhwinter 提交于 6月 07, 2018

* "split into multiple .ccl"

* "refine file structure"

* "refine files"

* "remove the cmakelist"

* "fix typo"

* "fix typo"

* fix ci

d48172f2

F

Make crop op supporting taking offsets as one of its inputs · 9c61409a
由 fengjiayi 提交于 6月 07, 2018

9c61409a
F

stash · 4f46a98f
由 fengjiayi 提交于 6月 07, 2018

4f46a98f

Mkldnn layout (#11040) · 3ff9ba0e

由 mozga-intel 提交于 6月 07, 2018

* Add MKLDNN layout support in Paddle

Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout
can be used in MKLDNN enabled OP kernel. Before this commit, NCHW
is hardcode to be used in all MKLDNN op kernels. As a result,
non-optimized execution path is selected in MKLDNN primitive which
bring worse performance.
Besides framework change, three MKLDNN OP kernels were updated
for using new MKLDNN layout. They are conv/pool2d/batch_norm.
Other MKLDNN OP kernels need be also updated in similar way to
achieve best performance.

* Add MKLDNN layout support in activation OP

* Don't populate layout from input to output when kMKLDNN in

* Refine pool mkldnn op kernel

* MKLDNN layout

* Remove the inferitance from tensor file

* MKLDNN layout: refactoring

* Remove additional #define to register new operator

* Prepare mkldnn tests to work with layout

3ff9ba0e

F

fix a multi-thread bug in readers · 499dbe05
由 fengjiayi 提交于 6月 07, 2018

499dbe05
G

Add rpc_client interface. (#11154) · 2028a8ef
由 gongweibao 提交于 6月 06, 2018

2028a8ef
Y

feature/trt engine op test (#11182) · 4f95bc94
由 Yan Chunwei 提交于 6月 07, 2018

4f95bc94

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致