提交 · 701102283c4a80877a6c4c75b1f6fe170dc0d16d · PaddlePaddle / Paddle

19 6月, 2018 4 次提交
- M
  
  MKLDNN layouts: Gaussian random layout · 70110228
  由 mozga-intel 提交于 6月 15, 2018
  
  70110228
- Q
  Fix decay bug (#11520) · a29cb4be
  由 Qiyang Min 提交于 6月 19, 2018
```
* Add sub_blocks of lr_decay_op to pserver_prog after distribute_transpiler

* Remove unused logs and logics

* 1. Add ops to new block (considering the nested block condition)
2. Follow the original hierarchy of blocks
3. Change the function's name and remove debug lines
```
  a29cb4be
- T
  
  update the default cpu memory with MKLDNN · 9a25f289
  由 tensor-tang 提交于 6月 19, 2018
  
  9a25f289
- G
  
  Fix unlikely (#11537) · 4dda54aa
  由 gongweibao 提交于 6月 18, 2018
  
  4dda54aa
18 6月, 2018 2 次提交
- M
  
  MKLDNN layout: Support for activation operator · 792d3b24
  由 mozga-intel 提交于 6月 11, 2018
  
  792d3b24
- Y
  
  Feature/pass manager (#11440) · d7345959
  由 Yan Chunwei 提交于 6月 18, 2018
  
  d7345959
17 6月, 2018 4 次提交
- Y
  
  polish doc · 7a56705e
  由 yuyang18 提交于 6月 17, 2018
  
  7a56705e
- G
  
  Add some paddleenforce. (#11516) · 962711dc
  由 gongweibao 提交于 6月 16, 2018
  
  962711dc
- Q
  
  update image_resize_short and shape doc · 82a4cf19
  由 qiaolongfei 提交于 6月 17, 2018
  
  82a4cf19
- Q
  
  add gpu support for concat · ad1ad738
  由 qiaolongfei 提交于 6月 17, 2018
  
  ad1ad738
16 6月, 2018 2 次提交
- Q
  
  concat support data as input · 9c128fe6
  由 qiaolongfei 提交于 6月 16, 2018
  
  9c128fe6
- T
  
  refine the initial cpu memory flag for mkldnn · a8c2ff31
  由 tensor-tang 提交于 6月 16, 2018
  
  a8c2ff31
15 6月, 2018 17 次提交
- K
  Modify Pybind LoDTensor API according to length-based LoD (#11106) · 417fcf4f
  由 Kexin Zhao 提交于 6月 15, 2018
```
* add lod_tensor util and modify pybind

* refind pybind LoDTensor API and modify LoDTensor and DataFeeder test

* fix test error

* fix detection map op test

* fix reorder_lod_tensor test

* fix seq_concat_op

* fix chunk evel op test

* fix target assign op

* fix warp ctc op

* address comments step 1: reverse reset_lod op

* step 2: modify op test

* add warning message

* remove has_valid_lod

* add back has_valid_lod

* address comments

* add exception catching trial
```
  417fcf4f
- G
  
  fix warning (#11518) · dd55cc16
  由 gongweibao 提交于 6月 15, 2018
  
  dd55cc16
- Y
  
  Fix the display of reciprocal's formula · f3a777d8
  由 Yibing Liu 提交于 6月 15, 2018
  
  f3a777d8
- Q
  
  update doc for sigmoid_cross_entropy_with_logits · 8f59d79d
  由 qiaolongfei 提交于 6月 15, 2018
  
  8f59d79d
- Q
  Update some doc about API reference. (#11495) · cc1239ff
  由 qingqing01 提交于 6月 15, 2018
```
* Update some doc about layers' API.

* Fix format.

* Fix example bug in random_data_generator.

* Fix example bug in dropout.

* Follow comments and some small fix for some examples.
```
  cc1239ff
- Q
  
  update · 6ace04f6
  由 qiaolongfei 提交于 6月 15, 2018
  
  6ace04f6
- C
  
  fix conv3d/conv3d_trans/slice/mean_iou doc · 7b823530
  由 chengduoZH 提交于 6月 15, 2018
  
  7b823530
- Y
  
  Polish the doc of nce layer · 67dc5c7f
  由 Yibing Liu 提交于 6月 15, 2018
  
  67dc5c7f
- D
  
  "fix based comments" · 6ac8383f
  由 dzhwinter 提交于 6月 15, 2018
  
  6ac8383f
- Y
  
  Fix reciprocal op's doc · 279ebdd0
  由 Yibing Liu 提交于 6月 14, 2018
  
  279ebdd0
- L
  
  refine \odot in elementwise_mul · 1958654d
  由 Luo Tao 提交于 6月 15, 2018
  
  1958654d
- Y
  
  bugfix/trt engine op (#11487) · 5fd142c3
  由 Yan Chunwei 提交于 6月 15, 2018
  
  5fd142c3
- D
  
  "fix some typo" · 1f38cbf7
  由 dzhwinter 提交于 6月 14, 2018
  
  1f38cbf7
- Y
  
  update by comment · 3380737c
  由 yi.wu 提交于 6月 15, 2018
  
  3380737c
- F
  
  fix errors · d91060d3
  由 fengjiayi 提交于 6月 15, 2018
  
  d91060d3
- T
  
  polish doc: softshrink, assign, shuffle · 98ab2b40
  由 tensor-tang 提交于 6月 15, 2018
  
  98ab2b40
- T
  
  polish doc: mean · 24fea628
  由 tensor-tang 提交于 6月 15, 2018
  
  24fea628
14 6月, 2018 11 次提交
- D
  
  fix typo · 16a3d88a
  由 dzhwinter 提交于 6月 14, 2018
  
  16a3d88a
- C
  
  enable more type for splitOp and ConcatOp · ca743de2
  由 chengduoZH 提交于 6月 14, 2018
  
  ca743de2
- Y
  
  fix dist ut · 44925eb4
  由 yi.wu 提交于 6月 14, 2018
  
  44925eb4
- Y
  
  Polish code · 055df470
  由 yuyang18 提交于 6月 14, 2018
  
  055df470
- Y
  
  Polish documentation · cbc1b7f1
  由 yuyang18 提交于 6月 14, 2018
  
  cbc1b7f1
- T
  
  initial with only 1 mkl/openblas threads for each pthreads · 3e58df20
  由 tensor-tang 提交于 6月 14, 2018
  
  3e58df20
- F
  
  fix errors · 980499fa
  由 fengjiayi 提交于 6月 14, 2018
  
  980499fa
- Q
  Fix NCCLBcast hang up bug in Parallel Executor (#11377) · 046bb5c8
  由 Qiyang Min 提交于 6月 13, 2018
```
* 1. Create buddy allocator in each places before NcclBcast the variables
2. Check the memory usage of ALL gpus rather than the first one

* 1. Make NCCLGroupGuard guards only the ncclBcast part, which avoid ncclGroupEnd blocking the exception throwing
2. NOTE the usage of NCCLGroupGuard

* Remove the memory usage check of gpus

* Fix code style
```
  046bb5c8
- W
  Add mean IOU op. (#10519) · 6fcdb240
  由 whs 提交于 6月 14, 2018
```
* Add mean_iou op.

* Add unitest for mean iou op.

* Add optional collections of confusion matrix and mean_iou.

* Fix cuda kernel.

* Refine code.
1. Merge computing in GPU to two kernel.
2. Use wrong array and correct array instead of confusion matrix.

* Add python api and fix cuda kernel.

* Fix comments.

* Small fix.

* Small fix.
```
  6fcdb240
- Q
  
  add comment that out var of prefetch must be created in local scope · 490a07f5
  由 qiaolongfei 提交于 6月 14, 2018
  
  490a07f5
- X
  Remove cuptiFinalize. · d2afd210
  由 Xin Pan 提交于 6月 14, 2018
```
In cupti samples, only cuptiFlush is used.
I can't find any places calling cuptiFinalize and
this API can error out as not_implemented in some
cuda installation.
```
  d2afd210

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功