提交 · 18ac6947d08e5cd25ea53c2d90363bc27e009b19 · PaddlePaddle / Paddle

19 3月, 2018 1 次提交

由 Xin Pan 提交于 3月 19, 2018

On k40 with 4 devices, time reduces from ~4.0 to ~3.8+, should be
more obvious on better hardware

18ac6947

16 3月, 2018 1 次提交
- X
  Fix a program copy regression. · 1ca1e1c3
  由 Xin Pan 提交于 3月 15, 2018
```
Single device on se-resnet reduce from 0.56 to 0.50
```
  1ca1e1c3
15 3月, 2018 14 次提交

Implement Select OP (#9088) · 1e4c504e

由 Thuan Nguyen 提交于 3月 15, 2018

* Fix old documentation for channel_recv

* Initial design of CSP select

* Redesign channel implementation for Select Op

* Remove unecessary header

* Initial checkin of select op, currently will read all the conditional_op in the cases block and also pull out all channels involved in the select.

* Init python select op API

* Python select bug fix when checking op creates block

* Add case_to_execute as (a) input to select, (b) into the passed inputs into the select op

* Add in addition code for select op

* Init fibonacci test from python

* implement fibonnaci sequence test

* update fib unit test

* Improve select test cases

* Shorten non-pep-8-ed lines

* Add methods on channel needed by select op

* Fix compile issues, finish implementation, still need to debug code

* Fix issue with fibonncci test, it works now!

* Change QueueMessage callback to take in an ChannelAction enum, fix select unit test

* Fix case attributes

* Fix issue with select control flow

* Make cases - previously on each selectcase conditional_block - attributes to select

* Use class constants for type of channel

* Change select op to take in "cases" attribute

* return boolean from select callback function to tell Channel if this RECV or SEND should be executed

* Improve attributes and inputs comments on select op

* Fix issues with python unit test

* Assert fibonacci final output

* Fix issue when channel name / channel var is null for "default" case in select op

* Assert base select test output

* Make QueueMessage use shared pointer and modify the order of the callback

* Fixing the order in which the callback is called

* Move channel utility methods to paddle/fluid/operators/concurrency/channel_util

* Create channel_util and move channel util methods

* Fix crash when calling select_op

* Fix deadlock

* Fix issue of channel destructor deadlock

* Fix precommit issues

* Accidentally checked in changes to beam_search_op, reverting change.

* Fix dependency issue in concurrency cmake

* add device_context dependency for concurrency target

1e4c504e

Q

Always synchronize when copy data on GPU from C++ to Numpy array. (#9110) · 45073b7c
由 qingqing01 提交于 3月 15, 2018

45073b7c
X
Merge pull request #9037 from panyx0718/develop · d284cf88
由 Xin Pan 提交于 3月 15, 2018
```
Better timeline
```
d284cf88

[Speed]implement cudnn sequence softmax cudnn (#8978) · 128adf53

由 dzhwinter 提交于 3月 15, 2018

* "add softmax cudnn functor support"

* "add testing"

* "refine cmakelist"

* "sequence softmax forward speed up"

* "add softmax grad"

* "fix sequence softmax test"

* "add double precision'

* "fix softmax test"

* "add softmax cudnn support"

* "fix softmax cudnn test"

* "add softmax to nn.py"

* "fix compile bug"

* "refine cmakelist"

* "fix ci"

* "fix based on comment"

* "fix based on comments"

* "fix ci"

128adf53

Y
Merge pull request #9058 from reyoung/feature/parallel_do_bug · 9b9f3f09
由 Yu Yang 提交于 3月 15, 2018
```
Fix models #725
```
9b9f3f09
R
Merge pull request #9093 from weixing02/dockerfile · 3ff649e3
由 ranqiu92 提交于 3月 15, 2018
```
The sphinx version is specified as 1.5.6 in the Dockerfile
```
3ff649e3
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dockerfile · a68290fc
由 _青葱提交于 3月 15, 2018
```
Merge branch develop
```
a68290fc
Add comments · 45eb94e6
由 _青葱提交于 3月 15, 2018

45eb94e6

Add fp16 mul op support and bind paddle fp16 to numpy fp16 (#9017) · e26f1123

由 Kexin Zhao 提交于 3月 14, 2018

* add fp16 mul op support

* small fix

* fix bug

* small fix

* fix PADDLE_WITH_CUDA compiling issue

* reorg code

* test for pybind

* treate as float16 as uint16_t in pybind

* bind np.float16 to paddle float16

* small fix

* clean code

* remove redundancy

* fix mul_op test

* address comments

* small fix

* add is_float16_supported func

e26f1123

The sphinx version is specified as 1.5.6 in the Dockerfile · fdc3843f
由 _青葱提交于 3月 15, 2018

fdc3843f

"exported scatter to python" (#9038) · 71400711

由 dzhwinter 提交于 3月 15, 2018

* "exported scatter to python"

* Revert ""exported scatter to python""

This reverts commit 38745a62.

* "polish scatter and export to python"

71400711

T
Merge pull request #9067 from luotao1/with_fluid · cf2addd2
由 Tao Luo 提交于 3月 15, 2018
```
enable WITH_FLUID option
```
cf2addd2
C
Merge pull request #9072 from chengduoZH/feature/refine_parallel_do · 11c43e5d
由 chengduo 提交于 3月 15, 2018
```
Refine parallel_do_grad
```
11c43e5d
A

Add changes to channel that are needed for select op (#9084) · 41894da1
由 Abhinav Arora 提交于 3月 14, 2018

41894da1

14 3月, 2018 24 次提交
- Y
  Merge pull request #9051 from panyx0718/profiler · a4b0e4a1
  由 Yibing Liu 提交于 3月 14, 2018
```
Add a test to ensure profiler works on multi-gpu
```
  a4b0e4a1
- X
  
  Reproduce profiler failure on multi-gpu. · ebde3b1a
  由 Xin Pan 提交于 3月 13, 2018
  
  ebde3b1a
- X
  Merge pull request #9077 from kuke/fix-9052 · 86018641
  由 Xin Pan 提交于 3月 14, 2018
```
Move back operator's event to RunImpl()
```
  86018641
- Y
  
  Move back operator's event to RunImpl() · 90afbd28
  由 Yibing Liu 提交于 3月 14, 2018
  
  90afbd28
- T
  Merge pull request #9016 from weixing02/doc · 72763668
  由 Tao Luo 提交于 3月 14, 2018
```
Fixed some outdated contents in Contribute Documentations
```
  72763668
- X
  
  Better timeline · 4840c49b
  由 Xin Pan 提交于 3月 14, 2018
  
  4840c49b
- C
  
  refine parallel_do_grad · ef28e7de
  由 chengduoZH 提交于 3月 14, 2018
  
  ef28e7de
- L
  
  enable WITH_FLUID option · 76e1c6af
  由 Luo Tao 提交于 3月 14, 2018
  
  76e1c6af
- Y
  
  Fix models #725 · 41d8bcdc
  由 Yu Yang 提交于 3月 14, 2018
  
  41d8bcdc
- Y
  Merge pull request #8991 from reyoung/feature/shuffle_reader · 48f213e5
  由 Yu Yang 提交于 3月 14, 2018
```
Feature/shuffle reader
```
  48f213e5
- C
  Merge pull request #8843 from zhouhanqing/Paddle-ReduceProd · 881c5227
  由 Cao Ying 提交于 3月 14, 2018
```
Add product reduction for reduce op.
```
  881c5227
- C
  Merge pull request #8934 from chengduoZH/feature/Enhance_regularizer_py · 5a159f34
  由 chengduo 提交于 3月 14, 2018
```
Enhance regularizer.py
```
  5a159f34
- 武
  Merge pull request #9044 from typhoonzero/fix_deadlink · ba65d54d
  由武毅提交于 3月 14, 2018
```
Fix distributed train doc deadlink
```
  ba65d54d
- T
  Merge pull request #8927 from ranqiu92/api_std · ad6b59e0
  由 Tao Luo 提交于 3月 14, 2018
```
Add api doc std
```
  ad6b59e0
- Correct links error · 3d6a4b6c
  由 _青葱提交于 3月 14, 2018
  
  3d6a4b6c
- Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into doc · ce557989
  由 _青葱提交于 3月 14, 2018
```
Merge branch develop
```
  ce557989
- Adjust · ca0e3249
  由 _青葱提交于 3月 14, 2018
  
  ca0e3249
- T
  
  fix deadlink · 7957e86c
  由 typhoonzero 提交于 3月 14, 2018
  
  7957e86c
- R
  
  Update api doc std and fc doc · fc0f92c2
  由 ranqiu 提交于 3月 14, 2018
  
  fc0f92c2
- Adjust some contents · cc1650c9
  由 _青葱提交于 3月 14, 2018
  
  cc1650c9
- C
  
  add regularization for test_machine_tranlation · 93107ce1
  由 chengduoZH 提交于 3月 14, 2018
  
  93107ce1
- R
  
  Refine api_doc_std_cn · a78b7602
  由 ranqiu 提交于 3月 14, 2018
  
  a78b7602
- 武
  
  Feature/send recv can now retry (#9027) · d13ce358
  由武毅提交于 3月 14, 2018
  
  d13ce358
- D
  Refine/nccl (#9009) · 14fe40aa
  由 dzhwinter 提交于 3月 14, 2018
```
* "Refine nccl op"

* "refine code "

* "refine nccl code"
```
  14fe40aa

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功