提交 · b5a16dca205cfd2a903e1a68bae0b1518eb5a26e · s920243400 / PaddleDetection

15 3月, 2018 5 次提交

Q
Fix a critical bug in softmax_with_cross_entropy_op backward. (#9120) · b5a16dca
由 qingqing01 提交于 3月 15, 2018
```
* Fix a critical bug in softmax_with_cross_entropy_op, which will lead to the wrong gradients.

* Enhance unit testing.
```
b5a16dca

由 Thuan Nguyen 提交于 3月 15, 2018

* Fix old documentation for channel_recv

* Initial design of CSP select

* Redesign channel implementation for Select Op

* Remove unecessary header

* Initial checkin of select op, currently will read all the conditional_op in the cases block and also pull out all channels involved in the select.

* Init python select op API

* Python select bug fix when checking op creates block

* Add case_to_execute as (a) input to select, (b) into the passed inputs into the select op

* Add in addition code for select op

* Init fibonacci test from python

* implement fibonnaci sequence test

* update fib unit test

* Improve select test cases

* Shorten non-pep-8-ed lines

* Add methods on channel needed by select op

* Fix compile issues, finish implementation, still need to debug code

* Fix issue with fibonncci test, it works now!

* Change QueueMessage callback to take in an ChannelAction enum, fix select unit test

* Fix case attributes

* Fix issue with select control flow

* Make cases - previously on each selectcase conditional_block - attributes to select

* Use class constants for type of channel

* Change select op to take in "cases" attribute

* return boolean from select callback function to tell Channel if this RECV or SEND should be executed

* Improve attributes and inputs comments on select op

* Fix issues with python unit test

* Assert fibonacci final output

* Fix issue when channel name / channel var is null for "default" case in select op

* Assert base select test output

* Make QueueMessage use shared pointer and modify the order of the callback

* Fixing the order in which the callback is called

* Move channel utility methods to paddle/fluid/operators/concurrency/channel_util

* Create channel_util and move channel util methods

* Fix crash when calling select_op

* Fix deadlock

* Fix issue of channel destructor deadlock

* Fix precommit issues

* Accidentally checked in changes to beam_search_op, reverting change.

* Fix dependency issue in concurrency cmake

* add device_context dependency for concurrency target

1e4c504e

[Speed]implement cudnn sequence softmax cudnn (#8978) · 128adf53

由 dzhwinter 提交于 3月 15, 2018

* "add softmax cudnn functor support"

* "add testing"

* "refine cmakelist"

* "sequence softmax forward speed up"

* "add softmax grad"

* "fix sequence softmax test"

* "add double precision'

* "fix softmax test"

* "add softmax cudnn support"

* "fix softmax cudnn test"

* "add softmax to nn.py"

* "fix compile bug"

* "refine cmakelist"

* "fix ci"

* "fix based on comment"

* "fix based on comments"

* "fix ci"

128adf53

Add fp16 mul op support and bind paddle fp16 to numpy fp16 (#9017) · e26f1123

由 Kexin Zhao 提交于 3月 14, 2018

* add fp16 mul op support

* small fix

* fix bug

* small fix

* fix PADDLE_WITH_CUDA compiling issue

* reorg code

* test for pybind

* treate as float16 as uint16_t in pybind

* bind np.float16 to paddle float16

* small fix

* clean code

* remove redundancy

* fix mul_op test

* address comments

* small fix

* add is_float16_supported func

e26f1123

"exported scatter to python" (#9038) · 71400711

由 dzhwinter 提交于 3月 15, 2018

* "exported scatter to python"

* Revert ""exported scatter to python""

This reverts commit 38745a62.

* "polish scatter and export to python"

71400711

14 3月, 2018 4 次提交
- X
  
  Better timeline · 4840c49b
  由 Xin Pan 提交于 3月 14, 2018
  
  4840c49b
- C
  
  refine parallel_do_grad · ef28e7de
  由 chengduoZH 提交于 3月 14, 2018
  
  ef28e7de
- 武
  
  Feature/send recv can now retry (#9027) · d13ce358
  由武毅提交于 3月 14, 2018
  
  d13ce358
- D
  Refine/nccl (#9009) · 14fe40aa
  由 dzhwinter 提交于 3月 14, 2018
```
* "Refine nccl op"

* "refine code "

* "refine nccl code"
```
  14fe40aa
13 3月, 2018 5 次提交
- C
  
  refine doc · 92e2207e
  由 chengduoZH 提交于 3月 13, 2018
  
  92e2207e
- Y
  
  Polish code · 164f2382
  由 Yu Yang 提交于 3月 13, 2018
  
  164f2382
- Y
  
  Make double_buffer reader async · f9974a4a
  由 Yu Yang 提交于 3月 13, 2018
  
  f9974a4a
- C
  
  remove concat_rows · b9397b26
  由 chengduoZH 提交于 3月 13, 2018
  
  b9397b26
- Q
  Repair nccl op test (#8575) · 7287630e
  由 QI JUN 提交于 3月 13, 2018
```
* fix nccl op unit test

* fix build error

* format code

* refine nccl related unit test

* fix build error

* add setGPUData

* clean up

* follow comments

* rm test_nccl.cu

* follow comment

* rm wait
```
  7287630e
12 3月, 2018 10 次提交
- Y
  
  Remove dims in base class · 225efa67
  由 Yu Yang 提交于 3月 12, 2018
  
  225efa67
- Q
  [Memory]More memory optimization policy (#8690) · f7e9fe57
  由 QI JUN 提交于 3月 12, 2018
```
* add memopt level

* add opt level for image classification demo

* clean code

* add delete op

* clean code

* test machine translation demo

* clean code

* clean code

* skip fill constant with force cpu

* clean code

* clean code

* refine code

* clean code

* fix bug
```
  f7e9fe57
- Y
  
  Polish double buffer reader · 2ea4a5d9
  由 Yu Yang 提交于 3月 12, 2018
  
  2ea4a5d9
- Y
  
  Fix dist compile error (#8987) · b5ef315c
  由 Yancey 提交于 3月 12, 2018
  
  b5ef315c
- Q
  Fix bug in detection_output and mAP calculation in SSD. (#8985) · b3d26cd3
  由 qingqing01 提交于 3月 12, 2018
```
* Clipping bbox in the mAP evaluator calculation.

* Fix bug in detection_output and mAP calculation in SSD.

* Fix bug in detection.py.

* Fix bug in test_detection_map_op.py.
```
  b3d26cd3
- Y
  
  Polish ShuffleReader and test · 46ae4075
  由 Yu Yang 提交于 3月 12, 2018
  
  46ae4075
- C
  
  add concat rows · f1c3ecb2
  由 chengduoZH 提交于 3月 10, 2018
  
  f1c3ecb2
- K
  
  address comments · 3b44b849
  由 Kexin Zhao 提交于 3月 11, 2018
  
  3b44b849
- Y
  
  Polish RecordIO · 7eedced8
  由 Yu Yang 提交于 3月 12, 2018
  
  7eedced8
- Y
  
  Refine · fea43077
  由 Yu Yang 提交于 3月 12, 2018
  
  fea43077
10 3月, 2018 5 次提交
- P
  MKLDNN pool2d OP kernel added (#8879) · 4730a4be
  由 pzelazko-intel 提交于 3月 10, 2018
```
* MKLDNN pool2d OP kernel added

* conv2d and pool2d MKLDNN kernels renamed

* MKLDNN conv2d kernel refactoring
```
  4730a4be
- K
  
  fix bug · 95de7617
  由 Kexin Zhao 提交于 3月 09, 2018
  
  95de7617
- K
  
  add gpu info func to get compute cap · 1998d5af
  由 Kexin Zhao 提交于 3月 09, 2018
  
  1998d5af
- K
  
  fix math function arch mismatch for older GPU · d400b419
  由 Kexin Zhao 提交于 3月 09, 2018
  
  d400b419
- F
  
  fix a potential bug in the c++ reader · 614c33fb
  由 fengjiayi 提交于 3月 10, 2018
  
  614c33fb
09 3月, 2018 10 次提交
- C
  
  enhancement look_up_table · 1509ce66
  由 chengduoZH 提交于 3月 09, 2018
  
  1509ce66
- Q
  Refine cast op (#8923) · b341bac7
  由 QI JUN 提交于 3月 09, 2018
```
* fix mac build error

* override GetExpectedKernelType for cast op

* fix typo

* add cuda unittest
```
  b341bac7
- Y
  Fix sparse update memory error for distributed training (#8837) · 84680379
  由 Yancey 提交于 3月 09, 2018
```
Fix sparse update memory error for distributed training
```
  84680379
- F
  
  uses channel to replace the traditional buffer · 35e1e0d5
  由 fengjiayi 提交于 3月 09, 2018
  
  35e1e0d5
- F
  
  fix a compile error · 6e5736e2
  由 fengjiayi 提交于 3月 09, 2018
  
  6e5736e2
- F
  
  remove HasNext · 4e517881
  由 fengjiayi 提交于 3月 09, 2018
  
  4e517881
- 武
  
  update unpushed commits for zerocopy grpc (#8900) · 9dd34e41
  由武毅提交于 3月 09, 2018
  
  9dd34e41
- Z
  
  Some comments have been modified. · 9d78971d
  由 zhouhanqing 提交于 3月 09, 2018
  
  9d78971d
- K
  Add float16 GEMM math function on GPU (#8695) · 90215b78
  由 kexinzhao 提交于 3月 08, 2018
```
* test cpu float16 data transform

* add isnan etc

* small fix

* fix containsNAN test error

* add data_type transform GPU test

* add float16 GPU example

* fix error

* fix GPU test error

* initial commit

* fix error

* small fix

* add more gemm fp16 tests

* fix error

* add utility function
```
  90215b78
- 武
  
  Performance/zero copy variable seriralization (#8839) · 45af8c1e
  由武毅提交于 3月 09, 2018
  
  45af8c1e
08 3月, 2018 1 次提交
- C
  
  Add ElementwiseOpInferVarType · 53d19f5b
  由 chengduoZH 提交于 3月 08, 2018
  
  53d19f5b

s920243400 / PaddleDetection 与 Fork 源项目一致

s920243400 / PaddleDetection
与 Fork 源项目一致