Commits · 6772dfa0bd25db4151009fc3568e120d42961b8d · PaddlePaddle / PaddleDetection

14 Nov, 2017 (4 commits)

add split and merge lod tensor operator (#5537) · f07a226a

Committed by QI JUN on Nov 14, 2017

* add split lod tensor operator

* add more test cases

* clean code

* add merge lod tensor operator

* fix bug

* clean code

* add grad operator

* make mask support GPU

* add comments

f07a226a
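
The commit message above does not spell out the operators' semantics. As a rough illustration only, assuming the split routes each row to a "true" or "false" output according to a boolean mask and the merge restores the original order, the idea can be sketched in plain Python. The function names `split_by_mask` and `merge_by_mask` are hypothetical, and LoD (sequence boundary) handling is deliberately left out.

```python
from typing import List, Sequence, Tuple


def split_by_mask(rows: Sequence, mask: Sequence[bool]) -> Tuple[List, List]:
    """Route each row to the 'true' or 'false' output according to mask."""
    if len(rows) != len(mask):
        raise ValueError("rows and mask must have the same length")
    out_true = [row for row, keep in zip(rows, mask) if keep]
    out_false = [row for row, keep in zip(rows, mask) if not keep]
    return out_true, out_false


def merge_by_mask(out_true: Sequence, out_false: Sequence,
                  mask: Sequence[bool]) -> List:
    """Inverse of split_by_mask: put the rows back in their original order."""
    if len(out_true) + len(out_false) != len(mask):
        raise ValueError("outputs do not match the mask length")
    it_true, it_false = iter(out_true), iter(out_false)
    # Walk the mask and pull the next row from whichever half it came from.
    return [next(it_true) if keep else next(it_false) for keep in mask]
```

Splitting and then merging with the same mask returns the original rows, which is the property a conditional block relies on when it processes the two halves separately.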

Assign Operator. (#5531) · 7c1755d9

Committed by Yu Yang on Nov 13, 2017

* Assign Operator.

Out=X, when type in [LoDTensor/SelectedRows/LoDTensorArray]

* Follow comments

7c1755d9

Fix sequence_pool_op in debug mode · 983502d2

Committed by xuwei06 on Nov 10, 2017

The rank of the tensor returned by the chip() function is changed. In release mode, eigen_assert is not enabled, so the dimension mismatch is not detected.

983502d2

Fix matmul_op for debug mode · 6a6e4d8d

Committed by xuwei06 on Nov 10, 2017

The dimension is not set correctly and is not being checked in release mode because eigen_assert is not enabled.

6a6e4d8d

13 Nov, 2017 (6 commits)

refine var name · c5d71077
Committed by peterzhang2029 on Nov 13, 2017

c5d71077

fix warning · 0a6262d5
Committed by peterzhang2029 on Nov 13, 2017

0a6262d5

BeamSearchDecodeOp (#5498) · a4106278

Committed by Qiao Longfei on Nov 13, 2017

* init trieconcat_op

* add basic implementation

* add test

* add more test

* update unit test

* add PackAllSteps test

* fix PackAllSteps

* all test passed

* clean code

* remove state inside helper

* rename prob to score

* optimize RemoveFromEnd

* use destructor to delete BeamNode recursively

* optimize interface

* add comment to interface

* optimize data structure

* use template to define the type of score

* use template parameter for BeamHelper

* change father to parent

* rename TrieConcat to BeamSearchOutConcat

* use LoDTensorArray

* rename BeamSearchOutConcat to BeamSearchDecode

* refine code

* keep all candidate sentences in beam_search_decode_op, do not consider endid

* use unique_ptr

* fix compare bug

* fix lod compile problem

a4106278
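
The bullets above mention a `BeamNode` with a parent link and a per-node score. A minimal sketch of how such a structure can be walked to recover one candidate sentence is given below. This is an illustration under assumptions, not the operator's actual code: the field names `word_id`, `score` and `parent` and the helper `backtrack` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class BeamNode:
    """One decoded token; `parent` points at the previous step's node."""
    word_id: int
    score: float
    parent: Optional["BeamNode"] = None


def backtrack(leaf: BeamNode) -> Tuple[List[int], List[float]]:
    """Follow parent links from a leaf to the root and return the sentence
    (word ids and their scores) in forward order."""
    word_ids: List[int] = []
    scores: List[float] = []
    node: Optional[BeamNode] = leaf
    while node is not None:
        word_ids.append(node.word_id)
        scores.append(node.score)
        node = node.parent
    word_ids.reverse()
    scores.reverse()
    return word_ids, scores
```

Because sibling leaves share their prefix nodes, every candidate that survives to the last step can be recovered with one walk per leaf.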

trigger ci for lod_reset_op · c6275eca
Committed by Yibing Liu on Nov 13, 2017

c6275eca

bug fix in lod_reset_op: cast int to size_t in LoD · 9bc71087
Committed by Yibing Liu on Nov 13, 2017

9bc71087

refine notation in bilinear_tensor_product_op.h · 5f99ae90
Committed by peterzhang2029 on Nov 13, 2017

5f99ae90

11 Nov, 2017 (4 commits)

"fix ci failed" (#5567) · 23b9bc0a
Committed by dzhwinter on Nov 10, 2017
```
* "fix ci failed"

* "comment out seq_concat op to unblock PRs"
```
23b9bc0a

Fixing duplicate struct name TensorSetConstant. (#5532) · 58b4c9af

Committed by emailweixu on Nov 10, 2017

The TensorSetConstant struct is defined in both math_function.cc and math_function.cu. Somehow the release build handles this correctly, but in the debug build, set_constant_with_place() in math_function.cu uses the TensorSetConstant from math_function.cc and crashes.

58b4c9af

Add Scope::Rename (#5534) · edb22c2f
Committed by Yu Yang on Nov 10, 2017
```
it is useful in gradient phase of an operator with block
```
edb22c2f

Fix a dead lock bug for dyload/nccl.h when nccl lib cannot be loaded (#5533) · 2378679a

Committed by emailweixu on Nov 10, 2017

It is caused by a bug in std::call_once described in https://stackoverflow.com/questions/41717579/stdcall-once-hangs-on-second-call-after-callable-threw-on-first-call. It is likely caused by a deeper bug in pthread_once, which is discussed in https://patchwork.ozlabs.org/patch/482350/

2378679a
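
The commit message does not say how the dead lock was avoided. One common way to sidestep a "once" primitive that misbehaves after the callable throws is a plain lock plus a done flag, so that a failed attempt leaves the guard retryable. The sketch below is a Python analogue of that pattern only, not the actual change in dyload/nccl.h; the class name `RetryableOnce` is hypothetical.

```python
import threading
from typing import Callable


class RetryableOnce:
    """Run an initializer at most once successfully.

    If the initializer raises, the lock is released by the `with` block and
    the done flag stays False, so a later call can retry instead of hanging.
    """

    def __init__(self) -> None:
        self._lock = threading.Lock()
        self._done = False

    def call(self, fn: Callable[[], None]) -> bool:
        """Return True if fn ran on this call, False if it had already run."""
        with self._lock:
            if self._done:
                return False
            fn()  # an exception here propagates; _done is not set
            self._done = True
            return True
```

A loader that fails because the library is missing raises on the first call, and a second call simply tries again rather than blocking forever.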

10 Nov, 2017 (6 commits)

Add using case. · d7e7a1d7
Committed by yangyaming on Nov 10, 2017

d7e7a1d7

Fix seq concat op with refactoring LoD (#5486) · e5d810b9
Committed by Yancey on Nov 10, 2017
```
* fix seq_concat with refactoring LoD

* fix failed unit test

* rename function name
```
e5d810b9

Refine .cc and .h, more unit test more readable. · d04c8538
Committed by yangyaming on Nov 10, 2017

d04c8538

IndicateDataType --> GetKernelType · 3c84ebec
Committed by yangyaming on Nov 10, 2017

3c84ebec

feature/while_op (#5502) · 40367d18

Committed by Yang Yang(Tony) on Nov 09, 2017

* first commit

* Python API for while op

* Python Unittest for simple while_op forward

* fix out to be list

* Fix UT

* VarType

* Fix several bugs

* Fix bug

* Fix bug

* Fix Bug

* Fix bug

* Fix unittest

* Remove debug log

* Add comments

* add PADDLE_ENFORCE

* while_grad_op first commit

* Add `BlockDescBind::FindRecursiveOrCreateVar()` and fix bugs

* refine code

* fix unittest bug

40367d18

Fix attribute naming for momentum_op (#5453) · 2e355f03

Committed by Siddharth Goyal on Nov 09, 2017

* Fix attribute naming for momentum_op

* Fix minor typo in comment

* Fix attribute name

* Fix names in test_optimizer

* Fix python wrapper

2e355f03

09 Nov, 2017 (10 commits)

remove header file paddle/framework/eigen.h · cceed081
Committed by dangqingqing on Nov 09, 2017

cceed081

follow comments. · d60fe75a
Committed by dangqingqing on Nov 09, 2017

d60fe75a

remove PADDLE_USE_MKL · 7835d493
Committed by Luo Tao on Nov 09, 2017

7835d493

refine docString · 5cf82041
Committed by peterzhang2029 on Nov 09, 2017

5cf82041

Adapt to new interface. · 0d9ba3da
Committed by yangyaming on Nov 09, 2017

0d9ba3da

Add grad for lodtensor array ops (#5461) · b698d19b

Committed by fengjiayi on Nov 08, 2017

* Add LoDRankTable

LoD Rank Table stores the `level` of `lod`, ordered by sequence
length in descending order. It is useful when implementing dynamic RNN and
is shared by the dynamic RNN memory, dynamic RNN slice input and dynamic
RNN slice output operators.

* Add skeleton for array_to_lod_tensor and lod_tensor_to_array

* Add VarType::LoDTensorArray

* Add PyBind of LoDTensorArray

* Add InferVarType

* Add first unittest

* Add ut

* Add unittest

* Add unittest

* Add unittests

* update

* init

* add infershape for lod_tensor_to_array_op

* complete array_to_lod_tensor_op

* copy data

* clean code

* clean code

* Fix unittest data

* fix bugs

* fix compile error

* Refine TensorToArrayOp

* refactor array_to_lod_tensor

* Unittest

* fix bugs

* Fix unittest

* Fix unittest

* debug

* Debug

* Fix unittest

* Add grad for ops

* Debug

* Fix a bug

* fix a bug

* fix a bug

b698d19b
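
The LoD Rank Table described in this commit orders the sequences of one LoD level by length, longest first. A minimal sketch of that ordering follows, assuming the level is stored as cumulative offsets such as `[0, 2, 5, 6]` (an assumption about the representation) and using a hypothetical helper name `build_rank_table`.

```python
from typing import List, Tuple


def build_rank_table(lod_level: List[int]) -> List[Tuple[int, int]]:
    """Return (sequence_index, length) pairs sorted by length, descending.

    `lod_level` is assumed to hold cumulative offsets, so sequence i spans
    lod_level[i] .. lod_level[i + 1].
    """
    lengths = [end - start for start, end in zip(lod_level, lod_level[1:])]
    # sorted() is stable, so sequences of equal length keep their input order.
    return sorted(enumerate(lengths), key=lambda item: item[1], reverse=True)


# Offsets [0, 2, 5, 6] describe three sequences of lengths 2, 3 and 1.
table = build_rank_table([0, 2, 5, 6])
```

With the longest sequence first, the number of sequences still active at time step t can only shrink as t grows, which is what lets a dynamic RNN trim its batch from the tail at each step.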

Add `lod_array_length` operator · d24d8c20
Committed by Yang Yu on Nov 08, 2017

d24d8c20

Add increment op · 6d41bfb7
Committed by Yang Yu on Nov 08, 2017

6d41bfb7

Stash · 568270f3
Committed by Yang Yu on Nov 08, 2017

568270f3

Refine ChunkEvalOp by following comments and rewrite the doc · c8dcd9a9
Committed by guosheng on Nov 09, 2017

c8dcd9a9

08 Nov, 2017 (10 commits)

fix CI · b3a86b6d
Committed by wwhu on Nov 08, 2017

b3a86b6d

Remove fill_constant_batch_size_like_op.h and clean some operator codes. · e5791dd1
Committed by dangqingqing on Nov 08, 2017

e5791dd1

Static lstm sanity check (#5365) · 870650d8

Committed by Yang Yang(Tony) on Nov 08, 2017

* add fill_constant_batch_size_like_op to rnn h_boot

* first commit

* merge develop; fix conflict

* update to main_program

870650d8

update · 11ee50ce
Committed by typhoonzero on Nov 08, 2017

11ee50ce

refine memory transform · 47269273
Committed by peterzhang2029 on Nov 08, 2017

47269273

fix accuracy cudamemset · 6308ccc2
Committed by typhoonzero on Nov 08, 2017

6308ccc2

CompareOp's kernel device type is decided by input tensor place · 3187451a

Committed by Yang Yu on Nov 07, 2017

CompareOp can run on CPU even when other operators are running on GPU, since
operations like comparing control flags should be performed only on CPU

3187451a
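
The rule stated in this commit, that the compare kernel's device follows the input tensor's place and not the device the rest of the program runs on, can be illustrated with a toy model. Everything in the sketch is hypothetical (the `Tensor` class, the string places and both function names); it only shows the selection rule, not Paddle's kernel dispatch.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class Tensor:
    place: str                 # "CPU" or "GPU" in this toy model
    values: Tuple[int, ...]


def compare_kernel_place(inputs: Sequence[Tensor]) -> str:
    """Pick the kernel's device from the inputs themselves."""
    places = {tensor.place for tensor in inputs}
    if len(places) != 1:
        raise ValueError("all inputs must live on the same place")
    return places.pop()


def less_than(x: Tensor, y: Tensor, executor_place: str = "GPU") -> Tensor:
    """Elementwise x < y; executor_place is deliberately ignored."""
    place = compare_kernel_place([x, y])
    result = tuple(int(a < b) for a, b in zip(x.values, y.values))
    return Tensor(place, result)
```

A loop counter and its limit that live on CPU are therefore compared on CPU, and the resulting flag stays there, even if the surrounding operators are scheduled on GPU.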

Rename shrink_state -> shrink_rnn_memory · 01425309
Committed by Yang Yu on Nov 07, 2017
```
Follow comments
```
01425309

fix attr name · cdf5e871
Committed by chengduoZH on Nov 08, 2017

cdf5e871

Feature/rnn to array to lod tensor (#5411) · f72729d4

Committed by Yu Yang on Nov 07, 2017

* Add LoDRankTable

LoD Rank Table stores the `level` of `lod`, ordered by sequence
length in descending order. It is useful when implementing dynamic RNN and
is shared by the dynamic RNN memory, dynamic RNN slice input and dynamic
RNN slice output operators.

* Add skeleton for array_to_lod_tensor and lod_tensor_to_array

* Add VarType::LoDTensorArray

* Add PyBind of LoDTensorArray

* Add InferVarType

* Add first unittest

* Add ut

* Add unittest

* Add unittest

* Add unittests

* update

* init

* add infershape for lod_tensor_to_array_op

* complete array_to_lod_tensor_op

* copy data

* clean code

* clean code

* Fix unittest data

* fix bugs

* fix compile error

* Refine TensorToArrayOp

* refactor array_to_lod_tensor

* Unittest

* fix bugs

* Fix unittest

* Fix unittest

* debug

* Debug

* Fix unittest

* clean code

* refactor

* use ostream

* update test

* fix gpu build error

* make gpu test pass

f72729d4
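
The commit message names `lod_tensor_to_array` and `array_to_lod_tensor` without describing them. One plausible reading, given that the rank table is said to serve the dynamic RNN slice input and output operators, is that the first slices a batch of variable-length sequences into per-time-step batches in rank-table order and the second undoes that. The sketch below works under that assumption with flat Python lists and cumulative offsets; it is not the operators' actual implementation, and the helper `_rank_table` is hypothetical.

```python
from typing import List, Tuple


def _rank_table(lod_level: List[int]) -> List[Tuple[int, int]]:
    """(sequence_index, length) pairs, longest sequence first."""
    lengths = [end - start for start, end in zip(lod_level, lod_level[1:])]
    return sorted(enumerate(lengths), key=lambda item: item[1], reverse=True)


def lod_tensor_to_array(data: List, lod_level: List[int]) -> List[List]:
    """Step t holds element t of every sequence longer than t."""
    table = _rank_table(lod_level)
    max_len = table[0][1] if table else 0
    return [
        [data[lod_level[idx] + t] for idx, length in table if length > t]
        for t in range(max_len)
    ]


def array_to_lod_tensor(steps: List[List], lod_level: List[int]) -> List:
    """Inverse of lod_tensor_to_array for the same offsets."""
    table = _rank_table(lod_level)
    out = [None] * lod_level[-1]
    for t, step in enumerate(steps):
        active = [idx for idx, length in table if length > t]
        for idx, value in zip(active, step):
            out[lod_level[idx] + t] = value
    return out
```

For offsets `[0, 2, 5, 6]` the step batches shrink from three rows to two to one, so each RNN step only touches the sequences that are still running.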

PaddlePaddle / PaddleDetection · last synced successfully nearly 2 years ago
