提交 · dafd449c68c23c642ba117a55135c823c6594772 · PaddlePaddle / Paddle

14 12月, 2017 1 次提交
- F
  
  Unify `step_block` and `block` to `sub_block` · dafd449c
  由 fengjiayi 提交于 12月 14, 2017
  
  dafd449c
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

11 12月, 2017 1 次提交

Fix gcc4.9 (#6442) · 95924686

由 Yiqun Liu 提交于 12月 11, 2017

* Fix compiling error of gcc4.9.

* Refine the check of cxx compiler flags in api/CMakeLists.txt.

95924686

08 12月, 2017 1 次提交

Nmt decoder train (#6367) · 36fcc95c

由 Qiao Longfei 提交于 12月 08, 2017

* init decoder_trainer

* can run

* fix lod

* add sharelod to cross_entropy_grad_op

* add avg_cost to fetch list

* modify learning rate

* can run

* optimie code

* add early exit

* fix print

* revert test_understand_sentiment_conv.py

* add act to fc

36fcc95c

06 12月, 2017 1 次提交

Feature/while op sentiment analysis (#6282) · 229c2e78

由 Yu Yang 提交于 12月 06, 2017

* Add DataFeeder

A v2 API like data feeder for book demos.
We can feed data directly from reader.

* Fix CI

* Add an unittest for while/rnn op forward

* Add unittest for raw while op backward

* Fix CI

* Complete Dynamic RNN

229c2e78

05 12月, 2017 1 次提交
- D
  
  Remove the cuda stream synchronization between each operator. · 4e451a34
  由 dangqingqing 提交于 12月 05, 2017
  
  4e451a34
04 12月, 2017 1 次提交

While op forward for sentimental analysis (#6140) · d5e32794

由 Yu Yang 提交于 12月 04, 2017

* Add DataFeeder

A v2 API like data feeder for book demos.
We can feed data directly from reader.

* Fix CI

* Add an unittest for while/rnn op forward

* Add unittest for raw while op backward

* Fix CI

d5e32794

30 11月, 2017 2 次提交
- F
  
  Add GetInputsElementDim (#6091) · a38c1512
  由 fengjiayi 提交于 11月 30, 2017
  
  a38c1512
- Y
  Fix ShareLoD bug (#6084) · 35453df1
  由 Yu Yang 提交于 11月 30, 2017
```
Fix #6087
```
  35453df1
28 11月, 2017 1 次提交

武

Send recv op (#5520) · 0a8a86e0

由武毅提交于 11月 28, 2017

* WIP send recv op

* WIP send recv

* put grpc impl in details

* put grpc impl in details

* update wip

* update proto

* update proto

* update proto

* clean cmake

* wip on op implementations

* wip on op implementations

* compile ok adding ut

* wip unitest

* add extern cares for linking

* wip add ut

* working version send recv

* revert optimizer.py

* update test cmake

* add libtool to dockerfile

* update cmake dependency

* update cmake depends

* update cmake grpc depends

* fix cmake dependency

* fix compile error

* fix compile

* follow comments

* update

* update copyfrom

0a8a86e0

27 11月, 2017 1 次提交
- Q
  
  refine test_recognize_digits_mlp and format codes (#5937) · b28b2f17
  由 QI JUN 提交于 11月 27, 2017
  
  b28b2f17
26 11月, 2017 1 次提交

Feature/copytensor (#5455) · 45062fe5

由 dzhwinter 提交于 11月 26, 2017

* "make global tensor function independently"

* "replace functor"

* "fix inline template error"

* "fix tensor array with CopyFrom"

* "fix other case use CopyFrom"

* "move the op interface hardly"

* "fix operators"

* "fix typo"

* "delete dynamic recurrent rnn and fix gru_unit in debugmode"

* "fix unique_ptr copy"

* "fix cuda copy"

* "fix namespace error"

* "removed nccl python test"

* "fix include error"

* "fix typo"

* fix copy util test

45062fe5

24 11月, 2017 2 次提交

support testing when training and handle dropout and batch_norm operator in testing mode (#5734) · 3a76062c

由 QI JUN 提交于 11月 24, 2017

* is_training to is_test in dropout op

* handle dropout and batch_norm operator when prune pdesc in testing mode

* handle dropout and batch_norm operator when prune pdesc in testing mode

* add get_inference_program method

* fix dropout op

* fix ci

* test data after each batch training

* refine code

* refine test_book3

* fix ci

* follow comments

3a76062c

Unify dtype and datatype (#5869) · 50d670ee

由 fengjiayi 提交于 11月 24, 2017

* Change all `data_type` in Python to `dtype`

* Change `date_type` in C++ to `dtype`

* Refine

50d670ee

18 11月, 2017 2 次提交
- Q
  enforce shape of backward target to be {1} (#5745) · 569f7c47
  由 Qiao Longfei 提交于 11月 18, 2017
```
* enforce shape of backward target to be {1}

* fix test_regularizer.py

* rm unused code

* fix backward_test

* fix a type bug

* fix test_program
```
  569f7c47
- A
  
  Adding logical operators for beam search and control flow (#5708) · 6cfcf624
  由 Abhinav Arora 提交于 11月 18, 2017
  
  6cfcf624
16 11月, 2017 1 次提交

feature/while_grad_op (#5554) · 18f0c40a

由 Yang Yang(Tony) 提交于 11月 16, 2017

* first commit

* Python API for while op

* Python Unittest for simple while_op forward

* fix out to be list

* Fix UT

* VarType

* Fix several bugs

* Fix bug

* Fix bug

* Fix Bug

* Fix bug

* Fix unittest

* Remove debug log

* Add comments

* add PADDLE_ENFORCE

* while_grad_op first commit

* Add `BlockDescBind::FindRecursiveOrCreateVar()` and fix bugs

* not sure how to setdim of while outputs

* push for test

* add executor vlog

* fix bug of while_op cond

* Several enhancement for code

1. Backward always infer shape & infer var type. Since there are RENAME
variables will be created when creating backward operator, but their
shape & var types are not inferenced.
2. Never use SomePtr-> directly, since every pointer could be nullptr if
it is a function return value. Add `detail::Ref` to cast pointer to
reference safely.
3. Enhance error message for backward.
4. Infer data type of variable in `sum` and `tensor_write`

* Fix bugs of while_op gradient

* Fix several bugs of while_op grad

* fix fill zeros like

* fix 3 >= 3

* fix place holder shouldn't be null

* fail on sum op

* Fix SumOp of TensorList

* clean up

* pass while test

* fix test_array_write_read

* pass sum op

* Support int/int64 for fill_constant_batch_size_like

* Fix compile

18f0c40a

15 11月, 2017 1 次提交
- Q
  fix gitignore (#5657) · 5f9f990e
  由 QI JUN 提交于 11月 14, 2017
```
* fix gitignore

* refine cmake file
```
  5f9f990e
14 11月, 2017 2 次提交

Conditional Block Forward (#5530) · 488320a7

由 Yu Yang 提交于 11月 13, 2017

* Conditional Block Forward

* Assign Operator.

Out=X, when type in [LoDTensor/SelectedRows/LoDTensorArray]

* Stash

* Add Scope::Rename

it is useful in gradient phase of an operator with block

* ConditionalBlock Grad Done

* Add comments

* yapf format code

488320a7

Assign Operator. (#5531) · 7c1755d9

由 Yu Yang 提交于 11月 13, 2017

* Assign Operator.

Out=X, when type in [LoDTensor/SelectedRows/LoDTensorArray]

* Follow comments

7c1755d9

11 11月, 2017 1 次提交
- Y
  Add Scope::Rename (#5534) · edb22c2f
  由 Yu Yang 提交于 11月 10, 2017
```
it is useful in gradient phase of an operator with block
```
  edb22c2f
10 11月, 2017 1 次提交

feature/while_op (#5502) · 40367d18

由 Yang Yang(Tony) 提交于 11月 09, 2017

* first commit

* Python API for while op

* Python Unittest for simple while_op forward

* fix out to be list

* Fix UT

* VarType

* Fix several bugs

* Fix bug

* Fix bug

* Fix Bug

* Fix bug

* Fix unittest

* Remove debug log

* Add comments

* add PADDLE_ENFORCE

* while_grad_op first commit

* Add `BlockDescBind::FindRecursiveOrCreateVar()` and fix bugs

* refine code

* fix unittest bug

40367d18

09 11月, 2017 1 次提交
- Y
  Do not sum output if that output is not a gradient · c9fc7ba9
  由 Yang Yu 提交于 11月 08, 2017
```
* increament is default inplace
```
  c9fc7ba9
08 11月, 2017 6 次提交

Y

Fix CI Compile · 0ede2a73
由 Yang Yu 提交于 11月 07, 2017

0ede2a73

Feature/rnn to array to lod tensor (#5411) · f72729d4

由 Yu Yang 提交于 11月 07, 2017

* Add LoDRankTable

LoD Rank Table stores the `level` of `lod` which is ordered by sequence
length in descending order. It is useful when implement dynamic RNN and
is shared by dynamic RNN memory, dynamic RNN slice input and dynamic
RNN slice output operators.

* Add skeleton for array_to_lod_tensor and lod_tensor_to_array

* Add VarType::LoDTensorArray

* Add PyBind of LoDTensorArray

* Add InferVarType

* Add first unittest

* Add ut

* Add unittest

* Add unittest

* Add unittests

* update

* init

* add infershape for lod_tensor_to_array_op

* compelete array_to_lod_tensor_op

* copy data

* clean code

* clean code

* Fix unittest data

* fix bugs

* fix compile error

* Refine TensorToArrayOp

* refactor array_to_lod_tensor

* Unittest

* fix bugs

* Fix unittest

* Fix unittest

* debug

* Debug

* Fix unittest

* clean code

* refactor

* use ostream

* update test

* fix gpu build error

* make gpu test pass

f72729d4

Y

Rewrite fill_constant op · 5ee62383
由 Yu Yang 提交于 11月 07, 2017

5ee62383

Polish OpWithKernel · bbdac7f7

由 Yu Yang 提交于 11月 07, 2017

* Chage `IndicateDataType` to `GetKernelType`. Make it easier to
  understand.
* Change `OpKernelKey` to `OpKernelType`
* Make operator developers can customize which kernel the operator will
  use in runtime.

bbdac7f7

Y
Compare Operator (#5325) · f74fb790
由 Yu Yang 提交于 11月 07, 2017
```
* Compare Operator

* Follow comments
```
f74fb790
Q

Check errors for the cuda kernel calls. (#5436) · 58db07b7
由 qingqing01 提交于 11月 08, 2017

58db07b7

07 11月, 2017 3 次提交

Add unittest, backward of array read/write op (#5409) · 6cde889b

由 Yu Yang 提交于 11月 06, 2017

* Use stable_sort in lod_rank_table

It is easy to debug and test when use `stable_sort`and the time
complexity is not changed.

* Add LoDTensorArray

* Stash

* Better debug message for IsInitialized

* Stash

* Better debug message for IsInitialized

* Complete array read/write op unittests

* Add unittest, Gradient of array read/write

* Follow comments

6cde889b

Update lod_tensor.md (#5383) · 70154597

由 Yang Yang(Tony) 提交于 11月 06, 2017

An important change on lod tensor indexing. A higher level offset will be based on its next level rather than an absolute offset.

70154597

ReadFromArray/WriteToArray op (#5407) · c9b57dcc

由 Yu Yang 提交于 11月 06, 2017

* Use stable_sort in lod_rank_table

It is easy to debug and test when use `stable_sort`and the time
complexity is not changed.

* Add LoDTensorArray

* Stash

* Better debug message for IsInitialized

* Stash

* Better debug message for IsInitialized

* Complete array read/write op unittests

c9b57dcc

06 11月, 2017 3 次提交

T

refine get cuda context · 272f3e6d
由 typhoonzero 提交于 11月 06, 2017

272f3e6d

Add LoD's slice and append function (#5368) · d05c182e

由 fengjiayi 提交于 11月 05, 2017

* Add GetFineGrainedLoDLength and AppendLoD

* Follow comments and fix bugs

* fix a compile error

* fix a compile bug

d05c182e

Feature/lod tensor array (#5379) · 2be4c3cb

由 Yu Yang 提交于 11月 05, 2017

* Use stable_sort in lod_rank_table

It is easy to debug and test when use `stable_sort`and the time
complexity is not changed.

* Add LoDTensorArray

2be4c3cb

05 11月, 2017 1 次提交
- Y
  Use stable_sort in lod_rank_table (#5378) · ea2fc4cc
  由 Yu Yang 提交于 11月 04, 2017
```
It is easy to debug and test when use `stable_sort`and the time
complexity is not changed.
```
  ea2fc4cc
04 11月, 2017 2 次提交

Add acc test to image classification (#5336) · 906e2565

由 Qiao Longfei 提交于 11月 04, 2017

* add acc layer
* memory log level change from 3 to 10
* use gaussian random to init conv parameters
* use initializer
* fix import
* batch_norm use helper to create persistable var
* refine code
* train only 2 batches for test
* use g_program and g_init_program
* use XavierInitializer to init fc parameter

906e2565

Add LoDRankTable (#5349) · 74849158

由 Yu Yang 提交于 11月 03, 2017

* Add LoDRankTable

LoD Rank Table stores the `level` of `lod` which is ordered by sequence
length in descending order. It is useful when implement dynamic RNN and
is shared by dynamic RNN memory, dynamic RNN slice input and dynamic
RNN slice output operators.

* Add InferVarType

74849158

03 11月, 2017 1 次提交
- Y
  
  Fix comparing between signed and unsigned values (#5328) · 1ed5ae7a
  由 Yi Wang 提交于 11月 02, 2017
  
  1ed5ae7a
02 11月, 2017 1 次提交

Rewrite StaticRNN with Executor (#5224) · 0a32e74d

由 Yu Yang 提交于 11月 01, 2017

* Init commit

* Make executor use ProgramDescBind

* Change Attribute from BlockDesc to BlockDescBind

* Since we will get the program desc in RNN, just BlockDesc is not
  enough.

* Add DeviceContext to Executor API

* Rewrite RNN

* Pass Python

* AddBiasOp does not care num_flatten_dims

* Stash

* Fix MacOS Compile

* Pass RNN forward

* add python test

* refactor test

* Make compile pass

* add gradopmaker

* First draft done

* Polish code

* add grad op maker and grad infershape

* Polish code

* Fix backward.cc bug

* Fix infershape

* Rename function

* add backward test

* simplify recurrent test

* Update

* Pass unittest

* Add comments & refine test

* Add comments

* refactor test

* Complete Unittest

* fix StepScopes enforce

* Remove unused unittest

* no type error

* Update

* Make RNN Pass unittest

0a32e74d

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功