Commits · 9a729aec71b81eaac1e3bb2433f0906bfdcddae0 · PaddlePaddle / Paddle

12 Dec, 2017 1 commit

Refine device context (#6433) · 61ec0b95

Committed by QI JUN on Dec 12, 2017

The main changes are as follows:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
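
The first item in the list above can be illustrated with a minimal C++ sketch. It is not code from the Paddle repository: `CPUDeviceContext` here is an empty stand-in, and `ScaleFunctor` and `ScaleKernel` are hypothetical names chosen for illustration. The sketch only shows the shape of the change, where both the functor and the kernel are parameterized on a device context type and the kernel hands its context to the functor.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-in for a device context class. The real Paddle class
// carries device resources that are omitted in this sketch.
struct CPUDeviceContext {};

// Primary template: a math functor parameterized on the device context type
// (rather than on a Place type).
template <typename DeviceContext, typename T>
struct ScaleFunctor;

// CPU specialization: multiplies every element by `factor` in place.
template <typename T>
struct ScaleFunctor<CPUDeviceContext, T> {
  void operator()(const CPUDeviceContext& ctx, T factor,
                  std::vector<T>* data) const {
    assert(data != nullptr);
    (void)ctx;  // A real functor would use the context (for example its stream).
    for (T& value : *data) value *= factor;
  }
};

// A kernel templated on the same context type forwards its context
// directly to the functor.
template <typename DeviceContext, typename T>
struct ScaleKernel {
  void Compute(const DeviceContext& ctx, T factor, std::vector<T>* data) const {
    ScaleFunctor<DeviceContext, T>()(ctx, factor, data);
  }
};
```

Under this arrangement, supporting another device would presumably mean adding another specialization of the functor for that device's context type, while the kernel template stays unchanged.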


01 Dec, 2017 3 commits
- add grpc benchmark · c66c65cb
  Committed by typhoonzero on Dec 01, 2017
- Fix grpc compile warning (#6050) · 1b612d3a
  Committed by Yancey on Dec 01, 2017

  * fix grpc compile warn
  * update
  * -Wnon-virtual-dtor -> -Wno-non-virtual-dtor
- add switch for distributed support · 1a852861
  Committed by typhoonzero on Dec 01, 2017

28 Nov, 2017 1 commit

Send recv op (#5520) · 0a8a86e0

Committed by 武毅 on Nov 28, 2017

* WIP send recv op

* WIP send recv

* put grpc impl in details

* put grpc impl in details

* update wip

* update proto

* update proto

* update proto

* clean cmake

* wip on op implementations

* wip on op implementations

* compile ok adding ut

* wip unitest

* add extern cares for linking

* wip add ut

* working version send recv

* revert optimizer.py

* update test cmake

* add libtool to dockerfile

* update cmake dependency

* update cmake depends

* update cmake grpc depends

* fix cmake dependency

* fix compile error

* fix compile

* follow comments

* update

* update copyfrom


27 Nov, 2017 2 commits

Compelete max_sequence_len_op (#5913) · 33fa2dfb
Committed by fengjiayi on Nov 27, 2017

Conv cudnn 3d (#5783) · a06bec12

Committed by 武毅 on Nov 27, 2017

* conv cudnn 3d

* update test case

* update

* update

* follow comments and remove groups from helper

* update

* refine

* update

* follow comments2

* update

* fix compile


26 Nov, 2017 1 commit

Feature/copytensor (#5455) · 45062fe5

Committed by dzhwinter on Nov 26, 2017

* "make global tensor function independently"

* "replace functor"

* "fix inline template error"

* "fix tensor array with CopyFrom"

* "fix other case use CopyFrom"

* "move the op interface hardly"

* "fix operators"

* "fix typo"

* "delete dynamic recurrent rnn and fix gru_unit in debugmode"

* "fix unique_ptr copy"

* "fix cuda copy"

* "fix namespace error"

* "removed nccl python test"

* "fix include error"

* "fix typo"

* fix copy util test


22 Nov, 2017 1 commit
- test unpool ok cpu · 90f664d0
  Committed by sweetsky0901 on Nov 22, 2017

21 Nov, 2017 2 commits
- add unpool2d make ok · 45a8c9dd
  Committed by sweetsky0901 on Nov 21, 2017
- add unpool · bc45335e
  Committed by sweetsky0901 on Nov 21, 2017

18 Nov, 2017 1 commit
- Adding logical operators for beam search and control flow (#5708) · 6cfcf624
  Committed by Abhinav Arora on Nov 18, 2017

16 Nov, 2017 1 commit

support adagrad sparse update (#5272) · d7bf372d

Committed by QI JUN on Nov 15, 2017

* adam sparse support

* fix gpu build error

* fix ci

* fix ci

* fix adagrad sparse update bug

* fix gpu build error


13 Nov, 2017 3 commits

add conv3d_trans_cudnn_op · 3a507b44
Committed by chengduoZH on Nov 13, 2017

BeamSearchDecodeOp (#5498) · a4106278

Committed by Qiao Longfei on Nov 13, 2017

* init trieconcat_op

* add basic implementation

* add test

* add more test

* update unit test

* add PackAllSteps test

* fix PackAllSteps

* all test passed

* clean code

* remove state inside helper

* rename prob to score

* optimize RemoveFromEnd

* use deconstructor to delete BeamNode recursively

* optimize interface

* add comment to interface

* optimizer data structure

* use template to define the type of score

* use template parameter for BeamHelper

* change father to parent

* rename TrieConcat to BeamSearchOutConcat

* use LoDTensorArray

* rename BeamSearchOutConcat to BeamSearchDecode

* refine code

* remain all candidate sentence in beam_search_decode_op, do not consider endid

* use unique_ptr

* fix compare bug

* fix lod compile problem

Fix compling for softmax_with_cross_entropy_op. · 91d4fc69
Committed by dangqingqing on Nov 13, 2017

11 Nov, 2017 2 commits
- Use G++ to compile some cu operators. · f5e36765
  Committed by dangqingqing on Nov 11, 2017
- this for maxout op new add · 058bdd34
  Committed by wanghaox on Nov 11, 2017

08 Nov, 2017 3 commits

Feature/rnn to array to lod tensor (#5411) · f72729d4

Committed by Yu Yang on Nov 07, 2017

* Add LoDRankTable

LoD Rank Table stores the `level` of `lod`, ordered by sequence
length in descending order. It is useful when implementing dynamic RNN and
is shared by the dynamic RNN memory, dynamic RNN slice input, and dynamic
RNN slice output operators.

* Add skeleton for array_to_lod_tensor and lod_tensor_to_array

* Add VarType::LoDTensorArray

* Add PyBind of LoDTensorArray

* Add InferVarType

* Add first unittest

* Add ut

* Add unittest

* Add unittest

* Add unittests

* update

* init

* add infershape for lod_tensor_to_array_op

* compelete array_to_lod_tensor_op

* copy data

* clean code

* clean code

* Fix unittest data

* fix bugs

* fix compile error

* Refine TensorToArrayOp

* refactor array_to_lod_tensor

* Unittest

* fix bugs

* Fix unittest

* Fix unittest

* debug

* Debug

* Fix unittest

* clean code

* refactor

* use ostream

* update test

* fix gpu build error

* make gpu test pass

Add gtest for drnn · db3b49fe
Committed by Yu Yang on Nov 07, 2017

Compare Operator (#5325) · f74fb790
Committed by Yu Yang on Nov 07, 2017

* Compare Operator

* Follow comments

07 Nov, 2017 1 commit

ReadFromArray/WriteToArray op (#5407) · c9b57dcc

Committed by Yu Yang on Nov 06, 2017

* Use stable_sort in lod_rank_table

Using `stable_sort` makes debugging and testing easier, and the time
complexity is unchanged.

* Add LoDTensorArray

* Stash

* Better debug message for IsInitialized

* Stash

* Better debug message for IsInitialized

* Complete array read/write op unittests
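
To make the `stable_sort` remark in this commit concrete, here is a minimal C++ sketch of building a rank table from one LoD level. It is an illustration, not the Paddle implementation: `RankItem` and `BuildRankTable` are hypothetical names, and the LoD level is assumed to be given as a vector of offsets. Because `std::stable_sort` keeps equal-length sequences in their original order, the result is deterministic, which is the property that helps with debugging and testing.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical entry type: the original sequence index and its length.
struct RankItem {
  std::size_t index;
  std::size_t length;
};

// Builds a rank table from one LoD level given as offsets (an assumed input
// format). For example, {0, 2, 5, 7} describes three sequences of lengths
// 2, 3 and 2.
std::vector<RankItem> BuildRankTable(const std::vector<std::size_t>& offsets) {
  assert(!offsets.empty());
  std::vector<RankItem> items;
  for (std::size_t i = 0; i + 1 < offsets.size(); ++i) {
    assert(offsets[i] <= offsets[i + 1]);
    items.push_back({i, offsets[i + 1] - offsets[i]});
  }
  // Sort by length in descending order. stable_sort preserves the original
  // relative order of sequences that have the same length.
  std::stable_sort(items.begin(), items.end(),
                   [](const RankItem& a, const RankItem& b) {
                     return a.length > b.length;
                   });
  return items;
}
```

For the offsets `{0, 2, 5, 7}`, sequence 1 (length 3) comes first, followed by sequences 0 and 2 (both length 2) in their original order.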


04 Nov, 2017 1 commit

Add LoDRankTable (#5349) · 74849158

Committed by Yu Yang on Nov 03, 2017

* Add LoDRankTable

LoD Rank Table stores the `level` of `lod`, ordered by sequence
length in descending order. It is useful when implementing dynamic RNN and
is shared by the dynamic RNN memory, dynamic RNN slice input, and dynamic
RNN slice output operators.

* Add InferVarType
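
The commit message does not say why descending order helps a dynamic RNN. One plausible reason, offered here as an assumption rather than as a description of Paddle's operators, is that once sequences are sorted by length, the sequences still active at any time step form a prefix of the table. The hypothetical helper below, `ActiveSequencesPerStep`, computes how many sequences remain active at each step under that assumption.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <functional>
#include <vector>

// Given sequence lengths already sorted in descending order, returns the
// number of sequences that still have an element at each time step.
// Illustrative only: the name and interface are assumptions.
std::vector<std::size_t> ActiveSequencesPerStep(
    const std::vector<std::size_t>& lengths) {
  assert(std::is_sorted(lengths.begin(), lengths.end(),
                        std::greater<std::size_t>()));
  std::vector<std::size_t> active;
  if (lengths.empty()) return active;
  std::size_t alive = lengths.size();
  // The longest sequence (the first entry) determines the number of steps.
  for (std::size_t step = 0; step < lengths.front(); ++step) {
    // Drop finished sequences from the tail. Thanks to the descending
    // order, the survivors are always the first `alive` entries.
    while (alive > 0 && lengths[alive - 1] <= step) --alive;
    active.push_back(alive);
  }
  return active;
}
```

With lengths `{3, 2, 2}`, all three sequences are active for the first two steps and only the longest one remains for the third.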

03 Nov, 2017 1 commit
- Refine sequence max-pooling and add unit testing of gradient check. · afc6343e
  Committed by dangqingqing on Nov 02, 2017

02 Nov, 2017 1 commit

Rewrite StaticRNN with Executor (#5224) · 0a32e74d

Committed by Yu Yang on Nov 01, 2017

* Init commit

* Make executor use ProgramDescBind

* Change Attribute from BlockDesc to BlockDescBind

* Since we will get the program desc in RNN, just BlockDesc is not
  enough.

* Add DeviceContext to Executor API

* Rewrite RNN

* Pass Python

* AddBiasOp does not care num_flatten_dims

* Stash

* Fix MacOS Compile

* Pass RNN forward

* add python test

* refactor test

* Make compile pass

* add gradopmaker

* First draft done

* Polish code

* add grad op maker and grad infershape

* Polish code

* Fix backward.cc bug

* Fix infershape

* Rename function

* add backward test

* simplify recurrent test

* Update

* Pass unittest

* Add comments & refine test

* Add comments

* refactor test

* Complete Unittest

* fix StepScopes enforce

* Remove unused unittest

* no type error

* Update

* Make RNN Pass unittest

31 Oct, 2017 1 commit
- Add GRU Operator · b87eabae
  Committed by guosheng on Oct 18, 2017

30 Oct, 2017 1 commit
- fix code format and doc · 5173b8d8
  Committed by chengduoZH on Oct 30, 2017

27 Oct, 2017 2 commits

write together · 51113cfe
Committed by chengduoZH on Oct 27, 2017

add sparse support for sum op (#5093) · 7f8574c0

Committed by QI JUN on Oct 26, 2017

* add sparse support for sum op

* typo fix

* fix gpu build error

* fix unittest error

* typo fix

* infer var type and shape in op_test

* follow comments

* fix build error

* bypass some unittests depend on NetOp

26 Oct, 2017 6 commits
- follow comments · 99c6f44a
  Committed by chengduoZH on Oct 26, 2017
- Add deconv3d op · 56bbfd1a
  Committed by chengduoZH on Oct 26, 2017
- Add pool2d cudnn · 1bb0e294
  Committed by chengduoZH on Oct 11, 2017
- write conv2d and conv3d together · eafbbc11
  Committed by chengduoZH on Oct 26, 2017
- Feature/save op (#5090) · efc2464f
  Committed by Yu Yang on Oct 25, 2017

  * Init
  * Stash
  * Polish SaveLoadOp
  * Fix CI
  * Polish code
  * Save GPU Tensor
  * Stash
  * Fix CI
- "polish cmake file" · 626ff3b7
  Committed by Dong Zhihong on Oct 25, 2017

25 Oct, 2017 2 commits

"Serialize LoDTensor, Save/Restore model" (#4602) · fd2eb550

Committed by dzhwinter on Oct 24, 2017

* "add model format design doc"

* "add restore function"

* "add parse protobuf"

* "move necessary information to saver.proto"

* "format code"

* "add gpu option"

* "add lod info"

* "add saveop python test wrapper"

* "checkpoint reuse save operator"

* "rewrite model format design doc"

* "async support needed"

* "fix run once"

* "fix doc based on comments"

* "refine based on comments"

* "fix based comments"

* "remove persistable flag from framework.proto"

* "add IndicateDataType to restore op"

* "add save test"

* "modify save restore code"

* "modified the restore logic"

* rm checkpoint_op.cc

* rm test_checkpoint_op.py

* "get inputs outputs name from execution context"

* Saving each variable to a independent file

* Fix bugs

* Rewrite save_restore_op_test with new Python framework

* Move `SaveOp` and `RestoreOp` from OpWithKernel to OpBase

* Refine unit test of SaveOp and RestoreOp

* fix compile errorwq

write nccl c++ test case · ef257e6d
Committed by Dong Zhihong on Oct 24, 2017

24 Oct, 2017 2 commits
- "add init allreduce test" · 50f04dca
  Committed by Dong Zhihong on Oct 23, 2017
- "add register gpu macro" · 423d7438
  Committed by Dong Zhihong on Oct 23, 2017

23 Oct, 2017 1 commit
- Add sequence_conv_op · f2ccef26
  Committed by chengduoZH on Oct 23, 2017

PaddlePaddle / Paddle: last synced successfully more than 1 year ago
