提交 · 35c1683e803462d1ae78c49a8c8fb392ff6e2d32 · s920243400 / PaddleDetection

27 12月, 2017 2 次提交

"refine kernel registrar" (#6998) · 35c1683e

由 dzhwinter 提交于 12月 27, 2017

* "refine kernel registrar"

* "refine registrar with multikey"

* "fix register"

* "refine multikernel register"

* "fix CI"

* "fix CI"

* "fix registry"

* "swtich GPU to CUDA"

* "add register macro test case"

* "fix CI"

35c1683e

Q
add memory switch mechanism in operator kernel switch (#6991) · 94096ae5
由 QI JUN 提交于 12月 27, 2017
```
* add memory switch mechanism in operator kernel switch
```
94096ae5

26 12月, 2017 2 次提交

Add data transform fn (#6953) · f97f69fe

由 Qiao Longfei 提交于 12月 26, 2017

* init data_transform

* complete DataTransform

* fix build error

* add data_transform_test

* add a register test for data_transform_fn

* use function to simulate registration macro

* add register macro

* update test

* clean code

* restore unrelated code

* update data transform test

* generate unique name for REGISTER_DATA_TRANSFORM_FN

* add const

* follow comment

* update KernelTypePair hash function

f97f69fe

D
"fix threadpool style" (#7017) · 80dafdf5
由 dzhwinter 提交于 12月 26, 2017
```
* "fix threadpool style"

* "remove header"
```
80dafdf5

25 12月, 2017 2 次提交

Implement a simple threadpool (#6684) · 127bc2e0

由 Yancey 提交于 12月 25, 2017

* implement a simple threadpool

* unlock before cv.notify

* add done function

* add lock with GetAvailable function

* delete done_

* using call_once in GetInstance

* update by comment

* update comment

* enhance unit test for multi threads task

127bc2e0

Q

add op_kernel_type_test · 313afc9c
由 qiaolongfei 提交于 12月 25, 2017

313afc9c

24 12月, 2017 1 次提交

Feature/operator run place (#6783) · 735eba29

由 dzhwinter 提交于 12月 24, 2017

* "change operator interface"

* "move devicepool to device_context"

* "fix operator test"

* "fix op_registry Run interface"

* "net op passed. Need to fix nccl multi-Context"

* "add nccl group function"

* "add nccl group function"

* "fix gpu count exceed 32 error"

* "fix recurrent op, nccl op"

* "change the other operators interface with Place"

* "fix typo"

* "fix pybind"

* "fix device in python side"

* "fix pybind failed"

* "add init for test"

* "fix CI"

735eba29

18 12月, 2017 1 次提交

Feature/global context (#6537) · 24fda392

由 dzhwinter 提交于 12月 18, 2017

* "add DeviceContextPool"

* "add devicecontextpool in pybind"

* "add comments in python side "

* "fix static link error"

* "fix CI error"

* "add executor.py"

* "fix CI error"

* "add with gpu macro"

* "remove comment out codes"

* "add TODO items"

* "update init devices"

24fda392

26 11月, 2017 1 次提交

Feature/copytensor (#5455) · 45062fe5

由 dzhwinter 提交于 11月 26, 2017

* "make global tensor function independently"

* "replace functor"

* "fix inline template error"

* "fix tensor array with CopyFrom"

* "fix other case use CopyFrom"

* "move the op interface hardly"

* "fix operators"

* "fix typo"

* "delete dynamic recurrent rnn and fix gru_unit in debugmode"

* "fix unique_ptr copy"

* "fix cuda copy"

* "fix namespace error"

* "removed nccl python test"

* "fix include error"

* "fix typo"

* fix copy util test

45062fe5

15 11月, 2017 1 次提交
- Q
  fix gitignore (#5657) · 5f9f990e
  由 QI JUN 提交于 11月 14, 2017
```
* fix gitignore

* refine cmake file
```
  5f9f990e
04 11月, 2017 1 次提交

Add LoDRankTable (#5349) · 74849158

由 Yu Yang 提交于 11月 03, 2017

* Add LoDRankTable

LoD Rank Table stores the `level` of `lod` which is ordered by sequence
length in descending order. It is useful when implement dynamic RNN and
is shared by dynamic RNN memory, dynamic RNN slice input and dynamic
RNN slice output operators.

* Add InferVarType

74849158

31 10月, 2017 1 次提交
- D
  
  Refine activation function pointer for LSTM operator. · 1c8a0c4b
  由 dangqingqing 提交于 10月 31, 2017
  
  1c8a0c4b
29 10月, 2017 1 次提交
- Y
  Extract InferShape to many cc files (#5174) · 8f6c0a0f
  由 Yu Yang 提交于 10月 28, 2017
```
* Shrink Operator.h

* Fix CI compile
```
  8f6c0a0f
28 10月, 2017 1 次提交
- Y
  Add debug logs in scope, meta_cache and memory (#5170) · 2a5edec0
  由 Yu Yang 提交于 10月 27, 2017
```
* Add debug logs in scope, meta_cache and memory

* Add missing deps
```
  2a5edec0
27 10月, 2017 2 次提交

add sparse support for sum op (#5093) · 7f8574c0

由 QI JUN 提交于 10月 26, 2017

* add sparse support for sum op

* typo fix

* fix gpu build error

* fix unittest error

* typo fix

* infer var type and shape in op_test

* follow comments

* fix build error

* bypass some unittests depend on NetOp

7f8574c0

Gradient check use graph (#5027) · be00b0c4

由 Yu Yang 提交于 10月 26, 2017

* Simplize Gradient Check

* Stash

* Extract apply_backward_pass to backward.py

Rename apply_backward_pass to append_backward_ops

* Use graph API to check gradient

* Fix ci

* Fix CI

* Fix backward for double precision

* Stash

* Fix CI

* Fix ci

* Ignore GRU test

* Ignore xe op

* Fix CI

* Fix softmax with xe gradient

The correct equation should be IG = OG * (d_softmax_with_xe())

* Fix typo

* Fix merge error

* Disable LRN

be00b0c4

26 10月, 2017 1 次提交

Feature/save op (#5090) · efc2464f

由 Yu Yang 提交于 10月 25, 2017

* Init

* Stash

* Polish SaveLoadOp

* Fix CI

* Polish code

* Save GPU Tensor

* Stash

* Fix CI

efc2464f

25 10月, 2017 1 次提交

"Serialize LoDTensor, Save/Restore model" (#4602) · fd2eb550

由 dzhwinter 提交于 10月 24, 2017

* "add model format design doc"

* "add restore function"

* "add parse protobuf"

* "move necessary information to saver.proto"

* "format code"

* "add gpu option"

* "add lod info"

* "add saveop python test wrapper"

* "checkpoint reuse save operator"

* "rewrite model format design doc"

* "async support needed"

* "fix run once"

* "fix doc based on comments"

* "refine based on comments"

* "fix based comments"

* "remove persistable flag from framework.proto"

* "add IndicateDataType to restore op"

* "add save test"

* "modify save restore code"

* "modified the restore logic"

* rm checkpoint_op.cc

* rm test_checkpoint_op.py

* "get inputs outputs name from execution context"

* Saving each variable to a independent file

* Fix bugs

* Rewrite save_restore_op_test with new Python framework

* Move `SaveOp` and `RestoreOp` from OpWithKernel to OpBase

* Refine unit test of SaveOp and RestoreOp

* fix compile errorwq

fd2eb550

23 10月, 2017 1 次提交
- D
  
  Update lstm comments and fix bug. · 64fe9bcc
  由 dangqingqing 提交于 10月 23, 2017
  
  64fe9bcc
21 10月, 2017 1 次提交
- Y
  
  Correct the dependencies (#4978) · e9e0d7d7
  由 Yu Yang 提交于 10月 20, 2017
  
  e9e0d7d7
20 10月, 2017 1 次提交

Feature/py executor test (#4922) · 3db52783

由 Yu Yang 提交于 10月 19, 2017

* Implement FC layer with helper

* Update LayerHelper

* Add debug string for Python ProtoBuf

and Rename `Sync` to `Flush`

* Add check of ProtoBuf initialization

* Layer wrapper for FC

* Fix unittest

* Fix CI

* Add code generator

* AttributeChecker Better error log and speicalize bool

Since lots of types can be cast to bool

* Complete mlp, fit_a_line

* Expose get global scope

* Make global scope not thread-safe

1. It is no need to make global scope thread-safe, since it will be
invoked in Python main thread.
2. Do not free the global scope when C++ exit. Let the OS free memories,
otherwise, we need to handle the destroy dependencies.

See
https://google.github.io/styleguide/cppguide.html#Static_and_Global_Variables

* Fix

* Implementation of simple conv_2d layer

* Stash

* Remove private data members in OpRegister

* Fix bugs

* Stash

* Expose FeedFetchList as VarType

* Change ProgramDesc not a global variable

* Polish code style

* Stash

* Correct implement BlockDesc destructor

* Correct implement BlockDesc destructor

* Unify program as parameter name

* Fix bugs

* Add unittest

* Fix unit test error

* Remove unused functions

* Add clone for Python Program

* Working on executor

* Stash

* Add glog as dependencies of ops

* Use VLOG to logging some information is helpful when we debug Paddle

* Expose VarDesc::persistable to Python

* Test executor

* Complete unittest

* Polish code

* Fix merge error

* Follow comment

* Polish Python Code

3db52783

19 10月, 2017 3 次提交

D

Add missing file. · a461bf13
由 dangqingqing 提交于 10月 19, 2017

a461bf13

Copy Constructor for ProgramDesc (#4895) · 47f773dd

由 Yu Yang 提交于 10月 18, 2017

* Implement FC layer with helper

* Update LayerHelper

* Add debug string for Python ProtoBuf

and Rename `Sync` to `Flush`

* Add check of ProtoBuf initialization

* Layer wrapper for FC

* Fix unittest

* Fix CI

* Add code generator

* AttributeChecker Better error log and speicalize bool

Since lots of types can be cast to bool

* Complete mlp, fit_a_line

* Implementation of simple conv_2d layer

* Fix bugs

* Change ProgramDesc not a global variable

* Polish code style

* Stash

* Correct implement BlockDesc destructor

* Correct implement BlockDesc destructor

* Unify program as parameter name

* Fix bugs

* Add unittest

* Fix unit test error

* Remove unused functions

* Add clone for Python Program

* Compare OpDescBind directly

47f773dd

Add glog as dependencies of ops (#4908) · e9249d16

由 Yu Yang 提交于 10月 18, 2017

* Add glog as dependencies of ops

* Use VLOG to logging some information is helpful when we debug Paddle

* Fix Unittests

e9249d16

18 10月, 2017 2 次提交
- D
  
  LSTM Operator forward implementation. · 2a8dbd13
  由 dangqingqing 提交于 10月 17, 2017
  
  2a8dbd13
- C
  
  Add sequence_project_op (use im2col) · 1e60c9b2
  由 chengduoZH 提交于 10月 11, 2017
  
  1e60c9b2
17 10月, 2017 2 次提交
- Q
  rm cpp executor_test, rewrite in python later (#4849) · b10cd435
  由 Qiao Longfei 提交于 10月 16, 2017
```
* rm cpp executor_test, rewrite in python later

* remove executor_test code in CMakeList.txt
```
  b10cd435
- Y
  
  add compile DEPS · 865c2c8e
  由 Yang Yang 提交于 10月 16, 2017
  
  865c2c8e
15 10月, 2017 1 次提交

create grad_var when run Backward pass (#4796) · d7383c6d

由 Qiao Longfei 提交于 10月 14, 2017

* add target to Backward, generate var in block when call backward

* modify backward_test

* fix executor_test

* set var desc default type to LOD_TENSOR

* update backward_test

* insert loss in the top level of backward

* create grad vars for all blocks in current program

* optimize code

* update test_program.py

* only create var for newly create blocks when backward

d7383c6d

14 10月, 2017 1 次提交
- Y
  
  Complete infer_var_type · 1b1cb44f
  由 Yu Yang 提交于 10月 13, 2017
  
  1b1cb44f
13 10月, 2017 2 次提交
- Q
  
  add selected rows · 4b13c80e
  由 qijun 提交于 10月 12, 2017
  
  4b13c80e
- Y
  
  Wrong dependency order for op_info and proto_desc (#4763) · 4838ea25
  由 Yu Yang 提交于 10月 12, 2017
  
  4838ea25
12 10月, 2017 5 次提交
- Y
  
  prune link fail · 58b8a1ae
  由 Yang Yang 提交于 10月 12, 2017
  
  58b8a1ae
- Y
  
  prune pass dummy test · a31ff363
  由 Yang Yang 提交于 10月 11, 2017
  
  a31ff363
- Q
  
  correct op deps in executor_test · f4b32673
  由 qijun 提交于 10月 11, 2017
  
  f4b32673
- Q
  
  move GLOB_OP_LIB deps to executor_test · 8e7975da
  由 qijun 提交于 10月 11, 2017
  
  8e7975da
- Q
  
  debug executor_test · ccea4c57
  由 qijun 提交于 10月 11, 2017
  
  ccea4c57
11 10月, 2017 2 次提交
- L
  
  pause executor_test temporary in order to pass the teamcity · 1f592eb8
  由 Luo Tao 提交于 10月 11, 2017
  
  1f592eb8
- Y
  
  Fix compile error in linux · 805639b1
  由 Yu Yang 提交于 10月 11, 2017
  
  805639b1
08 10月, 2017 1 次提交
- Y
  
  before backward · 6e7666f1
  由 Yang Yang 提交于 10月 08, 2017
  
  6e7666f1

s920243400 / PaddleDetection 与 Fork 源项目一致

s920243400 / PaddleDetection
与 Fork 源项目一致