提交 · 36ec3e904243e9c9ddae09c99e11353a0b4d29f4 · PaddlePaddle / Paddle

26 12月, 2017 1 次提交
- L
  
  unify the indentation of license · 761b3297
  由 Luo Tao 提交于 12月 26, 2017
  
  761b3297
25 12月, 2017 3 次提交

由 dzhwinter 提交于 12月 25, 2017

* "add data layout"

* "need kernel registry support"

* "fix data layout"

* "reorder include headers"

* "change enum to enum class"

* "fix CI"

7777c811

Impl kernel hint (#6883) · af0c4c45

由 Qiao Longfei 提交于 12月 25, 2017

* init kernel hint

* fix typo

* rm unused code

* add include in op_kernel.h

* restore op_kernel since it will be moved to op_kernel_type

* change force_cpu to use_cpu

* fix compilation

af0c4c45

D

GPUPlace to CUDAPlace (#6960) · 0d2235aa
由 dzhwinter 提交于 12月 25, 2017

0d2235aa

24 12月, 2017 1 次提交

Feature/operator run place (#6783) · 735eba29

由 dzhwinter 提交于 12月 24, 2017

* "change operator interface"

* "move devicepool to device_context"

* "fix operator test"

* "fix op_registry Run interface"

* "net op passed. Need to fix nccl multi-Context"

* "add nccl group function"

* "add nccl group function"

* "fix gpu count exceed 32 error"

* "fix recurrent op, nccl op"

* "change the other operators interface with Place"

* "fix typo"

* "fix pybind"

* "fix device in python side"

* "fix pybind failed"

* "add init for test"

* "fix CI"

735eba29

22 12月, 2017 3 次提交
- F
  
  Pass test_dyn_rnn.py · edba405d
  由 fengjiayi 提交于 12月 22, 2017
  
  edba405d
- F
  
  update pybind · dcc51da4
  由 fengjiayi 提交于 12月 22, 2017
  
  dcc51da4
- D
  "remove GPU Sync Interface" (#6793) · abde3130
  由 dzhwinter 提交于 12月 22, 2017
```
* "remove GPU Sync Interface"

* "fix typo"

* "fix type cast error"

* "fix related Copy with stream"

* "fix failed tests with DevicePool"

* "fix stupid removed position error"
```
  abde3130
21 12月, 2017 4 次提交
- T
  
  fix compile when merge · 5913e735
  由 typhoonzero 提交于 12月 21, 2017
  
  5913e735
- T
  
  fix delete ops · 4658f950
  由 typhoonzero 提交于 12月 21, 2017
  
  4658f950
- F
  
  Add the simple support of no_grad_set · 1a0fc5d8
  由 fengjiayi 提交于 12月 21, 2017
  
  1a0fc5d8
- Y
  Rename XXDescBind --> XXDesc (#6797) · 09189732
  由 Yu Yang 提交于 12月 21, 2017
```
* Rename XXDescBind --> XXDesc

* Fix Compile
```
  09189732
20 12月, 2017 3 次提交
- F
  
  Compelete basic framework · 278ac7be
  由 fengjiayi 提交于 12月 20, 2017
  
  278ac7be
- F
  
  update · 61a7df2e
  由 fengjiayi 提交于 12月 20, 2017
  
  61a7df2e
- Y
  Move framework.proto to proto namespace (#6718) · e445b3ff
  由 Yu Yang 提交于 12月 20, 2017
```
* Move framework.proto to proto namespace

* Fix compile

* Fix compile

* Fix Compile
```
  e445b3ff
19 12月, 2017 3 次提交
- F
  
  update · 590e6111
  由 fengjiayi 提交于 12月 19, 2017
  
  590e6111
- Q
  
  export const value to python · 5c530ea8
  由 qiaolongfei 提交于 12月 19, 2017
  
  5c530ea8
- F
  
  update · 6bb4a6fd
  由 fengjiayi 提交于 12月 19, 2017
  
  6bb4a6fd
18 12月, 2017 2 次提交

Feature/global context (#6537) · 24fda392

由 dzhwinter 提交于 12月 18, 2017

* "add DeviceContextPool"

* "add devicecontextpool in pybind"

* "add comments in python side "

* "fix static link error"

* "fix CI error"

* "add executor.py"

* "fix CI error"

* "add with gpu macro"

* "remove comment out codes"

* "add TODO items"

* "update init devices"

24fda392

F

update · b3ea677a
由 fengjiayi 提交于 12月 18, 2017

b3ea677a

14 12月, 2017 2 次提交
- F
  
  expose GradOpMaker to Python · 044a13d0
  由 fengjiayi 提交于 12月 14, 2017
  
  044a13d0
- F
  
  update · e11a561c
  由 fengjiayi 提交于 12月 14, 2017
  
  e11a561c
12 12月, 2017 1 次提交
- T
  
  wip need ut · b4cd7f3d
  由 typhoonzero 提交于 12月 12, 2017
  
  b4cd7f3d
30 11月, 2017 1 次提交
- L
  
  add WITH_DOC for print_operators_doc · 4c95301e
  由 Luo Tao 提交于 11月 30, 2017
  
  4c95301e
27 11月, 2017 1 次提交
- D
  
  Add cuda profiler tools. · 6cf2dcbc
  由 dangqingqing 提交于 11月 27, 2017
  
  6cf2dcbc
26 11月, 2017 1 次提交

Feature/copytensor (#5455) · 45062fe5

由 dzhwinter 提交于 11月 26, 2017

* "make global tensor function independently"

* "replace functor"

* "fix inline template error"

* "fix tensor array with CopyFrom"

* "fix other case use CopyFrom"

* "move the op interface hardly"

* "fix operators"

* "fix typo"

* "delete dynamic recurrent rnn and fix gru_unit in debugmode"

* "fix unique_ptr copy"

* "fix cuda copy"

* "fix namespace error"

* "removed nccl python test"

* "fix include error"

* "fix typo"

* fix copy util test

45062fe5

24 11月, 2017 2 次提交

support testing when training and handle dropout and batch_norm operator in testing mode (#5734) · 3a76062c

由 QI JUN 提交于 11月 24, 2017

* is_training to is_test in dropout op

* handle dropout and batch_norm operator when prune pdesc in testing mode

* handle dropout and batch_norm operator when prune pdesc in testing mode

* add get_inference_program method

* fix dropout op

* fix ci

* test data after each batch training

* refine code

* refine test_book3

* fix ci

* follow comments

3a76062c

Unify dtype and datatype (#5869) · 50d670ee

由 fengjiayi 提交于 11月 24, 2017

* Change all `data_type` in Python to `dtype`

* Change `date_type` in C++ to `dtype`

* Refine

50d670ee

14 11月, 2017 1 次提交
- Q
  
  fix lod_tensor_array (#5625) · b6c262e1
  由 Qiao Longfei 提交于 11月 14, 2017
  
  b6c262e1
08 11月, 2017 1 次提交
- Y
  Compare Operator (#5325) · f74fb790
  由 Yu Yang 提交于 11月 07, 2017
```
* Compare Operator

* Follow comments
```
  f74fb790
07 11月, 2017 1 次提交

ReadFromArray/WriteToArray op (#5407) · c9b57dcc

由 Yu Yang 提交于 11月 06, 2017

* Use stable_sort in lod_rank_table

It is easy to debug and test when use `stable_sort`and the time
complexity is not changed.

* Add LoDTensorArray

* Stash

* Better debug message for IsInitialized

* Stash

* Better debug message for IsInitialized

* Complete array read/write op unittests

c9b57dcc

06 11月, 2017 1 次提交

Feature/lod tensor array (#5379) · 2be4c3cb

由 Yu Yang 提交于 11月 05, 2017

* Use stable_sort in lod_rank_table

It is easy to debug and test when use `stable_sort`and the time
complexity is not changed.

* Add LoDTensorArray

2be4c3cb

04 11月, 2017 1 次提交

Add LoDRankTable (#5349) · 74849158

由 Yu Yang 提交于 11月 03, 2017

* Add LoDRankTable

LoD Rank Table stores the `level` of `lod` which is ordered by sequence
length in descending order. It is useful when implement dynamic RNN and
is shared by dynamic RNN memory, dynamic RNN slice input and dynamic
RNN slice output operators.

* Add InferVarType

74849158

02 11月, 2017 1 次提交

Rewrite StaticRNN with Executor (#5224) · 0a32e74d

由 Yu Yang 提交于 11月 01, 2017

* Init commit

* Make executor use ProgramDescBind

* Change Attribute from BlockDesc to BlockDescBind

* Since we will get the program desc in RNN, just BlockDesc is not
  enough.

* Add DeviceContext to Executor API

* Rewrite RNN

* Pass Python

* AddBiasOp does not care num_flatten_dims

* Stash

* Fix MacOS Compile

* Pass RNN forward

* add python test

* refactor test

* Make compile pass

* add gradopmaker

* First draft done

* Polish code

* add grad op maker and grad infershape

* Polish code

* Fix backward.cc bug

* Fix infershape

* Rename function

* add backward test

* simplify recurrent test

* Update

* Pass unittest

* Add comments & refine test

* Add comments

* refactor test

* Complete Unittest

* fix StepScopes enforce

* Remove unused unittest

* no type error

* Update

* Make RNN Pass unittest

0a32e74d

01 11月, 2017 1 次提交

Feature/executor use program bind (#5196) · 1363ddb6

由 Yu Yang 提交于 10月 31, 2017

* Init commit

* Make executor use ProgramDescBind

* Change Attribute from BlockDesc to BlockDescBind

* Since we will get the program desc in RNN, just BlockDesc is not
  enough.

1363ddb6

31 10月, 2017 2 次提交
- Q
  improve unique_name, uniq id is related to prefix (#5223) · a128eb7b
  由 Qiao Longfei 提交于 10月 31, 2017
```
* improve unique_name, uniq id is related to prefix

* fix join
```
  a128eb7b
- Q
  add init_gflags interface (#5193) · a186b53d
  由 QI JUN 提交于 10月 30, 2017
```
* add init_gflags interface

* refine code

* follow comments
```
  a186b53d
29 10月, 2017 1 次提交

support sparse output for lookup table grad op (#5145) · 008f40ce

由 QI JUN 提交于 10月 28, 2017

* add sparse support for sum op

* typo fix

* fix gpu build error

* fix unittest error

* typo fix

* infer var type and shape in op_test

* follow comments

* fix build error

* bypass some unittests depend on NetOp

* support sparse output for lookup table grad op

* refine codes

* fix gpu build error

* fix lookup table grad gpu kernel

* fix ci

* fix ci

* fix ci

* fix bug in lookup_table_grad op

* fix bug in test_word2vec

* register double kernel for some operators

* set is_sparse=True in test_word2vec

* fix lookup table grad op CUDA kernel bug

* disable test_modified_huber_loss_op temporarily

* disable test_lstm_unit_op temporarily

008f40ce

28 10月, 2017 1 次提交

Python API for inference model saving/load (#5020) · 6783dcee

由 fengjiayi 提交于 10月 27, 2017

* Add `dump_to_file()` for ProgrameDescBind in pybind

* Update

* Add utility.py

* typo

* Fix bugs

* Move add_feed/fetch_components to untility.py

* Compelete dump

* Follow comments

* Change output of Prune() from inference to pointer

* Expose Prune() to Python

* Compelete save/load API of inference model

* Fix errors

* Debuging

* Compelete unit tests

* follow comments

6783dcee

27 10月, 2017 1 次提交
- D
  
  rerun ci · 37842d80
  由 Dong Zhihong 提交于 10月 26, 2017
  
  37842d80

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功