提交 · 0c45eab7fffd94169ddda0d61e0613524dbcd8e6 · Crayon鑫 / Paddle

11 2月, 2018 1 次提交
- Y
  
  no getmutable nccl_com · 0c45eab7
  由 Yang Yang 提交于 2月 11, 2018
  
  0c45eab7
10 2月, 2018 3 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
- Y
  
  add nccl · 672cdc21
  由 Yang Yang 提交于 2月 09, 2018
  
  672cdc21
06 2月, 2018 1 次提交
- F
  
  refine code and add unit tests · 0bb9c80e
  由 fengjiayi 提交于 2月 06, 2018
  
  0bb9c80e
31 1月, 2018 1 次提交
- D
  "unify flags" (#7973) · 80eff266
  由 dzhwinter 提交于 1月 31, 2018
```
* "unify flags"

* "fix init"
```
  80eff266
30 1月, 2018 1 次提交
- L
  
  Unify the definition of kFeedOpType and kFetchOpType. · f5d93368
  由 Liu Yiqun 提交于 1月 30, 2018
  
  f5d93368
27 1月, 2018 1 次提交
- K
  
  address comments (#7900) · 9b6387e7
  由 kexinzhao 提交于 1月 26, 2018
  
  9b6387e7
26 1月, 2018 1 次提交

New Run() method for framework::Executor (#7807) · 788f5c6d

由 kexinzhao 提交于 1月 25, 2018

* initial commit

* add new executor run function

* fix bug

* fix multiple definition of feed_fetch_method issue

* fix cmake

* fix tensor copy error

* refine executor code

* add comments

* temporary modification

* address comments

* fix bug

788f5c6d

23 1月, 2018 2 次提交
- Q
  Nmt model (#7340) · e7d44a20
  由 Qiao Longfei 提交于 1月 23, 2018
```
neural machine translation model support beam search with while op
```
  e7d44a20
- D
  
  Refine profiler code. · 0358fd01
  由 dangqingqing 提交于 1月 23, 2018
  
  0358fd01
22 1月, 2018 1 次提交

add memory optimization transpiler demo (#7443) · a6da470b

由 QI JUN 提交于 1月 22, 2018

* add memory optimization transpiler demo

* add memory benchmark compile option

* add gflags instead of macro

* refine code

a6da470b

16 1月, 2018 1 次提交
- D
  
  Refine profiler and expose to Python. · d2a70243
  由 dangqingqing 提交于 1月 16, 2018
  
  d2a70243
09 1月, 2018 1 次提交
- Y
  Test dist word2vec (#7334) · e249ad12
  由 Yancey 提交于 1月 09, 2018
```
* test dist word2vec

* multiple trainers work
```
  e249ad12
08 1月, 2018 3 次提交
- Y
  Create tensor in recv op (#7286) · aa75f1e2
  由 Yancey 提交于 1月 08, 2018
```
* create tensor in recv op

* static global function to global function
```
  aa75f1e2
- Y
  
  Refine get_places · 63ff0b4b
  由 Yang Yu 提交于 1月 08, 2018
  
  63ff0b4b
- E
  Show argument dimensions with operator::DebugStringEx (#7268) · 8814bec0
  由 emailweixu 提交于 1月 07, 2018
```
This can make it easier to locate error.
```
  8814bec0
29 12月, 2017 2 次提交
- Y
  
  Remove debug codes · d25f382d
  由 Yang Yu 提交于 12月 29, 2017
  
  d25f382d
- Y
  
  Follow comments · 5139e6c7
  由 Yang Yu 提交于 12月 29, 2017
  
  5139e6c7
28 12月, 2017 1 次提交
- Y
  
  Refine · f5c2d175
  由 Yang Yu 提交于 12月 28, 2017
  
  f5c2d175
27 12月, 2017 2 次提交
- G
  Fix bugs (#7060) · 5347c8d7
  由 gongweibao 提交于 12月 27, 2017
```
* fix bugs
```
  5347c8d7
- Y
  
  Executor check nan · 3ae781eb
  由 Yang Yu 提交于 12月 27, 2017
  
  3ae781eb
26 12月, 2017 2 次提交
- Y
  
  Update code · 82a22d32
  由 Yang Yu 提交于 12月 26, 2017
  
  82a22d32
- Y
  
  Stash · 938717ba
  由 Yang Yu 提交于 12月 26, 2017
  
  938717ba
24 12月, 2017 1 次提交

Feature/operator run place (#6783) · 735eba29

由 dzhwinter 提交于 12月 24, 2017

* "change operator interface"

* "move devicepool to device_context"

* "fix operator test"

* "fix op_registry Run interface"

* "net op passed. Need to fix nccl multi-Context"

* "add nccl group function"

* "add nccl group function"

* "fix gpu count exceed 32 error"

* "fix recurrent op, nccl op"

* "change the other operators interface with Place"

* "fix typo"

* "fix pybind"

* "fix device in python side"

* "fix pybind failed"

* "add init for test"

* "fix CI"

735eba29

21 12月, 2017 1 次提交
- Y
  Rename XXDescBind --> XXDesc (#6797) · 09189732
  由 Yu Yang 提交于 12月 21, 2017
```
* Rename XXDescBind --> XXDesc

* Fix Compile
```
  09189732
20 12月, 2017 1 次提交
- Y
  Move framework.proto to proto namespace (#6718) · e445b3ff
  由 Yu Yang 提交于 12月 20, 2017
```
* Move framework.proto to proto namespace

* Fix compile

* Fix compile

* Fix Compile
```
  e445b3ff
18 12月, 2017 1 次提交

Feature/global context (#6537) · 24fda392

由 dzhwinter 提交于 12月 18, 2017

* "add DeviceContextPool"

* "add devicecontextpool in pybind"

* "add comments in python side "

* "fix static link error"

* "fix CI error"

* "add executor.py"

* "fix CI error"

* "add with gpu macro"

* "remove comment out codes"

* "add TODO items"

* "update init devices"

24fda392

13 12月, 2017 1 次提交
- T
  
  wip: should fix variable recreate · 9508c726
  由 typhoonzero 提交于 12月 13, 2017
  
  9508c726
04 12月, 2017 1 次提交

While op forward for sentimental analysis (#6140) · d5e32794

由 Yu Yang 提交于 12月 04, 2017

* Add DataFeeder

A v2 API like data feeder for book demos.
We can feed data directly from reader.

* Fix CI

* Add an unittest for while/rnn op forward

* Add unittest for raw while op backward

* Fix CI

d5e32794

24 11月, 2017 1 次提交

support testing when training and handle dropout and batch_norm operator in testing mode (#5734) · 3a76062c

由 QI JUN 提交于 11月 24, 2017

* is_training to is_test in dropout op

* handle dropout and batch_norm operator when prune pdesc in testing mode

* handle dropout and batch_norm operator when prune pdesc in testing mode

* add get_inference_program method

* fix dropout op

* fix ci

* test data after each batch training

* refine code

* refine test_book3

* fix ci

* follow comments

3a76062c

16 11月, 2017 1 次提交

feature/while_grad_op (#5554) · 18f0c40a

由 Yang Yang(Tony) 提交于 11月 16, 2017

* first commit

* Python API for while op

* Python Unittest for simple while_op forward

* fix out to be list

* Fix UT

* VarType

* Fix several bugs

* Fix bug

* Fix bug

* Fix Bug

* Fix bug

* Fix unittest

* Remove debug log

* Add comments

* add PADDLE_ENFORCE

* while_grad_op first commit

* Add `BlockDescBind::FindRecursiveOrCreateVar()` and fix bugs

* not sure how to setdim of while outputs

* push for test

* add executor vlog

* fix bug of while_op cond

* Several enhancement for code

1. Backward always infer shape & infer var type. Since there are RENAME
variables will be created when creating backward operator, but their
shape & var types are not inferenced.
2. Never use SomePtr-> directly, since every pointer could be nullptr if
it is a function return value. Add `detail::Ref` to cast pointer to
reference safely.
3. Enhance error message for backward.
4. Infer data type of variable in `sum` and `tensor_write`

* Fix bugs of while_op gradient

* Fix several bugs of while_op grad

* fix fill zeros like

* fix 3 >= 3

* fix place holder shouldn't be null

* fail on sum op

* Fix SumOp of TensorList

* clean up

* pass while test

* fix test_array_write_read

* pass sum op

* Support int/int64 for fill_constant_batch_size_like

* Fix compile

18f0c40a

06 11月, 2017 1 次提交

Feature/lod tensor array (#5379) · 2be4c3cb

由 Yu Yang 提交于 11月 05, 2017

* Use stable_sort in lod_rank_table

It is easy to debug and test when use `stable_sort`and the time
complexity is not changed.

* Add LoDTensorArray

2be4c3cb

04 11月, 2017 1 次提交

Add LoDRankTable (#5349) · 74849158

由 Yu Yang 提交于 11月 03, 2017

* Add LoDRankTable

LoD Rank Table stores the `level` of `lod` which is ordered by sequence
length in descending order. It is useful when implement dynamic RNN and
is shared by dynamic RNN memory, dynamic RNN slice input and dynamic
RNN slice output operators.

* Add InferVarType

74849158

03 11月, 2017 1 次提交
- Y
  
  Fix comparing between signed and unsigned values (#5328) · 1ed5ae7a
  由 Yi Wang 提交于 11月 02, 2017
  
  1ed5ae7a
02 11月, 2017 1 次提交

Rewrite StaticRNN with Executor (#5224) · 0a32e74d

由 Yu Yang 提交于 11月 01, 2017

* Init commit

* Make executor use ProgramDescBind

* Change Attribute from BlockDesc to BlockDescBind

* Since we will get the program desc in RNN, just BlockDesc is not
  enough.

* Add DeviceContext to Executor API

* Rewrite RNN

* Pass Python

* AddBiasOp does not care num_flatten_dims

* Stash

* Fix MacOS Compile

* Pass RNN forward

* add python test

* refactor test

* Make compile pass

* add gradopmaker

* First draft done

* Polish code

* add grad op maker and grad infershape

* Polish code

* Fix backward.cc bug

* Fix infershape

* Rename function

* add backward test

* simplify recurrent test

* Update

* Pass unittest

* Add comments & refine test

* Add comments

* refactor test

* Complete Unittest

* fix StepScopes enforce

* Remove unused unittest

* no type error

* Update

* Make RNN Pass unittest

0a32e74d

01 11月, 2017 1 次提交

Feature/executor use program bind (#5196) · 1363ddb6

由 Yu Yang 提交于 10月 31, 2017

* Init commit

* Make executor use ProgramDescBind

* Change Attribute from BlockDesc to BlockDescBind

* Since we will get the program desc in RNN, just BlockDesc is not
  enough.

1363ddb6

27 10月, 2017 1 次提交

add sparse support for sum op (#5093) · 7f8574c0

由 QI JUN 提交于 10月 26, 2017

* add sparse support for sum op

* typo fix

* fix gpu build error

* fix unittest error

* typo fix

* infer var type and shape in op_test

* follow comments

* fix build error

* bypass some unittests depend on NetOp

7f8574c0

20 10月, 2017 2 次提交

Y
Feature/free kid scope (#4951) · af4dac4a
由 Yu Yang 提交于 10月 19, 2017
```
* Delete kid

* Delete local scope
```
af4dac4a

Feature/py executor test (#4922) · 3db52783

由 Yu Yang 提交于 10月 19, 2017

* Implement FC layer with helper

* Update LayerHelper

* Add debug string for Python ProtoBuf

and Rename `Sync` to `Flush`

* Add check of ProtoBuf initialization

* Layer wrapper for FC

* Fix unittest

* Fix CI

* Add code generator

* AttributeChecker Better error log and speicalize bool

Since lots of types can be cast to bool

* Complete mlp, fit_a_line

* Expose get global scope

* Make global scope not thread-safe

1. It is no need to make global scope thread-safe, since it will be
invoked in Python main thread.
2. Do not free the global scope when C++ exit. Let the OS free memories,
otherwise, we need to handle the destroy dependencies.

See
https://google.github.io/styleguide/cppguide.html#Static_and_Global_Variables

* Fix

* Implementation of simple conv_2d layer

* Stash

* Remove private data members in OpRegister

* Fix bugs

* Stash

* Expose FeedFetchList as VarType

* Change ProgramDesc not a global variable

* Polish code style

* Stash

* Correct implement BlockDesc destructor

* Correct implement BlockDesc destructor

* Unify program as parameter name

* Fix bugs

* Add unittest

* Fix unit test error

* Remove unused functions

* Add clone for Python Program

* Working on executor

* Stash

* Add glog as dependencies of ops

* Use VLOG to logging some information is helpful when we debug Paddle

* Expose VarDesc::persistable to Python

* Test executor

* Complete unittest

* Polish code

* Fix merge error

* Follow comment

* Polish Python Code

3db52783

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致