- 26 11月, 2018 1 次提交
-
-
由 minqiyang 提交于
test=develop
-
- 16 11月, 2018 1 次提交
-
-
由 Yan Chunwei 提交于
* fix inference on gpu out of mem the transfer logic in operator.cc will keep creating new scopes.
-
- 14 11月, 2018 1 次提交
-
-
由 Yan Chunwei 提交于
-
- 08 11月, 2018 1 次提交
-
-
由 minqiyang 提交于
Fix code to support cpplint syntax check test=develop
-
- 15 10月, 2018 1 次提交
-
-
由 sneaxiy 提交于
-
- 28 9月, 2018 2 次提交
-
-
由 Xin Pan 提交于
scope's API modifies its internal state. And scope's API can be called from multiple threads during traing. Hence, we need locks to protect the scope's internal states. We can optimize it in the future. But the current solution is buggy. test=develop
-
由 Yan Chunwei 提交于
- add naive executor - fix concurrency performance issue
-
- 17 9月, 2018 1 次提交
-
-
由 sneaxiy 提交于
-
- 15 9月, 2018 1 次提交
-
-
由 sneaxiy 提交于
-
- 10 9月, 2018 2 次提交
- 07 6月, 2018 2 次提交
-
-
由 tensor-tang 提交于
-
由 tensor-tang 提交于
-
- 06 6月, 2018 3 次提交
-
-
由 Xin Pan 提交于
-
由 tensor-tang 提交于
-
由 tensor-tang 提交于
-
- 20 4月, 2018 1 次提交
-
-
由 Yu Yang 提交于
-
- 19 4月, 2018 1 次提交
-
-
由 qiaolongfei 提交于
-
- 04 4月, 2018 1 次提交
-
-
由 Yancey1989 提交于
-
- 12 3月, 2018 1 次提交
-
-
由 QI JUN 提交于
* add memopt level * add opt level for image classification demo * clean code * add delete op * clean code * test machine translation demo * clean code * clean code * skip fill constant with force cpu * clean code * clean code * refine code * clean code * fix bug
-
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 11 2月, 2018 1 次提交
-
-
由 Tao Luo 提交于
-
- 10 2月, 2018 2 次提交
- 31 1月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "unify flags" * "fix init"
-
- 22 1月, 2018 1 次提交
-
-
由 QI JUN 提交于
* add memory optimization transpiler demo * add memory benchmark compile option * add gflags instead of macro * refine code
-
- 08 1月, 2018 1 次提交
-
-
由 Qiao Longfei 提交于
* add rename guard * add device_data_transform * add device_data_transform_test * modify GetExpectedKernelType * update operator.run * support test test_label_semantic_roles * optimize code * optimize code * rename GetActualKernelType to GetExpectedKernelType * fix chunk_eval_op and device_data_transform_test * add is_same_place to place * optimize code, refine rename_guard * refine rename guard, add GetKernelTypeForVar * optimize code * add some log * rename guard * use sub scope to create var * fix compile * add IsInitialized for Tensor * add VarIsTensor * fix op_registry_test * test * tmp disable priority * restore switch_kernel.md * code clean
-
- 04 1月, 2018 1 次提交
-
-
由 Yang Yu 提交于
-
- 26 12月, 2017 1 次提交
-
-
由 Yang Yu 提交于
Cannot get var name recursive since they could be same.
-
- 04 12月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Add DataFeeder A v2 API like data feeder for book demos. We can feed data directly from reader. * Fix CI * Add an unittest for while/rnn op forward * Add unittest for raw while op backward * Fix CI
-
- 16 11月, 2017 1 次提交
-
-
由 Yang Yang(Tony) 提交于
* first commit * Python API for while op * Python Unittest for simple while_op forward * fix out to be list * Fix UT * VarType * Fix several bugs * Fix bug * Fix bug * Fix Bug * Fix bug * Fix unittest * Remove debug log * Add comments * add PADDLE_ENFORCE * while_grad_op first commit * Add `BlockDescBind::FindRecursiveOrCreateVar()` and fix bugs * not sure how to setdim of while outputs * push for test * add executor vlog * fix bug of while_op cond * Several enhancement for code 1. Backward always infer shape & infer var type. Since there are RENAME variables will be created when creating backward operator, but their shape & var types are not inferenced. 2. Never use SomePtr-> directly, since every pointer could be nullptr if it is a function return value. Add `detail::Ref` to cast pointer to reference safely. 3. Enhance error message for backward. 4. Infer data type of variable in `sum` and `tensor_write` * Fix bugs of while_op gradient * Fix several bugs of while_op grad * fix fill zeros like * fix 3 >= 3 * fix place holder shouldn't be null * fail on sum op * Fix SumOp of TensorList * clean up * pass while test * fix test_array_write_read * pass sum op * Support int/int64 for fill_constant_batch_size_like * Fix compile
-
- 11 11月, 2017 1 次提交
-
-
由 Yu Yang 提交于
it is useful in gradient phase of an operator with block
-
- 02 11月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Init commit * Make executor use ProgramDescBind * Change Attribute from BlockDesc to BlockDescBind * Since we will get the program desc in RNN, just BlockDesc is not enough. * Add DeviceContext to Executor API * Rewrite RNN * Pass Python * AddBiasOp does not care num_flatten_dims * Stash * Fix MacOS Compile * Pass RNN forward * add python test * refactor test * Make compile pass * add gradopmaker * First draft done * Polish code * add grad op maker and grad infershape * Polish code * Fix backward.cc bug * Fix infershape * Rename function * add backward test * simplify recurrent test * Update * Pass unittest * Add comments & refine test * Add comments * refactor test * Complete Unittest * fix StepScopes enforce * Remove unused unittest * no type error * Update * Make RNN Pass unittest
-
- 28 10月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Add debug logs in scope, meta_cache and memory * Add missing deps
-
- 25 10月, 2017 1 次提交
-
-
由 dzhwinter 提交于
* "add model format design doc" * "add restore function" * "add parse protobuf" * "move necessary information to saver.proto" * "format code" * "add gpu option" * "add lod info" * "add saveop python test wrapper" * "checkpoint reuse save operator" * "rewrite model format design doc" * "async support needed" * "fix run once" * "fix doc based on comments" * "refine based on comments" * "fix based comments" * "remove persistable flag from framework.proto" * "add IndicateDataType to restore op" * "add save test" * "modify save restore code" * "modified the restore logic" * rm checkpoint_op.cc * rm test_checkpoint_op.py * "get inputs outputs name from execution context" * Saving each variable to a independent file * Fix bugs * Rewrite save_restore_op_test with new Python framework * Move `SaveOp` and `RestoreOp` from OpWithKernel to OpBase * Refine unit test of SaveOp and RestoreOp * fix compile errorwq
-
- 20 10月, 2017 2 次提交
- 17 10月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Make global scope not thread-safe 1. It is no need to make global scope thread-safe, since it will be invoked in Python main thread. 2. Do not free the global scope when C++ exit. Let the OS free memories, otherwise, we need to handle the destroy dependencies. See https://google.github.io/styleguide/cppguide.html#Static_and_Global_Variables * Revert "FIX: Release CPU/GPU memory via deleter" This reverts commit 8f80f5bc.
-
- 12 10月, 2017 2 次提交
-
-
由 dongzhihong 提交于
-
由 dongzhihong 提交于
-