- 22 1月, 2018 1 次提交
-
-
由 QI JUN 提交于
* add memory optimization transpiler demo * add memory benchmark compile option * add gflags instead of macro * refine code
-
- 21 1月, 2018 2 次提交
-
-
由 Qiao Longfei 提交于
* init complete data layout transform * can compile * test passed * optimize code * fix while_grad_op first step loss lod problem * optimize in out ptr for transform * add check * update copyright * clean code * add NeedTransformLayout * add comment * change the interface of data_type_transform * init data_type_transform_test * complete data_type_transform_test * add TransDataType to data_transform
-
由 dzhwinter 提交于
* "fix decode bug" * "follow commnet" * "fix error" * "fix hook bug" * fix based comment * fix copyright * fix based on comment
-
- 19 1月, 2018 2 次提交
-
-
由 Qiao Longfei 提交于
* check if kernel if found for kernel type * do kernel check before data transform
-
由 Qiao Longfei 提交于
* add data layout transform and optimize the implementation of data_transform
-
- 18 1月, 2018 4 次提交
-
-
由 Yang Yang 提交于
-
由 Yang Yang 提交于
-
由 Yu Yang 提交于
* Fix ParallelDo not support empty input gradient * Polish ParallelDo and fix several bugs * Fix CI * Fix CI
-
由 Qiao Longfei 提交于
-
- 17 1月, 2018 2 次提交
-
-
由 Yan Chunwei 提交于
-
由 Qiao Longfei 提交于
-
- 15 1月, 2018 2 次提交
-
-
由 dzhwinter 提交于
* add copyright hook * add copyright hook * refine copyright hook * "test copyright hook" * fix check style * fix ci
-
由 Yancey1989 提交于
-
- 14 1月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "unified operators" * "add CUDNN register" * "add use cudnn attribute" * "add attribute" * "test conv tranpose op" * "remove duplicated attr" * "fix op test" * "add attribute to set cudnn" * "add more log" * "need layout op register support" * "add more log" * "change GetExpectedKernelType " * "fix Get attr in conv_op" * "fix CI" * "fix tests" * "removed kernel priority fallback" * "fix CI" * "fix stack pointer bug" * "refine buggy interface" * "add const cast to save life" * "fix get_output_with_grad" * "fix op test with dataformat" * ""fix pooling * "fix pooling test" * "fix CI" * "fix with_gpu error" * "add transform needed functional check" * "fix unpack list error" * "comment out parallel.do temporary" * "fix CI" * "fix compile doc error" * "make threshold larger"
-
- 12 1月, 2018 3 次提交
-
-
由 Qiao Longfei 提交于
* add GetLoD for debug * add LoDToString * optimize if * typo * add lod_tensor to operator's dependency
-
由 Yang Yang 提交于
-
由 xuwei06 提交于
-
- 11 1月, 2018 1 次提交
-
-
由 Yang Yang 提交于
-
- 10 1月, 2018 8 次提交
-
-
由 Qiao Longfei 提交于
* init data_type_transform * split data_layout_transform * tmp rm data_transform_test * change device_data_transform to data_device_transform * clean code * clean code
-
由 Yang Yu 提交于
glog will not hold ARGV[0] inside.
-
由 Yang Yang(Tony) 提交于
feature/parallel_gpu
-
由 dzhwinter 提交于
-
由 dzhwinter 提交于
-
由 dzhwinter 提交于
* "init use all default devices" * "fix init test"
-
由 Qiao Longfei 提交于
* add lod tensor ToAbsOffset test * add share lod to topk op and softmax op
-
由 xuwei06 提交于
Added backward.calc_gradient to backpropagate gradient from given targets to inputs.
-
- 09 1月, 2018 4 次提交
-
-
由 Yancey 提交于
* test dist word2vec * multiple trainers work
-
由 Yu Yang 提交于
* Rename Tensor::CopyFrom to Tensor::Copy * Fix CI * Fix compile
-
由 Yu Yang 提交于
* Remove unused LoDTensor methods * Update
-
由 qiaolongfei 提交于
-
- 08 1月, 2018 9 次提交
-
-
由 qiaolongfei 提交于
-
由 qiaolongfei 提交于
-
由 Yancey 提交于
* create tensor in recv op * static global function to global function
-
由 qiaolongfei 提交于
-
由 Yang Yu 提交于
-
由 dzhwinter 提交于
* "reuse ShareLoD with no regret" * "removed base class shareLayout" * "fix CI"
-
由 Yang Yu 提交于
-
由 Qiao Longfei 提交于
* add rename guard * add device_data_transform * add device_data_transform_test * modify GetExpectedKernelType * update operator.run * support test test_label_semantic_roles * optimize code * optimize code * rename GetActualKernelType to GetExpectedKernelType * fix chunk_eval_op and device_data_transform_test * add is_same_place to place * optimize code, refine rename_guard * refine rename guard, add GetKernelTypeForVar * optimize code * add some log * rename guard * use sub scope to create var * fix compile * add IsInitialized for Tensor * add VarIsTensor * fix op_registry_test * test * tmp disable priority * restore switch_kernel.md * code clean
-
由 emailweixu 提交于
This can make it easier to locate error.
-
- 05 1月, 2018 1 次提交
-
-
由 Yang Yu 提交于
It will be used for LoD information in LoDTensor since LoD is a copy on write field. It is pretty slow for copying LoD information between operators. For resnet it will cost roughly 10% time of whole time, including reading data.
-