- 12 1月, 2018 1 次提交
-
-
由 Yang Yang 提交于
-
- 11 1月, 2018 1 次提交
-
-
由 Yang Yang 提交于
-
- 10 1月, 2018 5 次提交
-
-
由 Qiao Longfei 提交于
* init data_type_transform * split data_layout_transform * tmp rm data_transform_test * change device_data_transform to data_device_transform * clean code * clean code
-
由 Yang Yang(Tony) 提交于
feature/parallel_gpu
-
由 dzhwinter 提交于
* "init use all default devices" * "fix init test"
-
由 Qiao Longfei 提交于
* add lod tensor ToAbsOffset test * add share lod to topk op and softmax op
-
由 xuwei06 提交于
Added backward.calc_gradient to backpropagate gradient from given targets to inputs.
-
- 09 1月, 2018 4 次提交
-
-
由 Yancey 提交于
* test dist word2vec * multiple trainers work
-
由 Yu Yang 提交于
* Rename Tensor::CopyFrom to Tensor::Copy * Fix CI * Fix compile
-
由 Yu Yang 提交于
* Remove unused LoDTensor methods * Update
-
由 qiaolongfei 提交于
-
- 08 1月, 2018 9 次提交
-
-
由 qiaolongfei 提交于
-
由 qiaolongfei 提交于
-
由 Yancey 提交于
* create tensor in recv op * static global function to global function
-
由 qiaolongfei 提交于
-
由 Yang Yu 提交于
-
由 dzhwinter 提交于
* "reuse ShareLoD with no regret" * "removed base class shareLayout" * "fix CI"
-
由 Yang Yu 提交于
-
由 Qiao Longfei 提交于
* add rename guard * add device_data_transform * add device_data_transform_test * modify GetExpectedKernelType * update operator.run * support test test_label_semantic_roles * optimize code * optimize code * rename GetActualKernelType to GetExpectedKernelType * fix chunk_eval_op and device_data_transform_test * add is_same_place to place * optimize code, refine rename_guard * refine rename guard, add GetKernelTypeForVar * optimize code * add some log * rename guard * use sub scope to create var * fix compile * add IsInitialized for Tensor * add VarIsTensor * fix op_registry_test * test * tmp disable priority * restore switch_kernel.md * code clean
-
由 emailweixu 提交于
This can make it easier to locate error.
-
- 05 1月, 2018 4 次提交
-
-
由 Yang Yu 提交于
It will be used for LoD information in LoDTensor since LoD is a copy on write field. It is pretty slow for copying LoD information between operators. For resnet it will cost roughly 10% time of whole time, including reading data.
-
由 dzhwinter 提交于
* "add c++ side kernel selection" * "add multiple kernel op test" * "kernel selection only support cudnn" * "better formatter" * "small fix with UseCPU" * "depends on change interface Get(Place, Library)" * "fix CI" * "fix python cudnn test" * "leave the register cudnn op to another PR" * "fix CI" * "use all kernel by default" * "fix CI"
-
由 Yang Yu 提交于
-
由 Yancey 提交于
* send_recv variable * delete unused logs * fix ci failed * update * resize tensor before tensor copy * add selectedrows unit test * check rows
-
- 04 1月, 2018 6 次提交
- 03 1月, 2018 7 次提交
- 02 1月, 2018 3 次提交