- 17 Jan, 2018 1 commit
-
-
Committed by Qiao Longfei
-
- 15 Jan, 2018 2 commits
-
-
Committed by dzhwinter
* add copyright hook
* add copyright hook
* refine copyright hook
* "test copyright hook"
* fix check style
* fix ci
-
Committed by Yancey1989
-
- 14 Jan, 2018 1 commit
-
-
Committed by dzhwinter
* "unified operators"
* "add CUDNN register"
* "add use cudnn attribute"
* "add attribute"
* "test conv tranpose op"
* "remove duplicated attr"
* "fix op test"
* "add attribute to set cudnn"
* "add more log"
* "need layout op register support"
* "add more log"
* "change GetExpectedKernelType"
* "fix Get attr in conv_op"
* "fix CI"
* "fix tests"
* "removed kernel priority fallback"
* "fix CI"
* "fix stack pointer bug"
* "refine buggy interface"
* "add const cast to save life"
* "fix get_output_with_grad"
* "fix op test with dataformat"
* "fix pooling"
* "fix pooling test"
* "fix CI"
* "fix with_gpu error"
* "add transform needed functional check"
* "fix unpack list error"
* "comment out parallel.do temporary"
* "fix CI"
* "fix compile doc error"
* "make threshold larger"
-
- 12 Jan, 2018 2 commits
-
-
Committed by Qiao Longfei
* add GetLoD for debug
* add LoDToString
* optimize if
* typo
* add lod_tensor to operator's dependency
-
Committed by xuwei06
-
- 10 Jan, 2018 8 commits
-
-
Committed by Qiao Longfei
* init data_type_transform
* split data_layout_transform
* tmp rm data_transform_test
* change device_data_transform to data_device_transform
* clean code
* clean code
-
Committed by Yang Yu
glog will not hold ARGV[0] inside.
-
Committed by Yang Yang(Tony)
feature/parallel_gpu
-
Committed by dzhwinter
-
Committed by dzhwinter
-
Committed by dzhwinter
* "init use all default devices"
* "fix init test"
-
Committed by Qiao Longfei
* add lod tensor ToAbsOffset test
* add share lod to topk op and softmax op
-
Committed by xuwei06
Added backward.calc_gradient to backpropagate gradient from given targets to inputs.
-
- 09 Jan, 2018 4 commits
-
-
Committed by Yancey
* test dist word2vec
* multiple trainers work
-
Committed by Yu Yang
* Rename Tensor::CopyFrom to Tensor::Copy
* Fix CI
* Fix compile
-
Committed by Yu Yang
* Remove unused LoDTensor methods
* Update
-
Committed by qiaolongfei
-
- 08 Jan, 2018 9 commits
-
-
Committed by qiaolongfei
-
Committed by qiaolongfei
-
Committed by Yancey
* create tensor in recv op
* static global function to global function
-
Committed by qiaolongfei
-
Committed by Yang Yu
-
Committed by dzhwinter
* "reuse ShareLoD with no regret"
* "removed base class shareLayout"
* "fix CI"
-
Committed by Yang Yu
-
Committed by Qiao Longfei
* add rename guard
* add device_data_transform
* add device_data_transform_test
* modify GetExpectedKernelType
* update operator.run
* support test test_label_semantic_roles
* optimize code
* optimize code
* rename GetActualKernelType to GetExpectedKernelType
* fix chunk_eval_op and device_data_transform_test
* add is_same_place to place
* optimize code, refine rename_guard
* refine rename guard, add GetKernelTypeForVar
* optimize code
* add some log
* rename guard
* use sub scope to create var
* fix compile
* add IsInitialized for Tensor
* add VarIsTensor
* fix op_registry_test
* test
* tmp disable priority
* restore switch_kernel.md
* code clean
-
Committed by emailweixu
This makes it easier to locate errors.
-
- 05 Jan, 2018 4 commits
-
-
Committed by Yang Yu
It will be used for the LoD information in LoDTensor, since LoD is a copy-on-write field. Copying LoD information between operators is quite slow: for ResNet it costs roughly 10% of the total time, including reading data.
-
Committed by dzhwinter
* "add c++ side kernel selection"
* "add multiple kernel op test"
* "kernel selection only support cudnn"
* "better formatter"
* "small fix with UseCPU"
* "depends on change interface Get(Place, Library)"
* "fix CI"
* "fix python cudnn test"
* "leave the register cudnn op to another PR"
* "fix CI"
* "use all kernel by default"
* "fix CI"
-
Committed by Yang Yu
-
Committed by Yancey
* send_recv variable
* delete unused logs
* fix ci failed
* update
* resize tensor before tensor copy
* add selectedrows unit test
* check rows
-
- 04 Jan, 2018 6 commits
- 03 Jan, 2018 3 commits
-
-
Committed by tensor-tang
-
Committed by Luo Tao
-
Committed by QI JUN
* fix bug in SetAttrDescVisitor
* add comments
-