- 29 9月, 2018 1 次提交
-
-
由 Dun 提交于
* refine reduce by cub * optimize KernelDepthwiseConvFilterGrad * optimize depthwise conv and reduce mean and reduce sum * fix bug: dilation * cuda arch and cuda 8 compatible
-
- 08 5月, 2018 2 次提交
-
-
由 Yu Yang 提交于
Do not use ctor * Reduce line of codes. * We can use virtual function for Maker now. * The implementation does not care what maker holds, it is easier to refactor later.
-
由 chengduoZH 提交于
-
- 04 5月, 2018 1 次提交
-
-
由 Yu Yang 提交于
-
- 03 5月, 2018 1 次提交
-
-
由 Yu Yang 提交于
-
- 18 4月, 2018 1 次提交
-
-
由 Abhinav Arora 提交于
* Fix conv_op.h * Fix conv_mkldnn_op * Fix cpplint issues in detection_map_op
-
- 28 2月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 16 2月, 2018 2 次提交
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 01 2月, 2018 1 次提交
-
-
由 xzl 提交于
-
- 23 1月, 2018 1 次提交
-
-
由 xzl 提交于
-
- 22 1月, 2018 1 次提交
-
-
由 zlx 提交于
-
- 15 1月, 2018 2 次提交
-
-
由 chengduoZH 提交于
-
由 chengduoZH 提交于
-
- 14 1月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "unified operators" * "add CUDNN register" * "add use cudnn attribute" * "add attribute" * "test conv tranpose op" * "remove duplicated attr" * "fix op test" * "add attribute to set cudnn" * "add more log" * "need layout op register support" * "add more log" * "change GetExpectedKernelType " * "fix Get attr in conv_op" * "fix CI" * "fix tests" * "removed kernel priority fallback" * "fix CI" * "fix stack pointer bug" * "refine buggy interface" * "add const cast to save life" * "fix get_output_with_grad" * "fix op test with dataformat" * ""fix pooling * "fix pooling test" * "fix CI" * "fix with_gpu error" * "add transform needed functional check" * "fix unpack list error" * "comment out parallel.do temporary" * "fix CI" * "fix compile doc error" * "make threshold larger"
-
- 08 1月, 2018 1 次提交
-
-
由 Qiao Longfei 提交于
* add rename guard * add device_data_transform * add device_data_transform_test * modify GetExpectedKernelType * update operator.run * support test test_label_semantic_roles * optimize code * optimize code * rename GetActualKernelType to GetExpectedKernelType * fix chunk_eval_op and device_data_transform_test * add is_same_place to place * optimize code, refine rename_guard * refine rename guard, add GetKernelTypeForVar * optimize code * add some log * rename guard * use sub scope to create var * fix compile * add IsInitialized for Tensor * add VarIsTensor * fix op_registry_test * test * tmp disable priority * restore switch_kernel.md * code clean
-
- 05 1月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "add c++ side kernel selection" * "add multiple kernel op test" * "kernel selection only support cudnn" * "better formatter" * "small fix with UseCPU" * "depends on change interface Get(Place, Library)" * "fix CI" * "fix python cudnn test" * "leave the register cudnn op to another PR" * "fix CI" * "use all kernel by default" * "fix CI"
-
- 20 12月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Move framework.proto to proto namespace * Fix compile * Fix compile * Fix Compile
-
- 12 12月, 2017 1 次提交
-
-
由 QI JUN 提交于
There are mainly following fixes: - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place` - remove `eigen_device` interface in base class `DeviceContext` - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext` - remove unused `platform::EigenDeviceConverter` - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL` - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
-
- 11 12月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 22 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 21 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 15 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 10 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 09 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 08 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 06 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 30 10月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 26 10月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 21 10月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 20 10月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Remove template parameter for Tensor methods * Also check the type is correct when data() * Simplize holder_ * Fix accuracy_op * Register Code
-
- 17 10月, 2017 1 次提交
-
-
由 Yu Yang 提交于
They are public now
-
- 12 10月, 2017 1 次提交
-
-
由 武毅 提交于
* add cudnn_conv_op * WIP * update * update * fix grad check * use platform::memory * add support group for cudnn * update * follow comments * fix onlycpu build * update cuda define * follow comments * follow comments * merge with updates * fix compile error * follow comments * follow comments
-
- 28 9月, 2017 1 次提交
-
-
由 Yu Yang 提交于
-
- 21 9月, 2017 3 次提交