- 10 5月, 2019 1 次提交
-
-
由 qingqing01 提交于
* Add conv2d_grad_grad_op * Extracte the cuDNN conv algo searching code in conv_cudnn_helper.h. - Now use it in conv2d_grad_grad. - Will simply the searching code in conv2d and conv2d_grad in next PR. * Enhance and fix bug in unit testing of gradient_checker. * Support to fetch empty variables,return None in Python.
-
- 21 1月, 2019 1 次提交
-
-
由 Dun 提交于
* mem opt * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * refine code test=develop * refine code test=develop * refine code test=develop * refine code test=develop * refine with cub test=develop * fix mkldnn test && remove comments && test=develop * polish code && test=develop * add only_forward test && test=develop
-
- 04 1月, 2019 1 次提交
-
-
由 xiaolil1 提交于
* Enable basic MKL-DNN INT8 Conv OP test=develop * Modify test case test=develop * Clean unittest code test=develop * Fix test test=develop * Modify test test=develop * Modify basic INT8 Conv test=develop
-
- 02 1月, 2019 1 次提交
-
-
由 Xin Pan 提交于
test=develop
-
- 25 12月, 2018 1 次提交
-
-
由 chengduo 提交于
* refine tensor test=develop * refine tensor test=develop * fix device_context log test=develop
-
- 21 12月, 2018 1 次提交
-
-
由 chengduo 提交于
* Add Temporal Allocator * add Temporay Allocator to DeviceContext test=develop * code refine test=develop * fix mean_iou test=develop * Add DeviceTemporaryAllocator test=develop * fix conv_op bug test=develop * small fix test=develop * code refine test=develop * log refine test=develop * fix unit test test=develop * move double check * refine concat_and_split test=develop * add limit_of_temporary_allocation test=develop * fix name test=develop
-
- 05 12月, 2018 1 次提交
-
-
由 Xin Pan 提交于
test=develop
-
- 19 11月, 2018 1 次提交
-
-
由 qingqing01 提交于
* Convolution fusion operator. * Clean code test=develop
-
- 29 9月, 2018 1 次提交
-
-
由 Dun 提交于
* refine reduce by cub * optimize KernelDepthwiseConvFilterGrad * optimize depthwise conv and reduce mean and reduce sum * fix bug: dilation * cuda arch and cuda 8 compatible
-
- 08 5月, 2018 2 次提交
-
-
由 Yu Yang 提交于
Do not use ctor * Reduce line of codes. * We can use virtual function for Maker now. * The implementation does not care what maker holds, it is easier to refactor later.
-
由 chengduoZH 提交于
-
- 04 5月, 2018 1 次提交
-
-
由 Yu Yang 提交于
-
- 03 5月, 2018 1 次提交
-
-
由 Yu Yang 提交于
-
- 18 4月, 2018 1 次提交
-
-
由 Abhinav Arora 提交于
* Fix conv_op.h * Fix conv_mkldnn_op * Fix cpplint issues in detection_map_op
-
- 28 2月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 16 2月, 2018 2 次提交
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 01 2月, 2018 1 次提交
-
-
由 xzl 提交于
-
- 23 1月, 2018 1 次提交
-
-
由 xzl 提交于
-
- 22 1月, 2018 1 次提交
-
-
由 zlx 提交于
-
- 15 1月, 2018 2 次提交
-
-
由 chengduoZH 提交于
-
由 chengduoZH 提交于
-
- 14 1月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "unified operators" * "add CUDNN register" * "add use cudnn attribute" * "add attribute" * "test conv tranpose op" * "remove duplicated attr" * "fix op test" * "add attribute to set cudnn" * "add more log" * "need layout op register support" * "add more log" * "change GetExpectedKernelType " * "fix Get attr in conv_op" * "fix CI" * "fix tests" * "removed kernel priority fallback" * "fix CI" * "fix stack pointer bug" * "refine buggy interface" * "add const cast to save life" * "fix get_output_with_grad" * "fix op test with dataformat" * ""fix pooling * "fix pooling test" * "fix CI" * "fix with_gpu error" * "add transform needed functional check" * "fix unpack list error" * "comment out parallel.do temporary" * "fix CI" * "fix compile doc error" * "make threshold larger"
-
- 08 1月, 2018 1 次提交
-
-
由 Qiao Longfei 提交于
* add rename guard * add device_data_transform * add device_data_transform_test * modify GetExpectedKernelType * update operator.run * support test test_label_semantic_roles * optimize code * optimize code * rename GetActualKernelType to GetExpectedKernelType * fix chunk_eval_op and device_data_transform_test * add is_same_place to place * optimize code, refine rename_guard * refine rename guard, add GetKernelTypeForVar * optimize code * add some log * rename guard * use sub scope to create var * fix compile * add IsInitialized for Tensor * add VarIsTensor * fix op_registry_test * test * tmp disable priority * restore switch_kernel.md * code clean
-
- 05 1月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "add c++ side kernel selection" * "add multiple kernel op test" * "kernel selection only support cudnn" * "better formatter" * "small fix with UseCPU" * "depends on change interface Get(Place, Library)" * "fix CI" * "fix python cudnn test" * "leave the register cudnn op to another PR" * "fix CI" * "use all kernel by default" * "fix CI"
-
- 20 12月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Move framework.proto to proto namespace * Fix compile * Fix compile * Fix Compile
-
- 12 12月, 2017 1 次提交
-
-
由 QI JUN 提交于
There are mainly following fixes: - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place` - remove `eigen_device` interface in base class `DeviceContext` - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext` - remove unused `platform::EigenDeviceConverter` - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL` - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
-
- 11 12月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 22 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 21 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 15 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 10 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 09 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 08 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 06 11月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 30 10月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-
- 26 10月, 2017 1 次提交
-
-
由 chengduoZH 提交于
-