- 20 8月, 2018 1 次提交
-
-
由 Yihua Xu 提交于
* Optimize CRF decoding with AVX/AVX2 instruction * Enable the AVX2 flags for compiling * Clean the code and decrease the count of multiply calculation * Add the support of AVX512 instruction to optimize CRF Decoding * Clean the code * Enable the AVX512f flags for compiling * Clean the code for the invaluable switch * Fixed the issue to check AVX512F status * Clean the code * Add some explanation of the key points
-
- 11 4月, 2018 1 次提交
-
-
由 Siddharth Goyal 提交于
-
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 08 1月, 2018 1 次提交
-
-
由 Qiao Longfei 提交于
* add rename guard * add device_data_transform * add device_data_transform_test * modify GetExpectedKernelType * update operator.run * support test test_label_semantic_roles * optimize code * optimize code * rename GetActualKernelType to GetExpectedKernelType * fix chunk_eval_op and device_data_transform_test * add is_same_place to place * optimize code, refine rename_guard * refine rename guard, add GetKernelTypeForVar * optimize code * add some log * rename guard * use sub scope to create var * fix compile * add IsInitialized for Tensor * add VarIsTensor * fix op_registry_test * test * tmp disable priority * restore switch_kernel.md * code clean
-
- 12 12月, 2017 1 次提交
-
-
由 QI JUN 提交于
There are mainly following fixes: - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place` - remove `eigen_device` interface in base class `DeviceContext` - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext` - remove unused `platform::EigenDeviceConverter` - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL` - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
-
- 05 12月, 2017 1 次提交
-
-
由 Qiao Longfei 提交于
* add crf_decoding layer * fix some typo * fix test_crf_decoding_op
-
- 04 11月, 2017 1 次提交
-
-
由 Cao Ying 提交于
* proj init. * add unittest and implementation.
-