- 10 3月, 2018 2 次提交
-
-
由 pzelazko-intel 提交于
* MKLDNN pool2d OP kernel added * conv2d and pool2d MKLDNN kernels renamed * MKLDNN conv2d kernel refactoring
-
由 fengjiayi 提交于
-
- 09 3月, 2018 9 次提交
-
-
由 QI JUN 提交于
* fix mac build error * override GetExpectedKernelType for cast op * fix typo * add cuda unittest
-
由 Yancey 提交于
Fix sparse update memory error for distributed training
-
由 fengjiayi 提交于
-
由 fengjiayi 提交于
-
由 fengjiayi 提交于
-
由 武毅 提交于
-
由 kexinzhao 提交于
* test cpu float16 data transform * add isnan etc * small fix * fix containsNAN test error * add data_type transform GPU test * add float16 GPU example * fix error * fix GPU test error * initial commit * fix error * small fix * add more gemm fp16 tests * fix error * add utility function
-
由 武毅 提交于
-
由 Xin Pan 提交于
-
- 08 3月, 2018 10 次提交
-
-
由 qingqing01 提交于
-
由 Yiqun Liu 提交于
* Add test for nested RecordEvent. * Remove the debug information. * Add log information for the 3 usages and reduce the loop counts of nested case.
-
由 Xin Pan 提交于
-
由 QI JUN 提交于
-
由 chengduoZH 提交于
-
由 Xin Pan 提交于
-
由 Yiqun Liu 提交于
* Add profiling information for inference example, recognize digits. * Refine the profiling method. * Correct the use of RecordEvent and simplify recognize_digits.
-
由 qingqing01 提交于
-
由 kexinzhao 提交于
-
由 Tao Luo 提交于
* compile and install the static library of fluid inference * fix dynload_cuda not in CPU mode * update shared library and adjust the deploy of openblas * adjust the deploy of openblas * * auto add all fluid modules for static library * use libprotobuf.a instead of libprotobuf-lite.a for profiler * use set_property to set the global varible instead of ENV * add gpu depends of fluid modules, auto add inference_lib_dist depends * change the condition of openblas_lib, and fix a typo
-
- 07 3月, 2018 12 次提交
-
-
由 fengjiayi 提交于
-
由 fengjiayi 提交于
-
由 fengjiayi 提交于
-
由 Luo Tao 提交于
-
由 Luo Tao 提交于
-
由 pzelazko-intel 提交于
* MKLDNN conv2 OP kernel added * TODOs added * mkldnn conv2d OP refactor * CanCUDNNBeUsed and CanMKLDNNBeUsed moved
-
由 Yan Chunwei 提交于
-
由 fengjiayi 提交于
-
由 chengduoZH 提交于
-
由 Yu Yang 提交于
-
由 kexinzhao 提交于
* test cpu float16 data transform * add isnan etc * small fix * fix containsNAN test error * add data_type transform GPU test * add float16 GPU example * fix error * fix GPU test error * add context wait
-
由 Abhinav Arora 提交于
* Redesign channel implementation for Select Op * Remove unecessary header * Remove unnecessary comments
-
- 06 3月, 2018 7 次提交
-
-
由 chengduoZH 提交于
-
由 fengjiayi 提交于
-
由 qingqing01 提交于
* Fix mAP evaluator bug. * Fix bug in detection mAP evaluator. * Fix unit testing. * Support to set background label index in detection mAP op.
-
由 Yu Yang 提交于
-
由 Xin Pan 提交于
-
由 Xin Pan 提交于
smaller binary proto avoid untrackable kernel
-
由 Yu Yang 提交于
-