- 13 3月, 2018 1 次提交
-
-
由 QI JUN 提交于
* fix nccl op unit test * fix build error * format code * refine nccl related unit test * fix build error * add setGPUData * clean up * follow comments * rm test_nccl.cu * follow comment * rm wait
-
- 12 3月, 2018 6 次提交
-
-
由 Yu Yang 提交于
-
由 QI JUN 提交于
* add memopt level * add opt level for image classification demo * clean code * add delete op * clean code * test machine translation demo * clean code * clean code * skip fill constant with force cpu * clean code * clean code * refine code * clean code * fix bug
-
由 Yancey 提交于
-
由 qingqing01 提交于
* Clipping bbox in the mAP evaluator calculation. * Fix bug in detection_output and mAP calculation in SSD. * Fix bug in detection.py. * Fix bug in test_detection_map_op.py.
-
由 Kexin Zhao 提交于
-
由 Kexin Zhao 提交于
-
- 10 3月, 2018 5 次提交
-
-
由 pzelazko-intel 提交于
* MKLDNN pool2d OP kernel added * conv2d and pool2d MKLDNN kernels renamed * MKLDNN conv2d kernel refactoring
-
由 Kexin Zhao 提交于
-
由 Kexin Zhao 提交于
-
由 Kexin Zhao 提交于
-
由 fengjiayi 提交于
-
- 09 3月, 2018 10 次提交
-
-
由 QI JUN 提交于
* fix mac build error * override GetExpectedKernelType for cast op * fix typo * add cuda unittest
-
由 Yancey 提交于
Fix sparse update memory error for distributed training
-
由 fengjiayi 提交于
-
由 fengjiayi 提交于
-
由 fengjiayi 提交于
-
由 Liu Yiqun 提交于
-
由 武毅 提交于
-
由 kexinzhao 提交于
* test cpu float16 data transform * add isnan etc * small fix * fix containsNAN test error * add data_type transform GPU test * add float16 GPU example * fix error * fix GPU test error * initial commit * fix error * small fix * add more gemm fp16 tests * fix error * add utility function
-
由 武毅 提交于
-
由 Xin Pan 提交于
-
- 08 3月, 2018 11 次提交
-
-
由 chengduoZH 提交于
-
由 qingqing01 提交于
-
由 Yiqun Liu 提交于
* Add test for nested RecordEvent. * Remove the debug information. * Add log information for the 3 usages and reduce the loop counts of nested case.
-
由 Xin Pan 提交于
-
由 QI JUN 提交于
-
由 chengduoZH 提交于
-
由 Xin Pan 提交于
-
由 Yiqun Liu 提交于
* Add profiling information for inference example, recognize digits. * Refine the profiling method. * Correct the use of RecordEvent and simplify recognize_digits.
-
由 qingqing01 提交于
-
由 kexinzhao 提交于
-
由 Tao Luo 提交于
* compile and install the static library of fluid inference * fix dynload_cuda not in CPU mode * update shared library and adjust the deploy of openblas * adjust the deploy of openblas * * auto add all fluid modules for static library * use libprotobuf.a instead of libprotobuf-lite.a for profiler * use set_property to set the global varible instead of ENV * add gpu depends of fluid modules, auto add inference_lib_dist depends * change the condition of openblas_lib, and fix a typo
-
- 07 3月, 2018 7 次提交
-
-
由 fengjiayi 提交于
-
由 fengjiayi 提交于
-
由 fengjiayi 提交于
-
由 Luo Tao 提交于
-
由 Luo Tao 提交于
-
由 pzelazko-intel 提交于
* MKLDNN conv2 OP kernel added * TODOs added * mkldnn conv2d OP refactor * CanCUDNNBeUsed and CanMKLDNNBeUsed moved
-
由 Yan Chunwei 提交于
-