- 04 Apr, 2020 1 commit
-
-
Committed by Yuan Shuai
* [LITE][OPENCL] improve perf and support variable length for fc_buffer, elementwise_add, scale, activation, grid_sampler. test=develop
-
- 03 Apr, 2020 4 commits
-
-
Committed by xiebaiyuan
* [LITE][OPENCL][Image] add lws turn & close cl check when shutdownlog , test=develop
-
Committed by huzhiqiang
-
Committed by xiaogang
-
Committed by zhupengyang
-
- 02 Apr, 2020 2 commits
-
-
Committed by Santa An
* [LITE][BM] add fill_const assign_value ops, test=develop
-
Committed by zhupengyang
-
- 01 Apr, 2020 2 commits
-
-
Committed by Wilber
add cuda kernel. abs, tanh, elementwise_sub
-
Committed by zhupengyang
-
- 30 Mar, 2020 2 commits
-
-
Committed by xiaogang
fix sequence_pool bug
-
Committed by zhupengyang
-
- 27 Mar, 2020 3 commits
-
-
Committed by Yuan Shuai
* [LITE][OPENCL] fix elemul kernel of opencl. test=develop * fix instanceNorm of opencl. test=develop * add more info about cl device info. test=develop
-
Committed by xiebaiyuan
* [LITE][OPENCL][Image] mv kernel init gws lws into prepare for run, test=develop * [LITE][OPENCL][Image] shut down profile, test=develop * [LITE][OPENCL][Image] move log ahead, test=develop
-
Committed by mapingshuo
-
- 26 Mar, 2020 2 commits
- 25 Mar, 2020 2 commits
-
-
Committed by huzhiqiang
-
Committed by Yuan Shuai
* [LITE][OPENCL] enhance elemul, fix reshape for opencl. test=develop * comment loop test for pad2 opencl kernel. test=develop * [LITE][PROFILE] add sequence statics for precision profiler. test=develop
-
- 24 Mar, 2020 5 commits
-
-
Committed by mapingshuo
* rename elementwise_sub_grad, test=develop * rename Grad to GRAD, test=develop * deal with case that Y@GRAD is empty, test=develop
-
Committed by cc
-
Committed by xiebaiyuan
-
Committed by xiebaiyuan
* [LITE][OPENCL][Image]optimise conv2d 5x5 7x7,test=develop
-
Committed by Yuan Shuai
* [LITE][OPENCL] comment vlog for opencl kernel. test=develop
-
- 23 Mar, 2020 2 commits
-
-
Committed by xiaogang
-
Committed by zhupengyang
-
- 22 Mar, 2020 1 commit
-
-
Committed by Yuan Shuai
* [LITE][OPENCL] clean code for opencl. test=develop * [LITE][PROFILER] Enhance Precision Profiler. test=develop * delete useless var in profiler. test=develop * add ocl header. test=develop
-
- 20 Mar, 2020 1 commit
-
-
Committed by mapingshuo
* rm grad code * add mul_grad, test=develop
-
- 18 Mar, 2020 3 commits
-
-
Committed by Yuan Shuai
* Fix OpenCL kernel of exp, tanh, * Fix layout pass for opencl backend, * Add how to debug cl kernel. test=develop.
-
Committed by zhupengyang
-
Committed by HappyAngel
-
- 17 Mar, 2020 5 commits
-
-
Committed by Wilber
add cuda c++ doc
-
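Committed by Wilber
- Add a cuda c++ demo. - Detection models usually end with multiclass_nms, which is a host kernel. If the fetch kernel were a cuda kernel, a useless io_copy (host->cuda) would be inserted at that point, so the cuda kernel for fetch is commented out and the host fetch kernel is used by default. Implicit behavior to be aware of: after every predictor run, the data is copied from cuda to cpu by default.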
-
Committed by Santa An
* support download bm_sdk, test=develop * [LITE][BM] add slice op * [LITE][BM] fix concat issue * [LITE][BM] support reduce full ops, test=develop * [LITE][BM] change test_resnet50 to change test_classify * [LITE][BM] add cast op * [LITE][BM] add reduce and other ops, test=develop * [LITE][BM] add reduce,cast and other ops, test=develop
-
Committed by Wilber
add cuda ci. Organize cuda compilation products.
-
Committed by mapingshuo
* add sgd op, test=develop * test=develop
-
- 16 Mar, 2020 4 commits
-
-
Committed by hong19860320
-
Committed by mapingshuo
* add backend * add kernel * add grad kernel
-
Committed by cc
-
Committed by zhupengyang
* [NPU] subgraph's precision register to kAny
-
- 15 Mar, 2020 1 commit
-
-
Committed by Yuan Shuai
* [LITE][OPENCL] Change fp32 fc to fp16's. test=develop * fix act in conv3x3opt opencl kernel. test=develop * [LITE][OPENCL] fix opencl fc kernel. test=develop * [LITE][OPENCL] add lws for opencl conv image kernel. test=develop
-