- 24 9月, 2020 1 次提交
-
-
由 zhaoyang-star 提交于
* [Bugfix][OpenCL][Core] fix opencl multi-run result error when using memory_optimize_pass (#4410) * [Bugfix][OpenCL][Core] fix opencl multi-run result error when using memory_optimize_pass. test=develop * test=develop Co-authored-by: Nysh329 <ysh329@users.noreply.github.com>
-
- 22 9月, 2020 1 次提交
-
-
由 huzhiqiang 提交于
-
- 21 9月, 2020 1 次提交
-
-
由 huzhiqiang 提交于
-
- 19 9月, 2020 1 次提交
-
-
由 hong19860320 提交于
-
- 17 9月, 2020 1 次提交
-
-
由 ysh329 提交于
-
- 15 9月, 2020 1 次提交
-
-
由 ysh329 提交于
[cherry-pick][PROFILE][BugFix] Precision profiler writes output tensor to files for each op; Fix dropout opencl kernel register (#4331) * cherry-pick from #4255, write output tensor to file. test=develop * cherry-pick from fix opencl dropout. test=develop (#4253)
-
- 10 9月, 2020 1 次提交
-
-
由 hong19860320 提交于
* [cherry-pick]platform portability of tls, test=develop (#4261) test=develop Co-authored-by: N石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>
-
- 21 8月, 2020 1 次提交
-
-
由 ysh329 提交于
* fix remove tf redundant op pass. test=develop (#4158) * test=develop
-
- 12 8月, 2020 1 次提交
-
-
由 ysh329 提交于
-
- 01 8月, 2020 1 次提交
-
-
由 hong19860320 提交于
test=develop
-
- 30 7月, 2020 1 次提交
-
-
由 ysh329 提交于
[BugFix][OPENCL] Fix initalization sequence of opencl backend valid API. test=develop (#4003) (#4021) * fix opencl backend. test=develop
-
- 23 7月, 2020 1 次提交
-
-
由 hong19860320 提交于
* [Cherry-pick][Core] Add the graph optimization of subblocks for transformer model (#3947) test=develop * [Core][ARM] Fix beam_search, eltwise_mul supports broadcast and int64_t data type, add print op and kernel, add exeception test=develop * Fix the dims of parent idx of the arm kernel of beam_search op * elementwise_mul supports int64_t data type with broadcasting * Add print op and kernel for debugging * Support throwing the exception when the internal error occurs * Refine while and conditional_block op kernel * Support the graph optimization on subblocks * Pass program_desc and block_idx into the kernel of the control flow ops(while/conditional_block/subgraph), and create the RuntimeProgram online, it make it possiable to call the control flow ops recursively *Add unit test for masked transformer model
-
- 17 7月, 2020 1 次提交
-
-
由 ysh329 提交于
* [cherry-pick][OPENCL] remove conv redundant's for opencl kernel. test=develop Co-authored-by: Nxiebaiyuan <xiebaiyuan@qq.com>
-
- 13 7月, 2020 1 次提交
-
-
由 Qi Li 提交于
* [NPU] enhance cache offline model, test=develop
-
- 22 5月, 2020 1 次提交
-
-
由 huzhiqiang 提交于
-
- 18 5月, 2020 1 次提交
-
-
由 huzhiqiang 提交于
-
- 14 5月, 2020 1 次提交
-
-
由 huzhiqiang 提交于
-
- 12 5月, 2020 1 次提交
-
-
由 huzhiqiang 提交于
-
- 09 5月, 2020 1 次提交
-
-
由 Yuan Shuai 提交于
* [LITE] Set high perf mode as default in mobile_light deme; Fix Precision Profiler caused by X86 header; Enhance profiler for opencl. test=develop Co-authored-by: NHappyAngel <chenjiaobuaa@126.com>
-
- 08 5月, 2020 1 次提交
-
-
由 huzhiqiang 提交于
-
- 07 5月, 2020 1 次提交
-
-
由 cc 提交于
-
- 29 4月, 2020 1 次提交
-
-
由 HappyAngel 提交于
*fix depthwise deconv+bn fusion
-
- 27 4月, 2020 2 次提交
-
-
由 huzhiqiang 提交于
-
由 hong19860320 提交于
-
- 22 4月, 2020 2 次提交
-
-
由 Yuan Shuai 提交于
* Fix activation act_type default value. test=develop * Fix enum value. test=develop * Fix conv test for relu. test=develop
-
由 Cwndmiao 提交于
-
- 20 4月, 2020 1 次提交
-
-
由 huzhiqiang 提交于
-
- 19 4月, 2020 1 次提交
-
-
由 xiebaiyuan 提交于
* [lite][opencl] remove event with clfinish, add strict check for cl warning. add conv 3x3opt fallback opt layout cast ,test=develop * [LITE][OPENCL]rm event in element_add_buffer_compute test=develop * [LITE][OPENCL]suite cl_functions_test.cc test=develop * [LITE][OPENCL] suite cl_common.sh lint check test=develop * [LITE][OPENCL] suite conv_image_compute.cc lint check test=develop * [LITE][OPENCL] suite cl_wait_list() lint check test=develop
-
- 16 4月, 2020 1 次提交
-
-
由 zhupengyang 提交于
-
- 15 4月, 2020 3 次提交
-
-
由 MaxwellDing 提交于
refactor(*): reduce Wsign-compare warning
-
由 zhupengyang 提交于
-
由 hong19860320 提交于
-
- 14 4月, 2020 3 次提交
-
-
由 silingtong123 提交于
-
由 Wilber 提交于
for multiple-use args, only cast once
-
由 airockchip 提交于
-
- 13 4月, 2020 3 次提交
-
-
由 zhupengyang 提交于
* [NPU] add shape bridge move shape arm kernel to host * enhance compare arm kernel * [NPU] add gather op bridge * enable reshape arm ut * [NPU] add lookup_table bridge
-
由 Wilber 提交于
lite cuda support exec multi-stream
-
由 huzhiqiang 提交于
-
- 10 4月, 2020 2 次提交
-
-
由 cc 提交于
* Optimize weight quantizaion, test=develop
-
由 HappyAngel 提交于
* update profile info(add new element), test=develop
-