- 05 8月, 2020 1 次提交
-
-
由 ysh329 提交于
* add int32->int64 cast arm cpu kernel. test=develop
-
- 30 7月, 2020 1 次提交
-
-
由 ysh329 提交于
-
- 29 7月, 2020 1 次提交
-
-
由 hong19860320 提交于
[ARM] [NPU] Fix the overflow of layer_norm op bridge, fix the registration of lstm op kernel (#4007) * [NPU] Fix the overflow of layer_norm op bridge test=develop * [ARM] Fix the registration of lstm op kernel test=develop
-
- 27 7月, 2020 1 次提交
-
-
由 hong19860320 提交于
test=develop
-
- 23 7月, 2020 1 次提交
-
-
由 hong19860320 提交于
* [Cherry-pick][Core] Add the graph optimization of subblocks for transformer model (#3947) test=develop * [Core][ARM] Fix beam_search, eltwise_mul supports broadcast and int64_t data type, add print op and kernel, add exeception test=develop * Fix the dims of parent idx of the arm kernel of beam_search op * elementwise_mul supports int64_t data type with broadcasting * Add print op and kernel for debugging * Support throwing the exception when the internal error occurs * Refine while and conditional_block op kernel * Support the graph optimization on subblocks * Pass program_desc and block_idx into the kernel of the control flow ops(while/conditional_block/subgraph), and create the RuntimeProgram online, it make it possiable to call the control flow ops recursively *Add unit test for masked transformer model
-
- 17 7月, 2020 1 次提交
-
-
由 ysh329 提交于
* [cherry-pick][OPENCL] remove conv redundant's for opencl kernel. test=develop Co-authored-by: Nxiebaiyuan <xiebaiyuan@qq.com>
-
- 14 7月, 2020 1 次提交
-
-
由 Qi Li 提交于
[cherry-pick] [NPU] apply npu cache offline model to other devices, test=develop (#3925), test=release/v2.6 (#3933)
-
- 13 7月, 2020 1 次提交
-
-
由 Qi Li 提交于
* [NPU] enhance cache offline model, test=develop
-
- 09 7月, 2020 2 次提交
-
-
由 ysh329 提交于
* fix opencl fc kernel caused int16 model weight abnormal. test=develop
-
由 HappyAngel 提交于
* [arm]add 2x2s2p1 pooling (#3705) * fix pooling bug and speed * add 2x2s2p1 pooling. test=develop * fix conflict, test=develop * fix conflict in wino * [arm] add 3x3s1 Winograd int8 (#3767) * fix: winograd support unsame pad test=develop * feat: add winograd int8 kernel test=develop * fix: style fix test=develo * fix winograd_int8 ut sgement default. test=develop * close basic_test, test=develop Co-authored-by: NMyPandaShaoxiang <txg4794@163.com> * fix xiaodu crash in gemm prepacked * in huwen phone, 3x3s2p0 avg pooling will rand crash, other phone does not have this feature * [arm] update con int8 kernel choose (#3834) * fix conv int8 kernel choose and sooftmax compute bug * change axis_size = 4 kernel choose, test=develop * fix format. test=develop * fix format.test=develop * fix build test=develop * fix buiild error test=develop * fix wino_int8 computte erroor. test=develop * Update the link to debug, test=develop, test=document_fix (#3870) (#3871) Co-authored-by: NMyPandaShaoxiang <txg4794@163.com> Co-authored-by: Ncc <52520497+juncaipeng@users.noreply.github.com>
-
- 22 6月, 2020 1 次提交
-
-
由 mapingshuo 提交于
-
- 11 5月, 2020 1 次提交
-
-
由 huzhiqiang 提交于
-
- 10 5月, 2020 1 次提交
-
-
由 Wilber 提交于
* update cuda demo.
-
- 01 5月, 2020 1 次提交
-
-
由 huzhiqiang 提交于
-
- 30 4月, 2020 1 次提交
-
-
由 hong19860320 提交于
-
- 27 4月, 2020 1 次提交
-
-
由 huzhiqiang 提交于
-
- 22 4月, 2020 4 次提交
-
-
由 Yuan Shuai 提交于
* Fix Places of CXX Config for OpenCL. test=develop * fix shared ptr as unqiue ptr. test=develop
-
由 Yuan Shuai 提交于
* fix bilinear opencl kernel. test=develop * [LITE][OPENCL] replace map with memsync. test=develop * [GIT] Ehance ignore. test=develop * replace map of Fc with MemCpySync. test=develop * comment for mul of opencl. test=develop * Fix opencl miscs. test=develop
-
由 Yuan Shuai 提交于
* Fix activation act_type default value. test=develop * Fix enum value. test=develop * Fix conv test for relu. test=develop
-
由 Cwndmiao 提交于
-
- 21 4月, 2020 2 次提交
-
-
由 HappyAngel 提交于
* improve 3x3s1 direct profile * fix format, test=develop
-
由 xiaogang 提交于
-
- 20 4月, 2020 2 次提交
-
-
由 zhupengyang 提交于
-
由 xiebaiyuan 提交于
1. fix clang android log warning. 2. remove an unused event.
-
- 19 4月, 2020 1 次提交
-
-
由 xiebaiyuan 提交于
* [lite][opencl] remove event with clfinish, add strict check for cl warning. add conv 3x3opt fallback opt layout cast ,test=develop * [LITE][OPENCL]rm event in element_add_buffer_compute test=develop * [LITE][OPENCL]suite cl_functions_test.cc test=develop * [LITE][OPENCL] suite cl_common.sh lint check test=develop * [LITE][OPENCL] suite conv_image_compute.cc lint check test=develop * [LITE][OPENCL] suite cl_wait_list() lint check test=develop
-
- 17 4月, 2020 1 次提交
-
-
由 zhupengyang 提交于
-
- 16 4月, 2020 2 次提交
-
-
由 zhupengyang 提交于
-
由 zhupengyang 提交于
-
- 15 4月, 2020 4 次提交
-
-
由 Yuan Shuai 提交于
* fix bilinear opencl kernel. test=develop * [LITE][OPENCL] replace map with memsync. test=develop
-
由 MaxwellDing 提交于
refactor(*): reduce Wsign-compare warning
-
由 zhupengyang 提交于
-
由 hong19860320 提交于
-
- 14 4月, 2020 5 次提交
-
-
由 silingtong123 提交于
-
由 airockchip 提交于
-
由 xiaogang 提交于
-
由 huzhiqiang 提交于
-
由 cc 提交于
* Optimize matmul for size(x_dims)=2 size(y_dims)>2
-
- 13 4月, 2020 3 次提交
-
-
由 zhupengyang 提交于
* [NPU] add shape bridge move shape arm kernel to host * enhance compare arm kernel * [NPU] add gather op bridge * enable reshape arm ut * [NPU] add lookup_table bridge
-
由 cc 提交于
-
由 xiebaiyuan 提交于
* [LITE][OPENCL][Image] lazy init cl::event , test=develop * [LITE][OPENCL][Image] lazy init cl::event , test=develop
-