[cherry-pick][ARM] armv7 improve sgemmc4 small kernel speed by add 4x8 block, test=develop (#2486)
* unfinish sgemmc4 * finish armv8 sgemmc4 * arm add sgemmc4 with deal with remain * [ARM] add sgemmc4 small kernel, test=develop * [ARM] sgemmc4 small improve armv7 speed by add 4x8 block, test=develop
Showing
想要评论请 注册 或 登录