Unverified commit 913a28d4, author: WangLiu, committer: GitHub

Merge pull request #555 from smilejames/develop

Fix gemm assembly bug when compiling with clang from NDK17
......
@@ -1209,12 +1209,12 @@ void WriteWithAddRelu(int mc, int nc, float *c, float *C, int ldc) {
 // C = A * B, batchnorm(C)
 void WriteWithBn(int mc, int nc, float *c, float *C, int ldc, float *scale,
                  float *bias) {
-  int nc1 = nc / 16;
+  int volatile nc1 = nc / 16;
   int _nc1 = nc % 16;
-  int nc2 = _nc1 / 4;
-  int nc3 = 16 - 4 * (_nc1 % 4);
-  int step = 4 * (ldc - nc);
-  int step1 = 4 * (NC - nc);
+  int volatile nc2 = _nc1 / 4;
+  int volatile nc3 = 16 - 4 * (_nc1 % 4);
+  int volatile step = 4 * (ldc - nc);
+  int volatile step1 = 4 * (NC - nc);
   asm volatile(
       "subs %[mc], %[mc], #1 \n\t"
......
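
For reviewers who want a plain C++ baseline to compare the assembly path against, here is a minimal sketch of what the hunk's setup code and comment describe. It is not the project's implementation. `SplitColumns`, `ColumnSplit`, `WriteWithBnReference`, and the `packed_ld` parameter (standing in for the `NC` macro) are hypothetical names introduced for illustration. The sketch assumes `scale` and `bias` are indexed per output row; the diff does not show this, so check it against the full source.

```cpp
#include <cassert>

// Column partition computed at the top of WriteWithBn in the diff.
struct ColumnSplit {
  int nc1;  // number of full 16-float blocks per row (nc / 16)
  int nc2;  // number of full 4-float blocks in the remainder
  int nc3;  // 16 - 4 * (leftover floats), exactly as written in the diff
};

inline ColumnSplit SplitColumns(int nc) {
  assert(nc >= 0);
  const int rem = nc % 16;  // `_nc1` in the diff
  return ColumnSplit{nc / 16, rem / 4, 16 - 4 * (rem % 4)};
}

// Scalar reference for "C = A * B, batchnorm(C)".
// ASSUMPTION: scale and bias hold one value per output row.
// c:  packed GEMM result, row stride `packed_ld` (hypothetical stand-in for NC)
// C:  destination matrix, row stride `ldc`
inline void WriteWithBnReference(int mc, int nc, const float *c, int packed_ld,
                                 float *C, int ldc, const float *scale,
                                 const float *bias) {
  for (int i = 0; i < mc; ++i) {
    for (int j = 0; j < nc; ++j) {
      C[i * ldc + j] = c[i * packed_ld + j] * scale[i] + bias[i];
    }
  }
}
```

Running such a reference next to the assembly version on small matrices, built with both the old and the new toolchain, is one way to confirm that the `volatile` change produces the same output.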