Commit 4512aecb, authored by liuqi

Fix bug: tuning 2D lws for GPU does not work.

Parent 5dc58b5b
...
@@ -408,7 +408,7 @@ void TuningOrRun2DKernel(const cl::Kernel &kernel,
           {kwg_size / 128, 128, 0}, {kwg_size / 256, 256, 0},
           {kwg_size, 1, 0}, {1, kwg_size, 0}};
       for (auto &ele : candidates) {
-        const uint32_t tmp = ele[0] * ele[1] * ele[2];
+        const uint32_t tmp = ele[0] * ele[1];
         if (0 < tmp && tmp <= kwg_size) {
           results.push_back(ele);
         }
......
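
Why the one-line change matters: every candidate visible in the hunk has `0` in its third slot, so the old product `ele[0] * ele[1] * ele[2]` is always `0`, the `0 < tmp` check never passes, and no candidate reaches `results`. The sketch below reproduces this in isolation. It is an illustration only, not code from the repository: the helper `FilterCandidates2D` and its `include_third_slot` flag are hypothetical, and it assumes the third slot is a placeholder that is not a work-size dimension.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical helper (not part of the commit): keeps the candidate local
// work sizes whose product is positive and does not exceed kwg_size.
// include_third_slot = true mimics the old check, false mimics the fix.
std::vector<std::vector<uint32_t>> FilterCandidates2D(
    const std::vector<std::vector<uint32_t>> &candidates,
    uint32_t kwg_size,
    bool include_third_slot) {
  std::vector<std::vector<uint32_t>> results;
  for (const auto &ele : candidates) {
    uint32_t tmp = ele[0] * ele[1];
    if (include_third_slot) {
      tmp *= ele[2];  // Old behavior: a 0 placeholder zeroes the product.
    }
    if (0 < tmp && tmp <= kwg_size) {
      results.push_back(ele);
    }
  }
  return results;
}
```

With an assumed `kwg_size` of 256 and the four candidates visible in the hunk, the old check keeps none of them, while the fixed check keeps all four.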