fix load_when_predict mode in opencl (#1619)

* optimize GPU conv performance and structure * add CL macro for test_conv_gpu * fix build failure * change funtion name * change funtion name * fix load_when_predict mode in opencl

fix load_when_predict mode in opencl (#1619)
* optimize GPU conv performance and structure * add CL macro for test_conv_gpu * fix build failure * change funtion name * change funtion name * fix load_when_predict mode in opencl
c6d366c2 · Jiaying Zhao · GitHub · cf542116 · c6d366c2
隐藏空白更改
内联并排

Showing with 2 addition and 2 deletion

src/framework/executor.cpp src/framework/executor.cpp +2 -2

未找到文件。
--- a/src/framework/executor.cpp
+++ b/src/framework/executor.cpp
@@ -715,14 +715,14 @@ void Executor<GPU_CL, float>::InitNoPersistableMemory(
    for (const auto &var_desc : block->Vars()) {
      auto var = program_.scope->Var(var_desc->Name());
-      auto cl_image = var->template GetMutable<CLImage>();
      if (var_desc->Persistable()) {
        if (var_desc->Name() == "feed" || var_desc->Name() == "fetch") {
+          var->template GetMutable<framework::LoDTensorArray>();
          continue;
        }
      } else {
        if (var_desc->Type() == VARTYPE_TYPE_LOD_TENSOR) {
+          auto cl_image = var->template GetMutable<CLImage>();
          cl_context context = program_.scope->GetCLScpoe()->Context();
          cl_command_queue command_queue =
              program_.scope->GetCLScpoe()->CommandQueue();