Error when moving model training from the GPU cluster to the CPU cluster
Created by: qubingxin
I previously trained a CNN model on GPUs. After moving the training to the CPU cluster, it fails with an error. local_config and receiver have already been updated, and the --where parameter has been changed. The error is shown below; libpython2.6.so.1.0 cannot be found:
I1130 10:44:17.482218 3086 PyDataProvider.cpp:43] module:pyDataProviderImage class:GeneralJpegDataProvider
I1130 10:44:17.868773 3086 PythonUtil.cpp:149] createPythonClass moduleName.c_str:pyDataProviderImage
I1130 10:44:17.868809 3086 PythonUtil.cpp:154] createPythonClass className.c_str():GeneralJpegDataProvider
I1130 10:44:17.888891 3086 PythonUtil.cpp:83] Python Error: <type 'exceptions.ImportError'> : libpython2.6.so.1.0: cannot open shared object file: No such file or directory
I1130 10:44:17.888914 3086 PythonUtil.cpp:87] Python Callstack:
I1130 10:44:17.888922 3086 PythonUtil.cpp:92] /home/normandy/maybach/204635/workspace/thirdparty/thirdparty/pyDataProviderImage.py : 127
F1130 10:44:17.889091 3086 PythonUtil.cpp:108] Create class GeneralJpegDataProvider failed.
Lines 126-127 of pyDataProviderImage.py are:
    self.lib_name = "decodeJPEG._DeJPEG"
    self.libmodel = __import__(self.lib_name, fromlist=['_DeJPEG'])
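One way to narrow this down is to check on a CPU node whether the dynamic loader can find libpython2.6.so.1.0 at all. The sketch below assumes the ImportError is raised because the `_DeJPEG` extension is linked against that shared library and the loader cannot locate it on the CPU cluster. The helper name `can_load` is hypothetical and is not part of pyDataProviderImage.py; only the standard `ctypes` and `os` modules are used.

```python
import ctypes
import os


def can_load(lib_name):
    """Return True if the dynamic loader can open lib_name, else False.

    Hypothetical helper for diagnosis only.
    """
    try:
        ctypes.CDLL(lib_name)
        return True
    except OSError:
        # Raised when the shared object cannot be found or opened.
        return False


if __name__ == "__main__":
    # Extra directories the loader searches besides the system defaults.
    print("LD_LIBRARY_PATH = %s" % os.environ.get("LD_LIBRARY_PATH", ""))
    print("libpython2.6.so.1.0 loadable: %s" % can_load("libpython2.6.so.1.0"))
```

If this reports that the library is loadable on a GPU node but not on a CPU node, the library is probably missing from the CPU nodes or is not on their loader search path. That is a guess based on the log above, not a confirmed diagnosis.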