Created by: zhangyong15
现象,集群训练报错,Local模式正常。期间使用classification_cost报top k,依照#2574方法尝试解决,报相关错误如下:
Fri Jun 23 15:40:01 2017[1,53]:F0623 15:40:01.543020 31957 LightNetwork.cpp:397] Check failed: error >= 0 ERROR connecting to 10.87.100.36: Connection refused [111]
Fri Jun 23 15:40:01 2017[1,53]:
* Check failure stack trace: *
Fri Jun 23 15:40:01 2017[1,53]:F0623 15:40:01.543030 32628 LightNetwork.cpp:397] Check failed: error >= 0 ERROR connecting to 10.87.100.36: Connection refused [111]
Fri Jun 23 15:40:01 2017[1,53]:
* Check failure stack trace: *
Fri Jun 23 15:40:01 2017[1,53]: @ 0x91316d google::LogMessage::Fail()
Fri Jun 23 15:40:01 2017[1,53]: @ 0x91316d google::LogMessage::Fail()
Fri Jun 23 15:40:01 2017[1,53]: @ 0x916c1c google::LogMessage::SendToLog()
Fri Jun 23 15:40:01 2017[1,53]: @ 0x912c93 google::LogMessage::Flush()
Fri Jun 23 15:40:01 2017[1,53]: @ 0x916c1c google::LogMessage::SendToLog()
Fri Jun 23 15:40:01 2017[1,53]: @ 0x912e99 google::LogMessage::~LogMessage()
Fri Jun 23 15:40:01 2017[1,53]: @ 0x912c93 google::LogMessage::Flush()
Fri Jun 23 15:40:01 2017[1,53]: @ 0x916147 google::ErrnoLogMessage::~ErrnoLogMessage()
Fri Jun 23 15:40:01 2017[1,53]: @ 0x912e99 google::LogMessage::~LogMessage()
Fri Jun 23 15:40:01 2017[1,53]: @ 0x768fa1 paddle::SocketClient::TcpClient()
Fri Jun 23 15:40:01 2017[1,53]: @ 0x916147 google::ErrnoLogMessage::~ErrnoLogMessage()
Fri Jun 23 15:40:01 2017[1,53]: @ 0x7691a1 paddle::SocketClient::SocketClient()
Fri Jun 23 15:40:01 2017[1,53]: @ 0x768fa1 paddle::SocketClient::TcpClient()
Fri Jun 23 15:40:01 2017[1,53]: @ 0xf06e50 paddle::ParameterClient2::init()