connection refused by pserver, maybe pserver failed!
Created by: youan1
如题,
Wed Aug 23 23:35:12 2017[1,104]<stderr>:F0823 23:35:12.819823 46441 LightNetwork.cpp:395] connection refused by pserver, maybe pserver failed!
Wed Aug 23 23:35:12 2017[1,104]<stderr>:*** Check failure stack trace: ***
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x7f4dc82b827d google::LogMessage::Fail()
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x7f4dc82bbd2c google::LogMessage::SendToLog()
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x7f4dc82b7da3 google::LogMessage::Flush()
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x7f4dc82bd23e google::LogMessageFatal::~LogMessageFatal()
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x7f4dc81153c1 paddle::SocketClient::TcpClient()
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x7f4dc81155a1 paddle::SocketClient::SocketClient()
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x7f4dc8d989b0 paddle::ParameterClient2::init()
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x7f4dc892509d paddle::RemoteParameterUpdater::init()
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x7f4dc82981ea ParameterUpdater::init()
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x7f4dc7f41f7b _wrap_ParameterUpdater_init
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x4b4cb9 PyEval_EvalFrameEx
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x4b6b28 PyEval_EvalCodeEx
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x4b5d10 PyEval_EvalFrameEx
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x4b6b28 PyEval_EvalCodeEx
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x4b5d10 PyEval_EvalFrameEx
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x4b6b28 PyEval_EvalCodeEx
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x4b5d10 PyEval_EvalFrameEx
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x4b6b28 PyEval_EvalCodeEx
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x4b6c52 PyEval_EvalCode
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x4e1c7d PyRun_FileExFlags
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x4e3501 PyRun_SimpleFileExFlags
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x4159dd Py_Main
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x7f4dcaceebd5 __libc_start_main
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ 0x414b71 (unknown)
Wed Aug 23 23:35:12 2017[1,104]<stderr>: @ (nil) (unknown)
Wed Aug 23 23:35:13 2017[1,104]<stderr>:./train.sh: line 239: 46441 Aborted python27-gcc482/bin/python conf/trainer_config.conf
Wed Aug 23 23:35:13 2017[1,104]<stderr>:+ '[' 134 -ne 0 ']'
Wed Aug 23 23:35:13 2017[1,104]<stderr>:+ kill_pserver2_exit
Wed Aug 23 23:35:13 2017[1,104]<stderr>:+ ps aux
Wed Aug 23 23:35:13 2017[1,104]<stderr>:+ grep paddle_pserver2
Wed Aug 23 23:35:13 2017[1,104]<stderr>:+ grep paddle_cluster_job
Wed Aug 23 23:35:13 2017[1,104]<stderr>:+ grep -v grep
Wed Aug 23 23:35:13 2017[1,104]<stderr>:+ cut -c10-14
Wed Aug 23 23:35:13 2017[1,104]<stderr>:+ xargs kill -9
Wed Aug 23 23:35:13 2017[1,104]<stderr>:+ log_fatal 'paddle_trainer failed kill paddle_pserver2 and exit'
Wed Aug 23 23:35:13 2017[1,104]<stderr>:+ echo '[./common.sh : 399] [kill_pserver2_exit]'
Wed Aug 23 23:35:13 2017[1,104]<stderr>:[./common.sh : 399] [kill_pserver2_exit]
Wed Aug 23 23:35:13 2017[1,104]<stderr>:+ echo '[FATAL]: paddle_trainer failed kill paddle_pserver2 and exit'
Wed Aug 23 23:35:13 2017[1,104]<stderr>:[FATAL]: paddle_trainer failed kill paddle_pserver2 and exit
Wed Aug 23 23:35:13 2017[1,104]<stderr>:+ get_stack
Wed Aug 23 23:35:13 2017[1,104]<stderr>:+ set +x