Skip to content
体验新版
项目
组织
正在加载...
登录
切换导航
打开侧边栏
机器未来
Paddle
提交
d5d0f7e8
P
Paddle
项目概览
机器未来
/
Paddle
与 Fork 源项目一致
Fork自
PaddlePaddle / Paddle
通知
1
Star
1
Fork
0
代码
文件
提交
分支
Tags
贡献者
分支图
Diff
Issue
1
列表
看板
标记
里程碑
合并请求
0
Wiki
0
Wiki
分析
仓库
DevOps
项目成员
Pages
P
Paddle
项目概览
项目概览
详情
发布
仓库
仓库
文件
提交
分支
标签
贡献者
分支图
比较
Issue
1
Issue
1
列表
看板
标记
里程碑
合并请求
0
合并请求
0
Pages
分析
分析
仓库分析
DevOps
Wiki
0
Wiki
成员
成员
收起侧边栏
关闭侧边栏
动态
分支图
创建新Issue
提交
Issue看板
提交
d5d0f7e8
编写于
12月 16, 2016
作者:
L
livc
浏览文件
操作
浏览文件
下载
电子邮件补丁
差异文件
modify markdown format
上级
202b2e75
变更
1
隐藏空白更改
内联
并排
Showing
1 changed file
with
8 addition
and
8 deletion
+8
-8
doc/howto/usage/cluster/cluster_train_en.md
doc/howto/usage/cluster/cluster_train_en.md
+8
-8
未找到文件。
doc/howto/usage/cluster/cluster_train_en.md
浏览文件 @
d5d0f7e8
...
...
@@ -55,16 +55,16 @@ At last your workspace should look like as follow:
```
Not all of these files are needed for cluster training, but it's not necessary to remove useless files.
`
``
trainer_config.py``
`
`
trainer_config.py
`
Indicates the model config file.
`
``
train.list
``` and ```
test.list
``
`
`
train.list`
and
`test.list
`
File index. It stores all relative or absolute file paths of all train/test data at current node.
`
``
dataprovider.py
``
`
`
dataprovider.py
`
used to read train/test samples. It's same as local training.
`
``
data
``
`
`
data
`
all files in data directory are refered by train.list/test.list which are refered by data provider.
...
...
@@ -139,16 +139,16 @@ The cluster Job will start in several seconds.
### Check Cluster Training Result
Check log in $workspace/log for details, each node owns same log structure.
`
``
paddle_trainer.INFO
``
`
`
paddle_trainer.INFO
`
It provides almost all interal output log for training, same as local training. Check runtime model convergence here.
`
``
paddle_pserver2.INFO
``
`
`
paddle_pserver2.INFO
`
It provides pserver running log, which could help to diagnose distributed error.
`
``
server.log
``
`
`
server.log
`
It provides stderr and stdout of pserver process. Check error log if training crashs.
`
``
train.log
``
`
`
train.log
`
It provides stderr and stdout of trainer process. Check error log if training crashs.
### Check Model Output
...
...
编辑
预览
Markdown
is supported
0%
请重试
或
添加新附件
.
添加附件
取消
You are about to add
0
people
to the discussion. Proceed with caution.
先完成此消息的编辑!
取消
想要评论请
注册
或
登录