Skip to content
体验新版
项目
组织
正在加载...
登录
切换导航
打开侧边栏
BaiXuePrincess
PaddleRec
提交
7c10b488
P
PaddleRec
项目概览
BaiXuePrincess
/
PaddleRec
与 Fork 源项目一致
Fork自
PaddlePaddle / PaddleRec
通知
1
Star
0
Fork
0
代码
文件
提交
分支
Tags
贡献者
分支图
Diff
Issue
0
列表
看板
标记
里程碑
合并请求
0
Wiki
0
Wiki
分析
仓库
DevOps
项目成员
Pages
P
PaddleRec
项目概览
项目概览
详情
发布
仓库
仓库
文件
提交
分支
标签
贡献者
分支图
比较
Issue
0
Issue
0
列表
看板
标记
里程碑
合并请求
0
合并请求
0
Pages
分析
分析
仓库分析
DevOps
Wiki
0
Wiki
成员
成员
收起侧边栏
关闭侧边栏
动态
分支图
创建新Issue
提交
Issue看板
提交
7c10b488
编写于
5月 12, 2020
作者:
T
tangwei
浏览文件
操作
浏览文件
下载
电子邮件补丁
差异文件
add qsub submit
上级
16966dc5
变更
4
隐藏空白更改
内联
并排
Showing
4 changed file
with
20 addition
and
16 deletion
+20
-16
fleet_rec/core/engine/cluster/cluster.py
fleet_rec/core/engine/cluster/cluster.py
+0
-5
models/rank/dnn/backend.yaml
models/rank/dnn/backend.yaml
+4
-3
models/rank/dnn/submit.sh
models/rank/dnn/submit.sh
+12
-7
models/rank/dnn/worker.sh
models/rank/dnn/worker.sh
+4
-1
未找到文件。
fleet_rec/core/engine/cluster/cluster.py
浏览文件 @
7c10b488
...
...
@@ -28,7 +28,6 @@ class ClusterEngine(Engine):
def
__init_impl__
(
self
):
abs_dir
=
os
.
path
.
dirname
(
os
.
path
.
abspath
(
__file__
))
self
.
submit_script
=
os
.
path
.
join
(
abs_dir
,
"master.sh"
)
self
.
job_script
=
os
.
path
.
join
(
abs_dir
,
"worker.sh"
)
def
start_worker_procs
(
self
):
default_env
=
os
.
environ
.
copy
()
...
...
@@ -58,10 +57,6 @@ class ClusterEngine(Engine):
role
=
envs
.
get_runtime_environ
(
"engine_role"
)
if
role
==
"MASTER"
:
worker_script
=
{}
worker_script
[
"engine_worker"
]
=
self
.
job_script
envs
.
set_runtime_environs
(
worker_script
)
self
.
start_master_procs
()
elif
role
==
"WORKER"
:
...
...
models/rank/dnn/backend.yaml
浏览文件 @
7c10b488
...
...
@@ -27,8 +27,9 @@ engine:
paddlerec
:
"
/home/tangwei/fleet_rec_env/FleetRec"
submit
:
hpc
:
"
/home/tangwei/
submit-tieba/smart_client
/"
qconf
:
"
/home/tangwei/Plines/imageq/
package/my_conf/para
.conf"
hpc
:
"
/home/tangwei/
Plines/client/smart_client_khan
/"
qconf
:
"
/home/tangwei/Plines/imageq/
qsub_f
.conf"
nodes
:
10
scrpit
:
"
{workspace}/submit.sh"
\ No newline at end of file
submit_scrpit
:
"
{workspace}/submit.sh"
job_scrpit
:
"
{workspace}/worker.sh"
models/rank/dnn/submit.sh
浏览文件 @
7c10b488
...
...
@@ -25,15 +25,16 @@ function package() {
temp
=
${
engine_temp_path
}
echo
"package temp dir: "
${
temp
}
cp
${
engine_worker
}
${
temp
}
cp
${
engine_job_scrpit
}
${
temp
}
cp
${
engine_submit_qconf
}
${
temp
}
echo
"copy job.sh from "
${
engine_worker
}
" to "
${
temp
}
mkdir
${
temp
}
/python
cp
-r
${
engine_package_python
}
/
*
${
temp
}
/python/
mkdir
-p
${
temp
}
/package
/python
cp
-r
${
engine_package_python
}
/
*
${
temp
}
/p
ackage/p
ython/
echo
"copy python from "
${
engine_package_python
}
" to "
${
temp
}
mkdir
${
temp
}
/whl
cp
${
engine_package_paddlerec
}
${
temp
}
/whl/
mkdir
${
temp
}
/
package/
whl
cp
${
engine_package_paddlerec
}
${
temp
}
/
package/
whl/
echo
"copy "
${
engine_package_paddlerec
}
" to "
${
temp
}
"/whl/"
}
...
...
@@ -68,7 +69,11 @@ function submit() {
g_job_entry
=
"worker.sh"
${
$engine_submit_hpc
}
/bin/qsub_f
\
engine_hdfs_output
=
${
engine_hdfs_output
}
/
`
date
+%Y%m%d%H%M%S
`
cd
${
engine_temp_path
}
${
engine_submit_hpc
}
/bin/qsub_f
\
-N
${
g_job_name
}
\
--conf
${
engine_submit_qconf
}
\
--hdfs
${
engine_hdfs_name
}
\
...
...
@@ -85,4 +90,4 @@ function main() {
before_submit
submit
after_submit
}
}
\ No newline at end of file
fleet_rec/core/engine/cluster
/worker.sh
→
models/rank/dnn
/worker.sh
浏览文件 @
7c10b488
...
...
@@ -49,7 +49,10 @@ function user_define_variables() {
}
function
job
()
{
mpirun
-npernode
2
-timestamp-output
-tag-output
-machinefile
${
PBS_NODEFILE
}
python
-u
${
g_job_entry
}
echo
"job"
g_run_stage
=
"job"
# mpirun -npernode 2 -timestamp-output -tag-output -machinefile ${PBS_NODEFILE} python -u ${g_job_entry}
}
function
main
()
{
...
...
编辑
预览
Markdown
is supported
0%
请重试
或
添加新附件
.
添加附件
取消
You are about to add
0
people
to the discussion. Proceed with caution.
先完成此消息的编辑!
取消
想要评论请
注册
或
登录