Skip to content
体验新版
项目
组织
正在加载...
登录
切换导航
打开侧边栏
BaiXuePrincess
PaddleRec
提交
16966dc5
P
PaddleRec
项目概览
BaiXuePrincess
/
PaddleRec
与 Fork 源项目一致
Fork自
PaddlePaddle / PaddleRec
通知
1
Star
0
Fork
0
代码
文件
提交
分支
Tags
贡献者
分支图
Diff
Issue
0
列表
看板
标记
里程碑
合并请求
0
Wiki
0
Wiki
分析
仓库
DevOps
项目成员
Pages
P
PaddleRec
项目概览
项目概览
详情
发布
仓库
仓库
文件
提交
分支
标签
贡献者
分支图
比较
Issue
0
Issue
0
列表
看板
标记
里程碑
合并请求
0
合并请求
0
Pages
分析
分析
仓库分析
DevOps
Wiki
0
Wiki
成员
成员
收起侧边栏
关闭侧边栏
动态
分支图
创建新Issue
提交
Issue看板
提交
16966dc5
编写于
5月 12, 2020
作者:
T
tangwei
浏览文件
操作
浏览文件
下载
电子邮件补丁
差异文件
add qsub submit
上级
12f161d6
变更
3
隐藏空白更改
内联
并排
Showing
3 changed file
with
39 addition
and
15 deletion
+39
-15
fleet_rec/core/engine/cluster/cluster.py
fleet_rec/core/engine/cluster/cluster.py
+4
-0
models/rank/dnn/backend.yaml
models/rank/dnn/backend.yaml
+6
-5
models/rank/dnn/submit.sh
models/rank/dnn/submit.sh
+29
-10
未找到文件。
fleet_rec/core/engine/cluster/cluster.py
浏览文件 @
16966dc5
...
@@ -58,6 +58,10 @@ class ClusterEngine(Engine):
...
@@ -58,6 +58,10 @@ class ClusterEngine(Engine):
role
=
envs
.
get_runtime_environ
(
"engine_role"
)
role
=
envs
.
get_runtime_environ
(
"engine_role"
)
if
role
==
"MASTER"
:
if
role
==
"MASTER"
:
worker_script
=
{}
worker_script
[
"engine_worker"
]
=
self
.
job_script
envs
.
set_runtime_environs
(
worker_script
)
self
.
start_master_procs
()
self
.
start_master_procs
()
elif
role
==
"WORKER"
:
elif
role
==
"WORKER"
:
...
...
models/rank/dnn/backend.yaml
浏览文件 @
16966dc5
...
@@ -16,6 +16,11 @@ engine:
...
@@ -16,6 +16,11 @@ engine:
workspace
:
"
fleetrec.models.rank.dnn"
workspace
:
"
fleetrec.models.rank.dnn"
backend
:
"
MPI"
backend
:
"
MPI"
hdfs
:
name
:
"
hdfs://nmg01-taihang-hdfs.dmop.baidu.com:54310"
ugi
:
"
fcr,SaK2VqfEDeXzKPor"
output
:
"
/app/ecom/fcr/fanyabo/wadstyleimageq/tangwei12/output_1/"
package
:
package
:
build_script
:
"
{workspace}/package.sh"
build_script
:
"
{workspace}/package.sh"
python
:
"
/home/tangwei/fleet_rec_env/cpython-2.7.11-ucs4"
python
:
"
/home/tangwei/fleet_rec_env/cpython-2.7.11-ucs4"
...
@@ -23,11 +28,7 @@ engine:
...
@@ -23,11 +28,7 @@ engine:
submit
:
submit
:
hpc
:
"
/home/tangwei/submit-tieba/smart_client/"
hpc
:
"
/home/tangwei/submit-tieba/smart_client/"
hdfs
:
"
xx"
qconf
:
"
/home/tangwei/Plines/imageq/package/my_conf/para.conf"
hout
:
"
xxx"
ugi
:
"
xxxx"
nodes
:
10
nodes
:
10
before_hook
:
"
"
end_hook
:
"
"
scrpit
:
"
{workspace}/submit.sh"
scrpit
:
"
{workspace}/submit.sh"
\ No newline at end of file
models/rank/dnn/submit.sh
浏览文件 @
16966dc5
...
@@ -22,6 +22,19 @@ function vars_get_from_env() {
...
@@ -22,6 +22,19 @@ function vars_get_from_env() {
function
package
()
{
function
package
()
{
g_run_stage
=
"package"
g_run_stage
=
"package"
temp
=
${
engine_temp_path
}
echo
"package temp dir: "
${
temp
}
cp
${
engine_worker
}
${
temp
}
echo
"copy job.sh from "
${
engine_worker
}
" to "
${
temp
}
mkdir
${
temp
}
/python
cp
-r
${
engine_package_python
}
/
*
${
temp
}
/python/
echo
"copy python from "
${
engine_package_python
}
" to "
${
temp
}
mkdir
${
temp
}
/whl
cp
${
engine_package_paddlerec
}
${
temp
}
/whl/
echo
"copy "
${
engine_package_paddlerec
}
" to "
${
temp
}
"/whl/"
}
}
#-----------------------------------------------------------------------------------------------------------------
#-----------------------------------------------------------------------------------------------------------------
...
@@ -50,20 +63,26 @@ function after_submit() {
...
@@ -50,20 +63,26 @@ function after_submit() {
function
submit
()
{
function
submit
()
{
g_run_stage
=
"submit"
g_run_stage
=
"submit"
before_submit
g_job_name
=
"paddle_rec_mpi"
g_hdfs_path
=
$g_hdfs_path
g_job_entry
=
"worker.sh"
${
g_hpc_path
}
/bin/qsub_f
\
${
$engine_submit_hpc
}
/bin/qsub_f
\
-N
${
g_job_name
}
\
-N
${
g_job_name
}
\
--conf
${
g_qsub_
conf
}
\
--conf
${
engine_submit_q
conf
}
\
--hdfs
${
g_hdfs_path
}
\
--hdfs
${
engine_hdfs_name
}
\
--ugi
${
g
_hdfs_ugi
}
\
--ugi
${
engine
_hdfs_ugi
}
\
--hout
${
g
_hdfs_output
}
\
--hout
${
engine
_hdfs_output
}
\
--files
${
g_submit_package
}
\
--files
${
engine_temp_path
}
\
-l
nodes
=
${
g_job
_nodes
}
,walltime
=
1000:00:00,resource
=
full
${
g_job_entry
}
-l
nodes
=
${
engine_submit
_nodes
}
,walltime
=
1000:00:00,resource
=
full
${
g_job_entry
}
after_submit
}
}
function
main
()
{
function
main
()
{
echo
"run submit done"
package
before_submit
submit
after_submit
}
}
编辑
预览
Markdown
is supported
0%
请重试
或
添加新附件
.
添加附件
取消
You are about to add
0
people
to the discussion. Proceed with caution.
先完成此消息的编辑!
取消
想要评论请
注册
或
登录