提交 58e8fe28 编写于 作者: D Davanoffi Liang 提交者: Hongsheng Zeng

add a episode in quick start to show the final test reward (#37)

* add a episode to show the final test reward

* make code more clear
上级 cdd4622a
......@@ -84,7 +84,7 @@ def main():
batch_reward = calc_discount_norm_reward(reward_list)
agent.learn(batch_obs, batch_action, batch_reward)
if i % 100 == 0:
if (i + 1) % 100 == 0:
all_reward = run_evaluate_episode(env, agent)
logger.info('Test reward: {}'.format(all_reward))
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册