add a episode in quick start to show the final test reward (#37)

* add a episode to show the final test reward * make code more clear

add a episode in quick start to show the final test reward (#37)
* add a episode to show the final test reward * make code more clear
58e8fe28 · Davanoffi Liang · Hongsheng Zeng · cdd4622a · 58e8fe28
隐藏空白更改
内联并排

Showing with 1 addition and 1 deletion

examples/QuickStart/train.py examples/QuickStart/train.py +1 -1

未找到文件。
--- a/examples/QuickStart/train.py
+++ b/examples/QuickStart/train.py
@@ -84,7 +84,7 @@ def main():
        batch_reward = calc_discount_norm_reward(reward_list)

        agent.learn(batch_obs, batch_action, batch_reward)
-        if i % 100 == 0:
+        if (i + 1) % 100 == 0:
            all_reward = run_evaluate_episode(env, agent)
            logger.info('Test reward: {}'.format(all_reward))