提交 e9b90adb 编写于 作者: X xiaowei_xing

test

上级 e78d9faf
......@@ -160,7 +160,7 @@ $$
$$
$$
= \mathbb{E}_ {s_{0:t},a_{0:(t-1)}} [b(s_t) \mathbb{E}_{a_t}[\nabla_{\theta}\log \pi_{\theta}(a_t|s_t)]]
= \mathbb{E}_ {s_{0:t},a_{0:(t-1)}} [b(s_t) \mathbb{E}_ {a_t}[\nabla_{\theta}\log \pi_{\theta}(a_t|s_t)]]
$$
$$
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册