Commit e9b90adb, authored by xiaowei_xing

test

Parent e78d9faf
@@ -160,7 +160,7 @@ $$
 $$
 $$
-= \mathbb{E}_ {s_{0:t},a_{0:(t-1)}} [b(s_t) \mathbb{E}_{a_t}[\nabla_{\theta}\log \pi_{\theta}(a_t|s_t)]]
+= \mathbb{E}_ {s_{0:t},a_{0:(t-1)}} [b(s_t) \mathbb{E}_ {a_t}[\nabla_{\theta}\log \pi_{\theta}(a_t|s_t)]]
 $$
 $$