提交 a32b9123 编写于 作者: X xiaowei_xing

test

上级 d8d1e298
......@@ -79,5 +79,8 @@ $$
$$
$$
= \mathbb{E}_ {\tau\sim\pi_{\theta} [\nabla_{\theta}[\sum_{t=1}^{T}(\log\pi_{\theta}(a_t|s_t))]r(\tau)]
$$
\ No newline at end of file
= \mathbb{E}_ {\tau\sim\pi_{\theta}} [\nabla_{\theta} [\sum_{t=1}^{T}(\log\pi_{\theta}(a_t|s_t))] r(\tau)]
$$
$$
= \mathbb{E}_ {\tau\sim\pi_{\theta} []
\ No newline at end of file
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册