提交 fef4c1b1 编写于 作者: X xiaowei_xing

test

上级 a32b9123
......@@ -83,4 +83,5 @@ $$
$$
$$
= \mathbb{E}_ {\tau\sim\pi_{\theta} []
\ No newline at end of file
= \mathbb{E}_ {\tau\sim\pi_{\theta} [\sum_{t=1}^{T}(\nabla_{\theta}(\log\pi_{\theta}(a_t|s_t))(\sum_{t=1}^{T}\gamma^t r(s_t,a_t)))]
$$
\ No newline at end of file
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册