test

1260c729 · xiaowei_xing · be842c30 · 1260c729
显示空白变更内容
内联并排

Showing with 4 addition and 2 deletion

docs/10.md docs/10.md +4 -2

未找到文件。
--- a/docs/10.md
+++ b/docs/10.md
@@ -554,6 +554,8 @@ $$

 **练习 6.7** 这里是对离散动作空间使用自动微分来执行最大似然估计的伪代码。

-${\sf logits = policy.predictions(states)}$
+$\text{logits = policy.predictions(states)}$

-${\sf negative_likelihoods = tf.nn.softmax_cross_entropy_with_logits(labels=actions, logits=logits)}$
\ No newline at end of file
+$\text{negative_likelihoods = tf.nn.softmax_cross_entropy_with_logits(}$
+
+$\text{labels=actions, logits=logits)}$
\ No newline at end of file