test

01e291d5 · xiaowei_xing · a97cf7d9 · 01e291d5

Showing 1 changed file with 3 additions and 3 deletions

docs/10.md docs/10.md +3 -3

--- a/docs/10.md
+++ b/docs/10.md
@@ -572,11 +572,11 @@ $$

 **Solution** $\text{actions}$ has shape $(N\ast T,d_{a})$, $\text{states}$ has shape $(N\ast T,d_{s})$, and $\text{q_values}$ has shape $(N\ast T,1)$.

-$logits = policy.predictions(states)$
+`logits = policy.predictions(states)`

-$\text{negative_likelihoods = tf.nn.softmax_cross_entropy_with_logits(}$
+`negative_likelihoods = tf.nn.softmax_cross_entropy_with_logits(`

-$\quad\quad\text{labels=actions, logits=logits)}$
+`$\quad\quad$ labels=actions, logits=logits)`

 $\color{red}{\text{weighted_negative_likelihoods = tf.multiply(negative_likelihoods, q_values)}}$
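
Review note: the three statements in this hunk compute a per-sample cross-entropy between the taken actions and the policy's softmax output, then scale each sample by its Q-value. Below is a minimal standard-library sketch of that same computation, so the arithmetic can be checked without TensorFlow. It assumes `actions` is one-hot encoded and that `q_values` has been flattened to length $N\ast T$. With the documented shape $(N\ast T,1)$, multiplying against the length-$N\ast T$ output of `tf.nn.softmax_cross_entropy_with_logits` would broadcast to $(N\ast T,N\ast T)$, so a reshape is probably intended. The function names and sample values are illustrative, not taken from the document.

```python
import math


def softmax_cross_entropy(labels, logits):
    """Per-sample cross-entropy between label rows and softmax(logit rows)."""
    losses = []
    for label_row, logit_row in zip(labels, logits):
        # Log-sum-exp with the max subtracted for numerical stability.
        m = max(logit_row)
        log_z = m + math.log(sum(math.exp(x - m) for x in logit_row))
        # -sum_k y_k * log softmax(logits)_k
        losses.append(-sum(y * (x - log_z) for y, x in zip(label_row, logit_row)))
    return losses


def weighted_negative_likelihoods(actions, logits, q_values):
    """Scale each sample's negative log-likelihood by its Q-value."""
    negative_likelihoods = softmax_cross_entropy(actions, logits)
    return [nll * q for nll, q in zip(negative_likelihoods, q_values)]


# Illustrative data: two samples, two discrete actions, uniform logits.
actions = [[1.0, 0.0], [0.0, 1.0]]   # one-hot actions (assumed encoding)
logits = [[0.0, 0.0], [0.0, 0.0]]    # stand-in for policy.predictions(states)
q_values = [2.0, -1.0]               # flattened to length N*T (assumed)

weighted = weighted_negative_likelihoods(actions, logits, q_values)
```

With uniform logits each sample's negative log-likelihood is $\log 2$, so the weighted values are simply the Q-values times $\log 2$.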