diff --git a/chapter_natural-language-processing/glove.md b/chapter_natural-language-processing/glove.md index f244a0b5abd54f78fbfd3f5427506f0af0254698..f505e07536bae0bbc6ae1c968e22b6a726d3b958 100644 --- a/chapter_natural-language-processing/glove.md +++ b/chapter_natural-language-processing/glove.md @@ -42,7 +42,7 @@ $$\sum_{i\in\mathcal{V}} \sum_{j\in\mathcal{V}} h(x_{ij}) \left(\boldsymbol{u}_j 我们还可以从另外一个角度来理解GloVe词嵌入。沿用本节前面的符号,$\mathbb{P}(w_j \mid w_i)$表示数据集中以$w_i$为中心词生成背景词$w_j$的条件概率,并记作$p_{ij}$。作为源于某大型语料库的真实例子,以下列举了两组分别以“ice”(“冰”)和“steam”(“蒸汽”)为中心词的条件概率以及它们之间的比值 [1]: |$w_k$=|“solid”|“gas”|“water”|“fashion”| -|--:|:-:|:-:|:-:| +|--:|:-:|:-:|:-:|:-:| |$p_1=\mathbb{P}(w_k\mid$ “ice” $)$|0.00019|0.000066|0.003|0.000017| |$p_2=\mathbb{P}(w_k\mid$ “steam” $)$|0.000022|0.00078|0.0022|0.000018| |$p_1/p_2$|8.9|0.085|1.36|0.96|