提交 84cf7836 编写于 作者: Y Yunpeng Li 提交者: A. Unique TensorFlower

Internal change

PiperOrigin-RevId: 487583744
上级 0e3ae4d0
......@@ -252,7 +252,7 @@ def causal_windowed_performer_attention(query_matrix,
attention = numerator / denominator
attention = tf.reshape(attention, new_shape)
start = tf.zeros([len(old_shape)], dtype=old_shape.dtype)
start = tf.zeros([old_shape.shape[0]], dtype=old_shape.dtype)
attention = tf.slice(attention, start, old_shape)
# Queued window cache (drop instead of decay) not yet supported.
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册