提交 5b92f9f0 编写于 作者: S Scott Zhu 提交者: A. Unique TensorFlower

Prepare for upcoming keras initializer change.

PiperOrigin-RevId: 451475383
上级 7278d89b
......@@ -58,15 +58,16 @@ class ReplacedTokenDetectionHead(tf.keras.layers.Layer):
intermediate_activation=self.activation,
dropout_rate=self.hidden_cfg['dropout_rate'],
attention_dropout_rate=self.hidden_cfg['attention_dropout_rate'],
kernel_initializer=self.initializer,
kernel_initializer=tf_utils.clone_initializer(self.initializer),
name='transformer/layer_%d_rtd' % i))
self.dense = tf.keras.layers.Dense(
self.hidden_size,
activation=self.activation,
kernel_initializer=self.initializer,
kernel_initializer=tf_utils.clone_initializer(self.initializer),
name='transform/rtd_dense')
self.rtd_head = tf.keras.layers.Dense(
units=1, kernel_initializer=self.initializer,
units=1,
kernel_initializer=tf_utils.clone_initializer(self.initializer),
name='transform/rtd_head')
if output not in ('predictions', 'logits'):
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册