炫云云：SelfAttentionMask _ 【IIS7站长之家】

炫云云：SelfAttentionMask

作者：[db:作者] 时间：2021-09-10 10:16

import tensorflow as tf
from utils import tf_utils
from tensorflow.keras.layers import Layer

class SelfAttentionMask(Layer):
    """Create 3D attention mask from a 2D tensor mask.
    
    inputs[0]: from_tensor: 2D or 3D Tensor of shape [batch_size, from_seq_length, ...].
    inputs[1]: to_mask: int32 Tensor of shape [batch_size, to_seq_length].

    Returns:
        float Tensor of shape [batch_size, from_seq_length, to_seq_length].
    """
    def call(self, inputs):
        from_tensor, to_mask = inputs
        from_shape = tf_utils.get_shape_list(from_tensor, expected_rank=[2, 3])
        batch_size = from_shape[0]
        from_seq_length = from_shape[1]

        to_shape = tf_utils.get_shape_list(to_mask, expected_rank=2)
        to_seq_length = to_shape[1]

        to_mask = tf.cast(
            tf.reshape(to_mask, [batch_size, 1, to_seq_length]),
            dtype=from_tensor.dtype)

        # We don't assume that `from_tensor` is a mask (although it could be). We
        # don't actually care if we attend *from* padding tokens (only *to* padding)
        # tokens so we create a tensor of all ones.
        #
        # `broadcast_ones` = [batch_size, from_seq_length, 1]
        broadcast_ones = tf.ones(
            shape=[batch_size, from_seq_length, 1], dtype=from_tensor.dtype)
        # Here we broadcast along two dimensions to create the mask.
        mask = broadcast_ones * to_mask

        return mask

上一篇：炫云云：MultiHeadAttention、Transformer

下一篇：没有了

立即下载 - IIS7 站长工具包

炫云云：SelfAttentionMask

作者：[db:作者] 时间：2021-09-10 10:16

最新 更多<<

推荐 更多<<

炫云云：SelfAttentionMask

作者：[db:作者] 时间：2021-09-10 10:16

最新 更多<<

推荐 更多<<

最新更多<<

推荐更多<<