【514】keras Dense 层操作三维数据_人工智能

【514】keras Dense 层操作三维数据

2023-02-09 学习力462

核心提示：参考：Keras API reference / Layers API / Core layers / Dense layer　　语法如下：tf.keras.layers.Dense(units,activation=None,use_bias=True,kernel_initializer="glorot_uniform",bias_initializer="zeros",kernel_regularizer=None,bias_regularizer=

参考：Keras API reference / Layers API / Core layers / Dense layer

　　语法如下：

tf.keras.layers.Dense(
    units,
    activation=None,
    use_bias=True,
    kernel_initializer="glorot_uniform",
    bias_initializer="zeros",
    kernel_regularizer=None,
    bias_regularizer=None,
    activity_regularizer=None,
    kernel_constraint=None,
    bias_constraint=None,
    **kwargs
)

Just your regular densely-connected NN layer.

Dense implements the operation: output = activation(dot(input, kernel) + bias) where activation is the element-wise activation function passed as the activation argument, kernel is a weights matrix created by the layer, and bias is a bias vector created by the layer (only applicable if use_bias is True).

Note: If the input to the layer has a rank greater than 2, then Dense computes the dot product between the inputs and the kernel along the last axis of the inputs and axis 1 of the kernel (using tf.tensordot). For example, if input has dimensions (batch_size, d0, d1), then we create a kernel with shape (d1, units), and the kernel operates along axis 2 of the input, on every sub-tensor of shape (1, 1, d1) (there are batch_size * d0 such sub-tensors). The output in this case will have shape (batch_size, d0, units).

Besides, layer attributes cannot be modified after the layer has been called once (except the trainable attribute).

　　主要是针对高亮的部分进行解读。

　　当 inputs 的数据的秩超过2（这里粗浅的认为是维度）时，Dense 沿着 inputs 的最后一个维度与 kernel 做叉乘。

　　举例：

　　inputs 的维度为 $X=(batch\_size, d_0, d_1)$, kernel 的维度为 $W=(d_1, units)$，因此输出层可以按照如下计算：

$$Y=X \times W$$

　　由此可得，输出维度为 $Y=(batch\_size, d_0, units)$。这个实际上是不难理解的，但是应用到神经网络上就不一样了。

　　相当于最后一个维度 $d_1$ 对 $units$ 做了 $d_0$ 个全连接，同时它们公用一个 kernel，这也就是 Attention 实现的方法，只要对三维的输入做了一个 Dense，就相当于都变成了一个数，也就是 $\alpha$。

【514】keras Dense 层操作三维数据

点赞 0反对 0举报 0

免责声明：本文仅代表作者个人观点，与乐学笔记（本网）无关。其原创性以及文中陈述文字和内容未经本站证实，对本文以及其中全部或者部分内容、文字的真实性、完整性、及时性本站不作任何保证或承诺，请读者仅作参考，并请自行核实相关内容。
本网站有部分内容均转载自其它媒体，转载目的在于传递更多信息，并不代表本网赞同其观点和对其真实性负责，若因作品内容、知识产权、版权和其他问题，请及时提供相关证明等材料并与我们留言联系，本网站将在规定时间内给予删除等相关处理.