PyTorch 中如何使用 torch.nn.Conv2d 层？-JavaScript中文网-JavaScript教程资源分享门户

PyTorch 中如何使用 torch.nn.Conv2d 层？

推荐答案

在 PyTorch 中，torch.nn.Conv2d 是用于二维卷积操作的层。以下是一个简单的示例，展示如何使用 torch.nn.Conv2d：

-- -------------------- ---- -------
------ -----
------ -------- -- --

- -------
---------- - ------------------------ ---------------- -------------- --------- ----------

- ---------- ------------ --------- ------- ------
------------ - -------------- -- --- ---

- -----------
------------- - ------------------------

--------------------------  - -------

在这个示例中，我们定义了一个卷积层，输入通道数为 3，输出通道数为 16，卷积核大小为 3x3，步幅为 1，填充为 1。然后我们创建了一个随机输入张量，并将其传递给卷积层，最后输出了卷积后的张量形状。

本题详细解读

`torch.nn.Conv2d` 参数详解

torch.nn.Conv2d 的主要参数包括：

in_channels (int): 输入图像的通道数。
out_channels (int): 卷积产生的通道数（即卷积核的数量）。
kernel_size (int or tuple): 卷积核的大小。可以是一个整数（表示正方形卷积核）或一个元组（表示矩形卷积核）。
stride (int or tuple, optional): 卷积步幅。默认值为 1。
padding (int or tuple, optional): 输入张量的填充大小。默认值为 0。
padding_mode (str, optional): 填充模式。默认值为 'zeros'，其他选项包括 'reflect'、'replicate' 和 'circular'。
dilation (int or tuple, optional): 卷积核元素之间的间距。默认值为 1。
groups (int, optional): 控制输入和输出之间的连接方式。默认值为 1。
bias (bool, optional): 如果为 True，则添加可学习的偏置。默认值为 True。

输入和输出形状

输入形状: (batch_size, in_channels, height, width)
输出形状: (batch_size, out_channels, height_out, width_out)

其中，height_out 和 width_out 可以通过以下公式计算：

[ \text{height_out} = \left\lfloor \frac{\text{height} + 2 \times \text{padding} - \text{dilation} \times (\text{kernel_size} - 1) - 1}{\text{stride}} + 1 \right\rfloor ]

[ \text{width_out} = \left\lfloor \frac{\text{width} + 2 \times \text{padding} - \text{dilation} \times (\text{kernel_size} - 1) - 1}{\text{stride}} + 1 \right\rfloor ]

示例解释

在示例中，我们定义了一个卷积层，输入通道数为 3，输出通道数为 16，卷积核大小为 3x3，步幅为 1，填充为 1。输入张量的形状为 (1, 3, 32, 32)，经过卷积后，输出张量的形状为 (1, 16, 32, 32)。

注意事项

卷积层的权重和偏置是可学习的参数，可以通过 conv_layer.weight 和 conv_layer.bias 访问。
卷积层的输出形状取决于输入形状、卷积核大小、步幅、填充和扩张率等参数。
在实际应用中，通常会将多个卷积层堆叠起来，形成卷积神经网络（CNN）。

纠错
反馈