当前位置：首页 > wzjs >正文

怎么做的网站收录快北京网站优化推广公司

wzjs 2025/8/29 9:02:49

怎么做的网站收录快,北京网站优化推广公司,男孩做网站,微信网页版登录二维码1、torch.nn.parameter torch.nn.parameter 是 PyTorch 中的一种特殊类型的 tensor，用于表示神经网络中的可学习参数。在 PyTorch 中，可学习参数是模型在训练过程中需要更新的变量，例如全连接层 torch.nn.Linear() 中的参数 weight 和 bias…

1、torch.nn.parameter

torch.nn.parameter 是 PyTorch 中的一种特殊类型的 tensor，用于表示神经网络中的可学习参数。在 PyTorch 中，可学习参数是模型在训练过程中需要更新的变量，例如全连接层 torch.nn.Linear() 中的参数 weight 和 bias 。

官方文档：点击跳转

torch.nn.Parameter 是继承自 torch.Tensor 的子类，其主要作用是作为 nn.Module 中的可训练参数使用。

2、torch.nn.Parameter 与 torch.Tensor 的区别

1）自动添加到模型参数列表中

使用 torch.nn.Parameter 定义的张量会被自动添加到模型的参数列表中，并且可以通过 .parameters() 方法或 .named_parameters() 方法列出。

import torch
import torch.nn as nnclass Layer(nn.Module):def __init__(self):super().__init__()self.weight = torch.nn.Parameter(torch.tensor([1., 2.]))def forward(self, input):return input * self.weightlayer = Layer()
print(layer.weight.requires_grad)  # Truefor name, param in layer.named_parameters():print(name)  # weightprint(param)   # Parameter containing: tensor([1., 2.], requires_grad=True)

普通的 torch.Tensor 对象不会被自动添加到模型的参数列表中，因此不会被 .parameters() 方法或 .named_parameters() 方法列出。

import torch
import torch.nn as nnclass Layer(nn.Module):def __init__(self):super().__init__()self.weight = torch.Tensor([1., 2.])def forward(self, input):return input * self.weightlayer = Layer()
print(layer.weight.requires_grad)  # Falsefor name, param in layer.named_parameters():print(name, param)   # 无输出

2）requires_grad 属性

torch.nn.Parameter 对象的 requires_grad 属性默认为 True，因此它们被视为模型的可训练参数，并在反向传播中进行梯度计算和优化器更新。

import torcha = torch.nn.Parameter(torch.tensor([1., 2., 3.]))
print(a.requires_grad)  # True

普通的 torch.Tensor 对象的 requires_grad 属性默认为 False，可手动设置为等于 True

import torcha = torch.Tensor([1., 2., 3.])
print(a.requires_grad)  # Falsea.requires_grad = True
print(a.requires_grad)  # True

3）自动求导和优化器更新

当 torch.nn.Parameter 对象的 requires_grad 属性为 True 时，它们会参与自动求导，并且可以被优化器自动更新。

import torch
import torch.nn as nn
import torch.optim as optimclass Model(nn.Module):def __init__(self):super().__init__()self.weight = torch.nn.Parameter(torch.tensor([1., 2.]))def forward(self, input):return input * self.weight# 定义网络
model = Model()# 查看更新前的权重
print(model.weight)   # Parameter containing: tensor([1., 2.], requires_grad=True)# 前向传播
output = model(2)# 定义损失函数，并计算损失
criterion = nn.CrossEntropyLoss()
loss = criterion(output, torch.tensor([3., 6.]))# 定义优化器并反向传播
optimizer = optim.SGD(model.parameters(), lr=0.01)
optimizer.zero_grad()
loss.backward()
optimizer.step()# 查看更新后的权重（查看参数是否更新）
print(model.weight)   # Parameter containing: tensor([1.0385, 1.9615], requires_grad=True)

对于普通的 torch.Tensor 对象，即使将其 requires_grad 属性设置为 True，它们也不会被自动添加到模型参数中，也不会被优化器自动更新。

3、举例说明

以 torch.nn.Linear 为例，观察在 nn.Module 类中，是如何使用 nn.Parameter 来对参数进行初始化的

删减版（简版）

import torch
import torch.nn as nn
import torch.nn.functional as Fclass Linear(nn.Module):def __init__(self, in_features: int, out_features: int):super().__init__()self.in_features = in_featuresself.out_features = out_featuresself.weight = torch.nn.Parameter(torch.empty((out_features, in_features)))self.bias = torch.nn.Parameter(torch.empty(out_features))def forward(self, input):return F.linear(input, self.weight, self.bias)layer = Linear(3, 5)
output = layer(torch.rand(1, 3))
print(output.shape)   # torch.Size([1, 5])

实际版本

class Linear(Module):r"""Applies a linear transformation to the incoming data: :math:`y = xA^T + b`This module supports :ref:`TensorFloat32<tf32_on_ampere>`.On certain ROCm devices, when using float16 inputs this module will use :ref:`different precision<fp16_on_mi200>` for backward.Args:in_features: size of each input sampleout_features: size of each output samplebias: If set to ``False``, the layer will not learn an additive bias.Default: ``True``Shape:- Input: :math:`(*, H_{in})` where :math:`*` means any number ofdimensions including none and :math:`H_{in} = \text{in\_features}`.- Output: :math:`(*, H_{out})` where all but the last dimensionare the same shape as the input and :math:`H_{out} = \text{out\_features}`.Attributes:weight: the learnable weights of the module of shape:math:`(\text{out\_features}, \text{in\_features})`. The values areinitialized from :math:`\mathcal{U}(-\sqrt{k}, \sqrt{k})`, where:math:`k = \frac{1}{\text{in\_features}}`bias:   the learnable bias of the module of shape :math:`(\text{out\_features})`.If :attr:`bias` is ``True``, the values are initialized from:math:`\mathcal{U}(-\sqrt{k}, \sqrt{k})` where:math:`k = \frac{1}{\text{in\_features}}`Examples::>>> m = nn.Linear(20, 30)>>> input = torch.randn(128, 20)>>> output = m(input)>>> print(output.size())torch.Size([128, 30])"""__constants__ = ['in_features', 'out_features']in_features: intout_features: intweight: Tensordef __init__(self, in_features: int, out_features: int, bias: bool = True,device=None, dtype=None) -> None:factory_kwargs = {'device': device, 'dtype': dtype}super().__init__()self.in_features = in_featuresself.out_features = out_featuresself.weight = Parameter(torch.empty((out_features, in_features), **factory_kwargs))if bias:self.bias = Parameter(torch.empty(out_features, **factory_kwargs))else:self.register_parameter('bias', None)self.reset_parameters()def reset_parameters(self) -> None:# Setting a=sqrt(5) in kaiming_uniform is the same as initializing with# uniform(-1/sqrt(in_features), 1/sqrt(in_features)). For details, see# https://github.com/pytorch/pytorch/issues/57109init.kaiming_uniform_(self.weight, a=math.sqrt(5))if self.bias is not None:fan_in, _ = init._calculate_fan_in_and_fan_out(self.weight)bound = 1 / math.sqrt(fan_in) if fan_in > 0 else 0init.uniform_(self.bias, -bound, bound)def forward(self, input: Tensor) -> Tensor:return F.linear(input, self.weight, self.bias)def extra_repr(self) -> str:return 'in_features={}, out_features={}, bias={}'.format(self.in_features, self.out_features, self.bias is not None)

从上面代码可以看到， Linear 在初始化时，weights 和 bias 都是使用 torch.nn.Parameter() 来生成的，也就是下面这两行代码：

self.weight = torch.nn.Parameter(torch.Tensor(out_features, in_features))
self.bias = torch.nn.Parameter(torch.Tensor(out_features))

查看全文

http://www.dtcms.com/wzjs/530069.html

免费个人网站建设大全外包平台

做网站需要多少sem是什么?

四川网站建设找珊瑚云爱站网收录

网站经营性备案条件在线查网站的ip地址

马鞍山网站开发网页关键词排名优化

网站建设flash建站系统cms

佛山外贸网站建设咨询广州日新增51万人

做网站公司(信科网络)免费行情软件网站下载大全

动态网站开发全流程怎么做神马搜索排名seo

网站建设公司汕头的刚刚地震最新消息今天

现在做网站还用dw做模板了吗专业网站建设

wordpress做论坛网站免费网站推广工具

购物网站建设网站站长统计入口

wordpress关闭错误提示手机网络优化

网站域名备案和icp备案一样么产品网络推广的方法有哪些

商城类网站备案免费开网店免费供货

Wordpress虚拟资源交易aso应用优化

深圳网站设计公司怎么样郑州网站seo推广

手机建立网站的软件百度网盘人工客服

湖北省粮食局网站建设管理系统长沙网站推广公司

成都装修公司前十名宁波seo博客

博客的网站页面设计seo网站推广费用

四川省微信网站建设公青岛百度网站排名

nba最新排名表排名优化关键词公司

网站平台建设是什么学大教育培训机构怎么样

梨树县交通建设网站南宁百度推广排名优化

吉林响应式网站价格黑帽seo优化推广

手机网站制作案例免费建站

做网站怎么赚钱 111镇江百度推广