Pytorch中Softmax和LogSoftmax的使用详解

一、函数解释

1.Softmax函数常用的用法是指定参数dim就可以：

（1）dim=0：对每一列的所有元素进行softmax运算，并使得每一列所有元素和为1。

（2）dim=1：对每一行的所有元素进行softmax运算，并使得每一行所有元素和为1。

 class Softmax(Module): r"""Applies the Softmax function to an n-dimensional input Tensor rescaling them so that the elements of the n-dimensional output Tensor lie in the range [0,1] and sum to 1. Softmax is defined as: .. math:: \text{Softmax}(x_{i}) = \frac{\exp(x_i)}{\sum_j \exp(x_j)} Shape: - Input: :math:`(*)` where `*` means, any number of additional dimensions - Output: :math:`(*)`, same shape as the input Returns: a Tensor of the same dimension and shape as the input with values in the range [0, 1] Arguments: dim (int): A dimension along which Softmax will be computed (so every slice along dim will sum to 1). .. note:: This module doesn't work directly with NLLLoss, which expects the Log to be computed between the Softmax and itself. Use `LogSoftmax` instead (it's faster and has better numerical properties). Examples:: >>> m = nn.Softmax(dim=1) >>> input = torch.randn(2, 3) >>> output = m(input) """ __constants__ = ['dim'] def __init__(self, dim=None): super(Softmax, self).__init__() self.dim = dim def __setstate__(self, state): self.__dict__.update(state) if not hasattr(self, 'dim'): self.dim = None def forward(self, input): return F.softmax(input, self.dim, _stacklevel=5) def extra_repr(self): return 'dim={dim}'.format(dim=self.dim)

2.LogSoftmax其实就是对softmax的结果进行log，即Log(Softmax(x))

 class LogSoftmax(Module): r"""Applies the :math:`\log(\text{Softmax}(x))` function to an n-dimensional input Tensor. The LogSoftmax formulation can be simplified as: .. math:: \text{LogSoftmax}(x_{i}) = \log\left(\frac{\exp(x_i) }{ \sum_j \exp(x_j)} \right) Shape: - Input: :math:`(*)` where `*` means, any number of additional dimensions - Output: :math:`(*)`, same shape as the input Arguments: dim (int): A dimension along which LogSoftmax will be computed. Returns: a Tensor of the same dimension and shape as the input with values in the range [-inf, 0) Examples:: >>> m = nn.LogSoftmax() >>> input = torch.randn(2, 3) >>> output = m(input) """ __constants__ = ['dim'] def __init__(self, dim=None): super(LogSoftmax, self).__init__() self.dim = dim def __setstate__(self, state): self.__dict__.update(state) if not hasattr(self, 'dim'): self.dim = None def forward(self, input): return F.log_softmax(input, self.dim, _stacklevel=5)

二、代码示例

输入代码

 import torch import torch.nn as nn import numpy as np batch_size = 4 class_num = 6 inputs = torch.randn(batch_size, class_num) for i in range(batch_size): for j in range(class_num): inputs[i][j] = (i + 1) * (j + 1) print("inputs:", inputs)

得到大小batch_size为4，类别数为6的向量（可以理解为经过最后一层得到）

tensor([[ 1., 2., 3., 4., 5., 6.],
[ 2., 4., 6., 8., 10., 12.],
[ 3., 6., 9., 12., 15., 18.],
[ 4., 8., 12., 16., 20., 24.]])

接着我们对该向量每一行进行Softmax

 Softmax = nn.Softmax(dim=1) probs = Softmax(inputs) print("probs:\n", probs)

得到

tensor([[4.2698e-03, 1.1606e-02, 3.1550e-02, 8.5761e-02, 2.3312e-01, 6.3369e-01],
[3.9256e-05, 2.9006e-04, 2.1433e-03, 1.5837e-02, 1.1702e-01, 8.6467e-01],
[2.9067e-07, 5.8383e-06, 1.1727e-04, 2.3553e-03, 4.7308e-02, 9.5021e-01],
[2.0234e-09, 1.1047e-07, 6.0317e-06, 3.2932e-04, 1.7980e-02, 9.8168e-01]])

此外，我们对该向量每一行进行LogSoftmax

 LogSoftmax = nn.LogSoftmax(dim=1) log_probs = LogSoftmax(inputs) print("log_probs:\n", log_probs)

得到

tensor([[-5.4562e+00, -4.4562e+00, -3.4562e+00, -2.4562e+00, -1.4562e+00, -4.5619e-01],
[-1.0145e+01, -8.1454e+00, -6.1454e+00, -4.1454e+00, -2.1454e+00, -1.4541e-01],
[-1.5051e+01, -1.2051e+01, -9.0511e+00, -6.0511e+00, -3.0511e+00, -5.1069e-02],
[-2.0018e+01, -1.6018e+01, -1.2018e+01, -8.0185e+00, -4.0185e+00, -1.8485e-02]])

验证每一行元素和是否为1

 # probs_sum in dim=1 probs_sum = [0 for i in range(batch_size)] for i in range(batch_size): for j in range(class_num): probs_sum[i] += probs[i][j] print(i, "row probs sum:", probs_sum[i])

得到每一行的和，看到确实为1

0 row probs sum: tensor(1.)
1 row probs sum: tensor(1.0000)
2 row probs sum: tensor(1.)
3 row probs sum: tensor(1.)

验证LogSoftmax是对Softmax的结果进行Log

 # to numpy np_probs = probs.data.numpy() print("numpy probs:\n", np_probs) # np.log() log_np_probs = np.log(np_probs) print("log numpy probs:\n", log_np_probs)

得到

numpy probs:
[[4.e-03 1.e-02 3.e-02 8.e-02 2.e-01 6.e-01]
[3.e-05 2.e-04 2.e-03 1.e-02 1.e-01 8.e-01]
[2.e-07 5.e-06 1.e-04 2.e-03 4.e-02 9.e-01]
[2.0e-09 1.e-07 6.0e-06 3.e-04 1.e-02 9.e-01]]
log numpy probs:
[[-5.e+00 -4.e+00 -3.e+00 -2.e+00 -1.e+00 -4.e-01]
[-1.0e+01 -8.e+00 -6.e+00 -4.e+00 -2.e+00 -1.e-01]
[-1.e+01 -1.e+01 -9.0e+00 -6.0e+00 -3.0e+00 -5.e-02]
[-2.0018486e+01 -1.e+01 -1.e+01 -8.0e+00 -4.0e+00 -1.e-02]]

验证完毕

三、整体代码

 import torch import torch.nn as nn import numpy as np batch_size = 4 class_num = 6 inputs = torch.randn(batch_size, class_num) for i in range(batch_size): for j in range(class_num): inputs[i][j] = (i + 1) * (j + 1) print("inputs:", inputs) Softmax = nn.Softmax(dim=1) probs = Softmax(inputs) print("probs:\n", probs) LogSoftmax = nn.LogSoftmax(dim=1) log_probs = LogSoftmax(inputs) print("log_probs:\n", log_probs) # probs_sum in dim=1 probs_sum = [0 for i in range(batch_size)] for i in range(batch_size): for j in range(class_num): probs_sum[i] += probs[i][j] print(i, "row probs sum:", probs_sum[i]) # to numpy np_probs = probs.data.numpy() print("numpy probs:\n", np_probs) # np.log() log_np_probs = np.log(np_probs) print("log numpy probs:\n", log_np_probs)

基于pytorch softmax,logsoftmax 表达

 import torch import numpy as np input = torch.autograd.Variable(torch.rand(1, 3)) print(input) print('softmax={}'.format(torch.nn.functional.softmax(input, dim=1))) print('logsoftmax={}'.format(np.log(torch.nn.functional.softmax(input, dim=1))))

以上为个人经验，希望能给大家一个参考，也希望大家多多支持本网站。

您可能感兴趣的文章:

PyTorch的SoftMax交叉熵损失和梯度用法
浅谈pytorch中torch.max和F.softmax函数的维度解释
PyTorch: Softmax多分类实战操作

Pytorch中Softmax和LogSoftmax的使用详解

一、函数解释

1.Softmax函数常用的用法是指定参数dim就可以：

2.LogSoftmax其实就是对softmax的结果进行log，即Log(Softmax(x))

二、代码示例

三、整体代码

基于pytorch softmax,logsoftmax 表达

2023年最新react面试题总结大全(附详细答案)

python实现自动更换ip的方法

Python中的for循环示例详解

可爱松鼠微信头像图片

Ghost安装器怎么安装Win10-Ghost安装器下安装Win10专业版系统详细图文教程

VUE3使用JSON编辑器的详细图文教程

iphone X如何关闭后台？苹果iphone X关闭软件后台方法介绍

Uint 和 int 的区别解析

Headshot插件如何使用-Headshot插件使用教程

Filecoin(FIL)是什么币？如何挖掘Filecoin

纪念碑谷2第七关怎么玩纪念碑谷2第7关通关图文攻略

天天爱消除 1314520刷分攻略图文教程

怎么抠出公章-Photoshop抠出图片中的公章教程

炉石传说探险者协会橙卡点评分析

AI制作漂亮的宣传海报

愤怒的小鸟英雄传一只小鸟五种玩法高大上的职业介绍

小米pro和小米5哪个好小米pro和小米5区别对比评测

Win11怎么开启远程桌面- Win11远程桌面的四种使用技巧

万网CN域名免费注册的活动注册地址

MSXML是什么意思，什么是MSXML

Pytorch中Softmax和LogSoftmax的使用详解

一、函数解释

1.Softmax函数常用的用法是指定参数dim就可以：

2.LogSoftmax其实就是对softmax的结果进行log，即Log(Softmax(x))

二、代码示例

三、整体代码

基于pytorch softmax,logsoftmax 表达

2023年最新react面试题总结大全(附详细答案)

python实现自动更换ip的方法

Python中的for循环示例详解

可爱松鼠微信头像图片

Ghost安装器怎么安装Win10-Ghost安装器下安装Win10专业版系统详细图文教程

VUE3使用JSON编辑器的详细图文教程

iphone X如何关闭后台？苹果iphone X关闭软件后台方法介绍

Uint 和 int 的区别解析

Headshot插件如何使用-Headshot插件使用教程

Filecoin(FIL)是什么币？如何挖掘Filecoin

纪念碑谷2第七关怎么玩 纪念碑谷2第7关通关图文攻略

天天爱消除 1314520刷分攻略 图文教程

怎么抠出公章-Photoshop抠出图片中的公章教程

炉石传说探险者协会橙卡点评分析

AI制作漂亮的宣传海报

愤怒的小鸟英雄传一只小鸟五种玩法 高大上的职业介绍

小米pro和小米5哪个好 小米pro和小米5区别对比评测

Win11怎么开启远程桌面- Win11远程桌面的四种使用技巧

万网CN域名免费注册的活动注册地址

MSXML是什么意思，什么是MSXML

纪念碑谷2第七关怎么玩纪念碑谷2第7关通关图文攻略

天天爱消除 1314520刷分攻略图文教程

愤怒的小鸟英雄传一只小鸟五种玩法高大上的职业介绍

小米pro和小米5哪个好小米pro和小米5区别对比评测