Getting Started with PyTorch (3) ---- Using the nn Package


Comparing PyTorch with (Lua) Torch: PyTorch replaces all of Torch's Containers with autograd, so there is no need for ConcatTable, CAddTable and the like; plain symbolic operations do the job.
output = nn.CAddTable():forward({input1, input2}) simply becomes output = input1 + input2. Much simpler.
A PyTorch module only carries its learnable parameters, .weight and .bias; the per-module state that Torch kept around, .gradInput and .output, is gone.
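A minimal sketch of that difference (the tensor shapes here are just illustrative; the old Variable API is used to match the rest of this post):

import torch
from torch.autograd import Variable
import torch.nn as nn

# element-wise addition with a plain operator, no CAddTable needed
input1 = Variable(torch.randn(2, 3))
input2 = Variable(torch.randn(2, 3))
output = input1 + input2

# a module exposes only its learnable parameters
conv = nn.Conv2d(1, 10, 5)
print(conv.weight.size())  # torch.Size([10, 1, 5, 5])
print(conv.bias.size())    # torch.Size([10])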


Example:

import torch
from torch.autograd import Variable
import torch.nn as nn
import torch.nn.functional as F


class MNISTConvNet(nn.Module):

    def __init__(self):
        # this is the place where you instantiate all your modules
        # you can later access them using the same names you've given them
        # in here
        super(MNISTConvNet, self).__init__()
        self.conv1 = nn.Conv2d(1, 10, 5)
        self.pool1 = nn.MaxPool2d(2, 2)
        self.conv2 = nn.Conv2d(10, 20, 5)
        self.pool2 = nn.MaxPool2d(2, 2)
        self.fc1 = nn.Linear(320, 50)
        self.fc2 = nn.Linear(50, 10)

    # it's the forward function that defines the network structure
    # we're accepting only a single input in here, but if you want,
    # feel free to use more
    def forward(self, input):
        x = self.pool1(F.relu(self.conv1(input)))
        x = self.pool2(F.relu(self.conv2(x)))

        # the flexibility below is striking:
        # in your model definition you can go full crazy and use arbitrary
        # python code to define your model structure
        # all these are perfectly legal, and will be handled correctly
        # by autograd:
        # if x.gt(0) > x.numel() / 2:
        #     ...
        #
        # you can even do a loop and reuse the same module inside it
        # modules no longer hold ephemeral state, so you can use them
        # multiple times during your forward pass
        # while x.norm(2) < 10:
        #     x = self.conv1(x)

        x = x.view(x.size(0), -1)
        x = F.relu(self.fc1(x))
        x = F.relu(self.fc2(x))
        return x
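A minimal usage sketch (assuming MNIST-sized 1 x 28 x 28 inputs; the names net and input are reused by the snippets below):

net = MNISTConvNet()
print(net)

input = Variable(torch.randn(1, 1, 28, 28))  # one sample, one channel, 28x28
out = net(input)
print(out.size())  # torch.Size([1, 10])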

Note: the input to nn.Conv2d must be 4D: nSamples x nChannels x Height x Width. If you only have a single sample, add a fake batch dimension at the front with input.unsqueeze(0).
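For example (a sketch reusing the net defined above):

single = torch.randn(1, 28, 28)          # nChannels x Height x Width, no batch dim
batched = Variable(single.unsqueeze(0))  # now 1 x 1 x 28 x 28
out = net(batched)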

Inspecting the weights of a layer

print(net.conv1.weight.grad)
print(net.conv1.weight.data.norm())  # norm of the weight
print(net.conv1.weight.grad.data.norm())  # norm of the gradients
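The gradient prints above are only meaningful after a backward pass has filled in .grad. A quick sketch that produces gradients first (the target and loss_fn here are illustrative choices, and the same names are assumed by the backward-hook example further down):

target = Variable(torch.LongTensor([3]))  # a made-up class label for the single sample
loss_fn = nn.CrossEntropyLoss()
err = loss_fn(out, target)
err.backward()

print(net.conv1.weight.grad.data.norm())  # now the gradient norm is available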

Inspecting a layer's output and grad_output

We have already seen how to inspect a layer's weights and their gradients. To inspect a layer's output and grad_output during the forward and backward passes, you need hooks.

Forward hook

def printnorm(self, input, output):
    # input is a tuple of packed inputs
    # output is a Variable. output.data is the Tensor we are interested in
    print('Inside ' + self.__class__.__name__ + ' forward')
    print('')
    print('input: ', type(input))
    print('input[0]: ', type(input[0]))
    print('output: ', type(output))
    print('')
    print('input size:', input[0].size())
    print('output size:', output.data.size())
    print('output norm:', output.data.norm())


net.conv2.register_forward_hook(printnorm)
out = net(input)
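register_forward_hook returns a handle, so the hook can be detached again once the printing is no longer wanted. A small sketch (for clarity, assume the hook has not already been registered on net.conv2):

h = net.conv2.register_forward_hook(printnorm)
out = net(input)   # the hook fires and prints the norms
h.remove()         # detach the hook
out = net(input)   # runs silently again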

Backward hook

def printgradnorm(self, grad_input, grad_output):
    print('Inside ' + self.__class__.__name__ + ' backward')
    print('Inside class:' + self.__class__.__name__)
    print('')
    print('grad_input: ', type(grad_input))
    print('grad_input[0]: ', type(grad_input[0]))
    print('grad_output: ', type(grad_output))
    print('grad_output[0]: ', type(grad_output[0]))
    print('')
    print('grad_input size:', grad_input[0].size())
    print('grad_output size:', grad_output[0].size())
    print('grad_input norm:', grad_input[0].data.norm())


net.conv2.register_backward_hook(printgradnorm)
out = net(input)
err = loss_fn(out, target)
err.backward()