使用MindSpore Hub加载预训练模型并迁移

239 阅读 0 评论 158 点赞

我是靠谱客的博主自觉犀牛，这篇文章主要介绍使用MindSpore Hub加载预训练模型并迁移，现在分享给大家，希望可以做个参考。

1、下载安装MindSpore HUB，因为我是MindSpore1.9.0，所以安装1.9.0的MindSpore Hub

pip install https://ms-release.obs.cn-north-4.myhuaweicloud.com/1.9.0/Hub/any/mindspore_hub-1.9.0-py3-none-any.whl --trusted-host ms-release.obs.cn-north-4.myhuaweicloud.com -i https://pypi.tuna.tsinghua.edu.cn/simple

2、使用现有模型进行迁移学习

先附上官方教程

从Hub加载模型 — MindSpore master documentation

一看每个模型都是不同的人写的...写法等全都不一样，用起来也让你头大。先说几个我试了的。

GoogLeNet imagenet2012预训，必须写num_classes，必须等于1000....给这个参数纯属闲的。

model = "mindspore/1.9/googlenet_imagenet2012" #可用 num_classes=1000
network = mshub.load(model,num_classes=1000)
network.set_train(False)

Resnet50，不能用，它会自己去加载参数，但是对应不上。

model = "mindspore/1.9/resnet50_imagenet2012"
network = mshub.load(model)
network.set_train(False)

For 'load_param_into_net', end_point.weight in the argument 'net' should have the same shape as end_point.weight in the argument 'parameter_dict'. But got its shape (10, 2048) in the argument 'net' and shape (1001, 2048) in the argument 'parameter_dict'.May you need to check whether the checkpoint you loaded is correct or the batch size and so on in the 'net' and 'parameter_dict' are same.

VGG16，不能用...这个感觉写的时候就没有写对，反正你不用给任何参数，就给你出错。

model = "mindspore/1.9/vgg16_bn_imagenet2012"
network = mshub.load(model)
network.set_train(False)


nn.Dense(512 * (self.image_h // 32) * (self.image_w // 32), 4096),
TypeError: unsupported operand type(s) for //: 'str' and 'int'

好的，于是，我就使用GoogleNet想做一些迁移工作，于是换了最后一层的线性层，代码如下。

# 加载预训练的GoogleNet模型
model = "mindspore/1.9/googlenet_imagenet2012" #可用 num_classes=1000
network = mshub.load(model, num_classes=1000, include_top=False)# include_top=False 去掉了最后的线性层
# network = mshub.load(model, num_classes=1000)
network.set_train(False)
# 增加一个新的分类层
last_channel = 1024
num_classes = args.num_classes
classification_layer = nn.Dense(last_channel, num_classes)
classification_layer.set_train(True)
train_network = nn.SequentialCell([network, classification_layer])

但是。。。真的太麻烦了，并且还报错，维度的错误，但由于模型文件不在本地，也根本无法调试。我决定放弃了，看来必须自己找到网络的模型，然后尝试加载预训练的参数会好很多....

总结一下，总之，很难用。文档写的也都不对。按照文档写的进行操作也会报错。每个模型应该是包给了不同的人做的，代码风格不一，以至于导致了最后每个模型的使用和体验都很差。