基于自适应信息损失的InfoGAN模型的无监督图像特征提取器设计

杨照宇; 郝建军

0
0
浏览
下载

摘要
关键词
基金信息
论文图表
动态公开评议
相关论文
评论

基于自适应信息损失的InfoGAN模型的无监督图像特征提取器设计

首发时间：2019-04-04

杨照宇 ¹
杨照宇（1994-），男，硕士研究生，主要研究方向：深度学习，机器视觉
郝建军 ¹
郝建军（1969-），男，副教授、硕导，主要研究方向：大数据，机器视觉

1、北京邮电大学信息与通信工程学院

摘要：无监督地训练特征向量提取器是图像特征提取的一大重要研究领域，将有效的图像特征描述向量应用在后序的图像分类，检索等任务中可以避免维度灾难，提高任务预测准确率等效果。其中，基于GAN网络的InfoGAN模型在无监督解耦数据的特征任务上有着极大的优势，因为该模型不仅可以通过无监督训练来得到高维图像的特征提取器，而且还能由模型中生成模块来生成不同特征向量对应的图像，即可以从生成的图像结果可视化提取特征的类别。但该模型只能适用于低维度的隐向量来提取输入图像的特征，维度设置越小从图像中分离出来的特征量也会越少，因此该模型只能描述输入数据的部分特征信息，其训练得到的输出的特征无法应用于后序的复杂图像处理任务，这也就极大地限制了模型的适用范围。在本文中，通过优化改进原模型中的信息损失函数，使用最大似然替换原模型中基于均方误差的信息损失函数，经过改进的模型可以在高维的隐向量上实现稳定收敛，从而可以解耦输入数据更多的信息，突破了原有模型特征描述时维数上的限制，保证了模型可以提取到输入数据的更多维的特征表达。在实验验证部分，将改进的模型和原始模型在MNIST数据集上进行对比实验，实验结果显示原模型在训练高维特征时出现训练崩溃，无法收敛的问题，但改进后的模型可以稳定地收敛并能输出有效的图像特征表达。

关键词：信息处理技术特征提取无监督学习生成对抗网络最大似然损失

For information in English, please click here

Design of unsupervised feature extractor based on InfoGAN with adaptive Info-loss

Yang ZhaoYu ¹
杨照宇（1994-），男，硕士研究生，主要研究方向：深度学习，机器视觉
Hao JianJun ¹
郝建军（1969-），男，副教授、硕导，主要研究方向：大数据，机器视觉

1、School of Information and Communication Engineering, Beijing University of Posts and Telecommunications 100876

Abstract：Unsupervised training of feature vector extractors is an important research field in image processing. The application of effective image feature description vectors in the following tasks, such as image classification and retrieval, can avoid dimension disasters and improve the accuracy of prediction of the tasks. Among many methods, InfoGAN model based on GAN network has great advantages in unsupervised learning to disentangle data features: this model can not only get feature extractors of high-dimensional images through unsupervised training, but also generate images corresponding to different feature vectors by generating module in the model, which can visually distinguish feature categories from the results of generated images. However, this model can only be applied to low-dimensional latent codes to extract the features of the input image, and the smaller the dimension setting, the less the features be got from the image, so the trained model can only describe part of the feature information of the input data, and the output features are unable to be used in the complex image processing tasks, this shortage greatly limits the application range of the model. In this paper, we optimize and improve the information loss function in the original model, and the maximum likelihood is used to replace the original information loss function based on the mean square error in the original model. So, the improved model can achieve stable convergence in the high-dimensional vector, which can disentangle more information from the input data, and be free of the limitation of dimension of the original model to make sure that the model can be used in complex tasks. In the experiment, the improved model and the original model are compared on the MINIST data set. The experimental results show that the original model has training collapse and unable to converge when training with high-dimensional latent codes, but the improved model can converge steadily and output valid image feature expression.

Keywords： information processing technology feature extraction unsupervised learning generative adversarial networks maximum likelihood error

基金：

论文图表：

引用

导出参考文献

.txt

.ris

.doc

杨照宇，郝建军. 基于自适应信息损失的InfoGAN模型的无监督图像特征提取器设计[EB/OL]. 北京：中国科技论文在线 [2019-04-04]. https://www.paper.edu.cn/releasepaper/content/201904-70.

No.****

动态公开评议

共计0人参与

动态评论进行中

全部评论

0/1000

论文编号	201904-70
论文题目	基于自适应信息损失的InfoGAN模型的无监督图像特征提取器设计
文献类型
收录期刊	上传封面中文期刊英文期刊期刊名称（中文）期刊名称（英文）年，卷（）上传封面中文专著英文专著书名（中文）书名（英文）出版地出版社出版年上传封面中文译著英文译著书名（中文）书名（英文）出版地出版社出版年上传封面中文论文集英文论文集编者.论文集名称（中文） [c]. 出版地出版社出版年， - 编者.论文集名称（英文） [c]. 出版地出版社出版年，- 上传封面中文文献英文文献期刊名称（中文）期刊名称（英文）日期-- 在线地址http:// 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期-- 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期--
英文作者写法：中外文作者均姓前名后，姓大写，名的第一个字母大写，姓全称写出，名可只写第一个字母，其后不加实心圆点“.”, 作者之间用逗号“，”分隔，最后为实心圆点“.”, 示例1：原姓名写法：Albert Einstein,编入参考文献时写法：Einstein A. 示例2：原姓名写法：李时珍；编入参考文献时写法：LI S Z. 示例3：YELLAND R L,JONES S C,EASTON K S,et al.