基于自适应信息损失的InfoGAN模型的无监督图像特征提取器设计
首发时间:2019-04-04
摘要:无监督地训练特征向量提取器是图像特征提取的一大重要研究领域,将有效的图像特征描述向量应用在后序的图像分类,检索等任务中可以避免维度灾难,提高任务预测准确率等效果。其中,基于GAN网络的InfoGAN模型在无监督解耦数据的特征任务上有着极大的优势,因为该模型不仅可以通过无监督训练来得到高维图像的特征提取器,而且还能由模型中生成模块来生成不同特征向量对应的图像,即可以从生成的图像结果可视化提取特征的类别。但该模型只能适用于低维度的隐向量来提取输入图像的特征,维度设置越小从图像中分离出来的特征量也会越少,因此该模型只能描述输入数据的部分特征信息,其训练得到的输出的特征无法应用于后序的复杂图像处理任务,这也就极大地限制了模型的适用范围。在本文中,通过优化改进原模型中的信息损失函数,使用最大似然替换原模型中基于均方误差的信息损失函数,经过改进的模型可以在高维的隐向量上实现稳定收敛,从而可以解耦输入数据更多的信息,突破了原有模型特征描述时维数上的限制,保证了模型可以提取到输入数据的更多维的特征表达。在实验验证部分,将改进的模型和原始模型在MNIST数据集上进行对比实验,实验结果显示原模型在训练高维特征时出现训练崩溃,无法收敛的问题,但改进后的模型可以稳定地收敛并能输出有效的图像特征表达。
关键词: 信息处理技术 特征提取 无监督学习 生成对抗网络 最大似然损失
For information in English, please click here
Design of unsupervised feature extractor based on InfoGAN with adaptive Info-loss
Abstract:Unsupervised training of feature vector extractors is an important research field in image processing. The application of effective image feature description vectors in the following tasks, such as image classification and retrieval, can avoid dimension disasters and improve the accuracy of prediction of the tasks. Among many methods, InfoGAN model based on GAN network has great advantages in unsupervised learning to disentangle data features: this model can not only get feature extractors of high-dimensional images through unsupervised training, but also generate images corresponding to different feature vectors by generating module in the model, which can visually distinguish feature categories from the results of generated images. However, this model can only be applied to low-dimensional latent codes to extract the features of the input image, and the smaller the dimension setting, the less the features be got from the image, so the trained model can only describe part of the feature information of the input data, and the output features are unable to be used in the complex image processing tasks, this shortage greatly limits the application range of the model. In this paper, we optimize and improve the information loss function in the original model, and the maximum likelihood is used to replace the original information loss function based on the mean square error in the original model. So, the improved model can achieve stable convergence in the high-dimensional vector, which can disentangle more information from the input data, and be free of the limitation of dimension of the original model to make sure that the model can be used in complex tasks. In the experiment, the improved model and the original model are compared on the MINIST data set. The experimental results show that the original model has training collapse and unable to converge when training with high-dimensional latent codes, but the improved model can converge steadily and output valid image feature expression.
Keywords: information processing technology feature extraction unsupervised learning generative adversarial networks maximum likelihood error
基金:
引用
No.****
动态公开评议
共计0人参与
勘误表
基于自适应信息损失的InfoGAN模型的无监督图像特征提取器设计
评论
全部评论0/1000