Publication details - 中国科技论文在线 (Chinese Sci-Tech Paper Online)

俞凯

Journal article

Deep feature for text-dependent speaker verification

Speech Communication, 2015, 73: 1-13 | October 1, 2015 | doi.org/10.1016/j.specom.2015.07.003

URL:https://www.sciencedirect.com/science/article/abs/pii/S016763931500076X?via%3Dihub

Abstract/Description

Deep learning has recently been used successfully in speech recognition, but it has not yet been carefully explored or widely accepted for speaker verification. To incorporate deep learning into speaker verification, this paper proposes novel approaches for extracting and using features from deep learning models for text-dependent speaker verification. In contrast to traditional short-term spectral features such as MFCC or PLP, outputs from the hidden layers of various deep models are employed as deep features. Four types of deep models are investigated: deep Restricted Boltzmann Machines, speech-discriminant Deep Neural Network (DNN), speaker-discriminant DNN, and multi-task joint-learned DNN. Once deep features are extracted, they may be used within either the GMM-UBM framework or the identity vector (i-vector) framework. Joint linear discriminant analysis and probabilistic linear discriminant analysis are proposed as effective back-end classifiers for identity vector based deep features. These approaches were evaluated on the RSR2015 data corpus. Experiments showed that deep feature based methods obtain significant performance improvements over the traditional baselines, whether they are applied directly in the GMM-UBM system or used as identity vectors. The equal error rate (EER) of the best system using the proposed identity vector is 0.10%, only one fifteenth of that of the GMM-UBM baseline.

Keywords: Text-dependent speaker verification, Deep neural networks, Deep features, RSR2015
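
The abstract reports its headline result as an equal error rate (EER), the operating point at which the false acceptance rate and the false rejection rate of a verification system are equal. The following is a minimal sketch of how an EER could be estimated from trial scores. It is not the authors' evaluation code: the function name `equal_error_rate`, the simple threshold sweep, and the convention that a higher score means "same speaker" are all assumptions made for illustration.

```python
import numpy as np


def equal_error_rate(target_scores, impostor_scores):
    """Estimate the EER from verification scores (hypothetical helper).

    Assumes a higher score means the system is more confident that the
    trial is a genuine (target) speaker.
    """
    target = np.asarray(target_scores, dtype=float)
    impostor = np.asarray(impostor_scores, dtype=float)

    # Every observed score is a candidate decision threshold.
    thresholds = np.unique(np.concatenate([target, impostor]))

    best_gap, eer = np.inf, None
    for t in thresholds:
        far = np.mean(impostor >= t)  # impostors wrongly accepted
        frr = np.mean(target < t)     # targets wrongly rejected
        gap = abs(far - frr)
        if gap < best_gap:
            # Keep the threshold where the two error rates are closest
            # and report their average as the EER estimate.
            best_gap, eer = gap, (far + frr) / 2.0
    return eer
```

For instance, `equal_error_rate([0.9, 0.8, 0.3, 0.7], [0.1, 0.2, 0.75, 0.4])` evaluates to 0.25: at threshold 0.7, one of four impostors is accepted and one of four targets is rejected. With finite trial lists the two error rates rarely cross exactly, which is why this sketch averages them at the closest point. Evaluation toolkits often interpolate instead.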
