Publication details - 中国科技论文在线

俞凯

Journal article

Sequence discriminative training for deep learning based acoustic keyword spotting

Speech Communication, 2018, 102: 100-111 | September 1, 2018 | doi.org/10.1016/j.specom.2018.08.001

URL:https://www.sciencedirect.com/science/article/abs/pii/S0167639317303631

Abstract

Speech recognition is a sequence prediction problem. Beyond the various deep learning approaches used for frame-level classification, sequence-level discriminative training has proved indispensable for achieving state-of-the-art performance in large vocabulary continuous speech recognition (LVCSR). However, keyword spotting (KWS), one of the most common speech recognition tasks, has benefited almost exclusively from frame-level deep learning because competing sequence hypotheses are difficult to obtain. The few studies on sequence discriminative training for KWS are limited to fixed-vocabulary or LVCSR-based methods and have not been compared with state-of-the-art deep learning based KWS approaches. In this paper, a sequence discriminative training framework is proposed for both fixed-vocabulary and unrestricted acoustic KWS. Sequence discriminative training is systematically investigated for both sequence-level generative and discriminative models. By introducing word-independent phone lattices or non-keyword blank symbols to construct competing hypotheses, feasible and efficient sequence discriminative training approaches are proposed for acoustic KWS. Experiments showed that the proposed approaches obtained consistent and significant improvements in both fixed-vocabulary and unrestricted KWS tasks compared with previous frame-level deep learning based acoustic KWS methods.

Keywords: ASR, KWS, Sequence discriminative training, Generative sequence model, Discriminative sequence model
