Audio Event Detection Based on Deep Learning

Hong Xiaofeng; Liu Gang

First published: 2020-12-30

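Hong Xiaofeng ¹
Hong Xiaofeng (b. 1994), male, master's student; research interest: audio event detection
Liu Gang ¹
Liu Gang (b. 1973), male, associate professor and doctoral supervisor; research interest: speech signal processing

1. Pattern Recognition Laboratory, Beijing University of Posts and Telecommunications, Beijing 100876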

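Abstract: Neural network methods are widely used for audio event detection and tagging. In the Detection and Classification of Acoustic Scenes and Events (DCASE) challenge, an authoritative international competition, most systems take either the time-domain audio signal or the log-mel spectrogram of the audio as input and achieve excellent results. This paper presents the 2D-Wave and 2D-Wave-LogMel systems. Relying on the strong learning capacity of neural networks, they take the time-domain signal as input and learn a corresponding frequency-domain representation, which is then combined with the log-mel spectrogram to give a richer representation of the audio signal as input. On the FSD50K dataset, the systems outperform the baseline system.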

Keywords: audio event detection; neural networks; DCASE; FSD50K

Audio Event Detection Based on Deep Learning

Hong Xiaofeng ¹
Liu Gang ¹

1. Pattern Recognition of Intelligence Search Laboratory, Beijing University of Posts and Telecommunications, Beijing 100876

Abstract: Neural networks are widely used in audio event detection and tagging. In the Detection and Classification of Acoustic Scenes and Events (DCASE) challenge, most systems use the time-domain audio signal or the log-mel spectrogram as input and achieve excellent results. This paper introduces the 2D-Wave and 2D-Wave-LogMel systems, which take the time-domain signal as input, learn a corresponding frequency-domain representation, and combine it with the log-mel spectrogram to obtain a richer input representation; they outperform the baseline system on the FSD50K dataset.

Keywords: audio event detection; neural networks; DCASE; FSD50K
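
The abstract describes two kinds of input: a log-mel spectrogram, and a frequency-like 2D representation learned directly from the waveform. The record gives no architectural details, so the following is only a minimal NumPy sketch of the general idea, not the authors' 2D-Wave or 2D-Wave-LogMel implementation. All names (`log_mel`, `wave_frontend`, `filters`), the frame length, hop size, and band count are illustrative assumptions, and the random `filters` merely stand in for convolution weights that a real system would learn by training.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(sr, n_fft, n_mels):
    """Triangular filters spaced evenly on the mel scale."""
    hz_pts = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2))
    fft_freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for m in range(n_mels):
        lo, ctr, hi = hz_pts[m], hz_pts[m + 1], hz_pts[m + 2]
        rising = (fft_freqs - lo) / (ctr - lo)
        falling = (hi - fft_freqs) / (hi - ctr)
        fb[m] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def frame_signal(x, frame_len, hop):
    """Slice a 1-D signal into overlapping frames: (n_frames, frame_len)."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]

def log_mel(x, sr, n_fft=1024, hop=512, n_mels=64):
    """Fixed front end: log-mel spectrogram, shape (n_mels, n_frames)."""
    frames = frame_signal(x, n_fft, hop) * np.hanning(n_fft)
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    mel = power @ mel_filterbank(sr, n_fft, n_mels).T
    return np.log(mel + 1e-10).T

def wave_frontend(x, filters, hop):
    """Strided 1-D convolution over the raw waveform.

    Each row of `filters` plays the role of one learned kernel, so the
    output is a 2-D map of shape (n_filters, n_frames).
    """
    frames = frame_signal(x, filters.shape[1], hop)
    return np.log1p(np.abs(frames @ filters.T)).T

# Hypothetical input: one second of a 440 Hz tone at 16 kHz.
sr = 16000
wave = np.sin(2 * np.pi * 440 * np.arange(sr) / sr)

# ASSUMPTION: random weights stand in for trained convolution kernels.
rng = np.random.default_rng(0)
filters = rng.standard_normal((64, 1024)) * 0.01

learned = wave_frontend(wave, filters, hop=512)   # learned-style 2-D map
spec = log_mel(wave, sr)                          # fixed log-mel map
features = np.stack([learned, spec])              # two-channel network input
print(features.shape)  # (2, 64, 30)
```

Stacking the two maps as channels is one plausible way to combine them; it requires both front ends to share the same frame length and hop so that their time axes line up. Concatenating along the frequency axis would be an equally reasonable choice under the same constraint.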

Citation:

Hong Xiaofeng, Liu Gang. Audio Event Detection Based on Deep Learning [EB/OL]. Beijing: Sciencepaper Online (中国科技论文在线) [2020-12-30]. https://www.paper.edu.cn/releasepaper/content/202012-121.

Paper No.: 202012-121