A NEURAL NETWORK BASED METHOD FOR AUDIO SEMANTIC ANALYSIS
First published: 2010-01-04
Abstract: Audio semantic analysis is an important problem in multimedia applications. This paper proposes a neural-network-based approach for analyzing the high-level semantic content of audio event sequences in action movies. We first divide the given event sequence into scene segments according to the time interval between adjacent basic audio events, and then infer the high-level semantic content of the audio context. The neural network lets semantic inference combine prior knowledge with machine learning: the model parameters are first obtained by statistical learning and then adjusted manually on the basis of prior knowledge. We evaluate the approach on audio streams selected from action movies, and the experimental results show that it achieves satisfactory detection results.
Keywords: audio semantic analysis; auditory scene analysis; neural network
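
The abstract's first step, splitting an event sequence into scene segments by the time interval between adjacent events, can be illustrated with a minimal sketch. This is not the authors' implementation: the `AudioEvent` type, the `split_into_scenes` function, the event labels, and the 5-second `max_gap` threshold are all assumptions made for illustration, since the abstract gives no concrete threshold or data format.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class AudioEvent:
    """A detected basic audio event (hypothetical structure)."""
    label: str    # e.g. "gunshot"; labels here are illustrative only
    start: float  # onset time in seconds
    end: float    # offset time in seconds


def split_into_scenes(events: List[AudioEvent],
                      max_gap: float = 5.0) -> List[List[AudioEvent]]:
    """Group events into scene segments.

    A new segment starts whenever the silence between the latest end time
    in the current segment and the next event's start exceeds max_gap.
    The default max_gap is an assumed value, not one taken from the paper.
    """
    scenes: List[List[AudioEvent]] = []
    for ev in sorted(events, key=lambda e: e.start):
        if scenes:
            last_end = max(e.end for e in scenes[-1])
            if ev.start - last_end <= max_gap:
                scenes[-1].append(ev)
                continue
        scenes.append([ev])
    return scenes


events = [
    AudioEvent("gunshot", 0.0, 1.0),
    AudioEvent("explosion", 2.0, 4.0),
    AudioEvent("car_engine", 20.0, 25.0),
    AudioEvent("gunshot", 26.0, 27.0),
]
scenes = split_into_scenes(events, max_gap=5.0)
```

Each resulting segment would then be passed to the semantic inference stage; that stage is not sketched here because the abstract does not describe the network's structure.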