Research on an Answer Selection Algorithm Based on Multi-view Attention

JIANG Yuou; XU Weiran

First published: 2020-03-18

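JIANG Yuou ¹
JIANG Yuou (b. 1996), female; main research interest: natural language processing
XU Weiran ¹
XU Weiran (b. 1975), male, associate professor and master's supervisor; main research interest: natural language processing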

1. School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876

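Abstract: Answer selection is an important subtask in natural language processing and a critical supporting technology for automatic question answering systems. Because the task is chiefly about matching questions to relevant answers, and attention mechanisms offer a flexible, effective way to exchange and exploit information between the two, attention has become an indispensable module in question answering systems. This paper proposes an answer selection algorithm based on a multi-view attention mechanism: it invokes multiple attention types (co-attention, self-attention) and multiple attention variants (max pooling, average pooling, soft alignment) to model semantic views from several angles, improving the completeness and accuracy of semantic encoding. To remove the costly architectural engineering that running several attention mechanisms together would otherwise require, and to improve computational efficiency, the paper proposes using attention as a form of feature augmentation, so that multiple attention mechanisms can be invoked in a scalable way. A compression function returns scalar features, which are re-attached to the original word representations, supplying subsequent encoding layers with features that carry both intra-sentence and inter-sentence knowledge and improving representation learning. The model is evaluated on a factoid question answering dataset (TrecQA), an open-domain dataset (WikiQA), and community question answering datasets (SemEval-2016 CQA and YahooCQA), and achieves the best performance reported to date on all of them. Ablation studies further demonstrate the effectiveness of the multi-view attention mechanism.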

Keywords: artificial intelligence; question answering system; answer selection; attention mechanism

A Multi-view Attention Network for Answer Selection Algorithm

JIANG Yuou ¹
XU Weiran ¹

1. School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876

Abstract: Answer selection is an important subtask in natural language processing and a key supporting technology for automatic question answering systems. Because the task centers on semantic matching between question and answer, and attention mechanisms offer an effective way for the two to exchange information, attention has become an indispensable module in question answering systems. This paper proposes a multi-view attention network that uses multiple attention types (co-attention and self-attention) and multiple attention variants (max pooling, average pooling, soft alignment) to model multi-perspective semantic views, improving the completeness and accuracy of semantic encoding. To avoid the costly architectural engineering needed to run several attention mechanisms at once, and to improve computational efficiency, the paper recasts attention as a form of feature augmentation, which allows multiple attention casts. After the soft attention operations, a compression function returns scalar features, which are re-attached to the original word representations; these give subsequent encoding layers hints carrying global and cross-sentence knowledge, which can improve representation learning. In experiments on a factoid question answering dataset (TrecQA), an open-domain dataset (WikiQA), and community question answering datasets (SemEval-2016 CQA and YahooCQA), the model outperforms existing state-of-the-art models, and ablation studies confirm the effectiveness of the multi-view attention mechanism.

Keywords: artificial intelligence; question answering system; answer selection; attention mechanism
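
The abstract describes attention used as feature augmentation: each attention view is compressed to a few scalar features that are appended to the original word representations. A minimal pure-Python sketch of that idea follows. This page does not give the paper's exact formulation, so the dot-product affinity, the per-word re-weighting used for the two pooled views, the three compression functions (sum, product, difference) and all function names are assumptions chosen for illustration only.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    total = sum(es)
    return [e / total for e in es]


def dot(u, v):
    return sum(x * y for x, y in zip(u, v))


def weighted_sum(weights, rows):
    """Attention-weighted average of row vectors."""
    dim = len(rows[0])
    return [sum(w * r[k] for w, r in zip(weights, rows)) for k in range(dim)]


def compress(word, attended):
    """Assumed compression: reduce a (word, attended) pair to 3 scalars."""
    return [
        sum(word) + sum(attended),                  # sum over the concatenation
        dot(word, attended),                        # sum of element-wise product
        sum(x - y for x, y in zip(word, attended)), # sum of element-wise difference
    ]


def multi_view_features(q, a):
    """Append scalar attention features to each question word vector.

    q: list of m question word vectors, a: list of n answer word vectors,
    all of dimension d. Returns m vectors of dimension d + 12.
    """
    # Affinity between every question word and every answer word (assumed dot product).
    s = [[dot(qi, aj) for aj in a] for qi in q]
    # Pooled co-attention: one weight per question word.
    w_max = softmax([max(row) for row in s])
    w_mean = softmax([sum(row) / len(row) for row in s])

    augmented = []
    for i, qi in enumerate(q):
        views = [
            [w_max[i] * x for x in qi],                            # co-attention, max pooling
            [w_mean[i] * x for x in qi],                           # co-attention, average pooling
            weighted_sum(softmax(s[i]), a),                        # co-attention, soft alignment
            weighted_sum(softmax([dot(qi, qk) for qk in q]), q),   # self-attention
        ]
        feats = [f for view in views for f in compress(qi, view)]
        augmented.append(list(qi) + feats)
    return augmented


if __name__ == "__main__":
    question = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    answer = [[1.0, 0.0], [0.5, 0.5]]
    out = multi_view_features(question, answer)
    print(len(out), len(out[0]))  # 3 14
```

In this sketch each word gains 12 scalars (4 views × 3 compressions), so the input to the subsequent encoder grows from d to d + 12 regardless of d. Adding another attention view costs three more scalars rather than another full d-dimensional representation, which is one way to read the scalability claim in the abstract.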

JIANG Yuou, XU Weiran. Research on an Answer Selection Algorithm Based on Multi-view Attention [EB/OL]. Beijing: Sciencepaper Online (中国科技论文在线) [2020-03-18]. https://www.paper.edu.cn/releasepaper/content/202003-208.

Paper number	202003-208