An Effective Bidirectional Mechanism with Pooling for Universal Sentence Representations
First published: 2019-04-18
Abstract: BiLSTM with max pooling is widely adopted as a well-performing supervised universal sentence encoder, where max pooling is the common mechanism for obtaining a fixed-size sentence representation. However, we find that max pooling discards some useful backward and forward information at each time step and relies on a large number of parameters. In this paper, we propose an improved pooling mechanism, based on max pooling, for universal sentence encoders. The proposed model uses three methods to refine the backward and forward information at each time step, and then applies a max-pooling layer or an attention mechanism to obtain a fixed-size sentence representation from the variable-length refined hidden states. Experiments are conducted on the Stanford Natural Language Inference (SNLI) corpus, and the trained model is used as a pretrained universal sentence encoder for transfer tasks. The results show that our model performs better with fewer parameters.
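The max-pooling step the abstract builds on can be sketched as follows: given the concatenated forward and backward hidden states of a BiLSTM at each time step, take the element-wise maximum over time to get a fixed-size sentence vector. This is a minimal illustration with random numpy arrays standing in for real hidden states; the shapes and variable names are assumptions for the sketch, not taken from the paper's implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical BiLSTM hidden states for a 5-token sentence,
# hidden size 4 per direction (illustrative values only).
T, d = 5, 4
h_forward = rng.standard_normal((T, d))
h_backward = rng.standard_normal((T, d))

# Concatenate the two directions at each time step: shape (T, 2d).
H = np.concatenate([h_forward, h_backward], axis=1)

# Max pooling over the time axis yields a fixed-size vector of
# shape (2d,) regardless of the sentence length T.
sentence_vec = H.max(axis=0)

assert sentence_vec.shape == (2 * d,)
```

Because only the per-dimension maximum survives, any forward or backward information below that maximum at a given time step is discarded, which is the loss the proposed refinement methods aim to reduce.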
Keywords: Computer Software and Theory; Sentence Embedding; LSTM; Transfer Tasks
Chinese title: A Mechanism That Fully Exploits Effective Bidirectional Network Information for Sentence Vector Representation
Abstract (Chinese version): A bidirectional long short-term memory network effectively captures contextual information, and adding a max-pooling layer on top yields an effective sentence representation. However, research shows that a plain max-pooling operation, used solely to obtain a fixed-length sentence representation, discards some useful contextual information, since it merely selects the maximum value at each time step, and it also requires a large number of parameters. This paper proposes a mechanism, based on max pooling, that fully exploits contextual information: three methods are used to refine the contextual information, and a max-pooling layer then produces a fixed-length representation. Experiments are conducted on the Stanford Natural Language Inference corpus, and the resulting model is applied to transfer learning, achieving good results.