Modeling Prior Knowledge for Neural Machine Translation Systems
First published: 2020-03-13
Abstract: With the rapid development of deep learning in recent years, neural machine translation, which combines large-scale corpora with deep neural networks, has surpassed statistical machine translation in many respects and shows growing vitality. Neural machine translation adopts an encoder-decoder architecture, typically built from convolutional or recurrent neural networks; because the model is trained end to end, it makes little use of the prior knowledge implicit in a sentence, such as its phrase structure and dependency relations. In 2017, Google proposed the Transformer model, which dispenses with recurrence entirely, consists of feed-forward networks and self-attention mechanisms, and achieved state-of-the-art results on many natural language processing tasks. Building on the Transformer translation model, this paper proposes a method that integrates prior knowledge through word embeddings: by fusing part-of-speech tags, phrase structure, and other prior knowledge into the source text, it improves the translation quality of the self-attention-based Transformer. On a Chinese-English translation task, the proposed method improves over the baseline Transformer by 1.19 BLEU.
Keywords: machine translation; prior knowledge; word embedding
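The abstract says prior knowledge such as part-of-speech tags is fused into the model through word embeddings, but gives no implementation details. Below is a minimal sketch, assuming a PyTorch setting, of one common way such fusion could be realized: learning a separate embedding table for POS tags and summing it with the token embeddings before positional encoding and the Transformer encoder. The class name PriorKnowledgeEmbedding and all sizes are hypothetical, not the authors' code.

```python
import torch
import torch.nn as nn

class PriorKnowledgeEmbedding(nn.Module):
    """Fuse token embeddings with POS-tag embeddings (illustrative sketch)."""

    def __init__(self, vocab_size: int, num_pos_tags: int, d_model: int = 512):
        super().__init__()
        self.token_emb = nn.Embedding(vocab_size, d_model)
        self.pos_tag_emb = nn.Embedding(num_pos_tags, d_model)

    def forward(self, token_ids: torch.Tensor, pos_tag_ids: torch.Tensor) -> torch.Tensor:
        # Summation keeps the fused representation in the same
        # d_model-dimensional space the Transformer encoder expects;
        # concatenation followed by a linear projection is an alternative.
        return self.token_emb(token_ids) + self.pos_tag_emb(pos_tag_ids)


# Example: a batch of 2 sentences, 5 tokens each, with parallel POS-tag ids.
emb = PriorKnowledgeEmbedding(vocab_size=32000, num_pos_tags=40)
tokens = torch.randint(0, 32000, (2, 5))
pos_tags = torch.randint(0, 40, (2, 5))
fused = emb(tokens, pos_tags)  # shape: (2, 5, 512)
```

Under this reading, the fused tensor (plus the usual sinusoidal positional encoding) would simply replace the plain token embeddings fed to the encoder, e.g. torch.nn.TransformerEncoder, leaving the rest of the architecture unchanged.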