Semantic Segmentation Based on Residual Dilated Attention for Traffic Scene Understanding

Fan Haibo; Diao Zulong; Zhang Dafang


Residual Dilated Attention for Semantic Segmentation of Traffic Scene Understanding

First published: 2020-05-28

Haibo Fan ¹
Fan Haibo (1994-), Female, Master, Deep learning, computer vision and semantic segmentation. Email: haibo fan@hnu.edu.cn.
Zulong Diao ², Dafang Zhang ¹
Zhang Dafang (1959-), Male, Professor, Ph.D. supervisor, Machine learning, traffic analysis and network security. Email: dfzhang@hnu.edu.cn

1. Hunan University, College of Computer Science and Electronic Engineering, Changsha 410082
2. Chinese Academy of Sciences, Institute of Computing Technology, Beijing 100190

Abstract: In recent years, convolutional neural networks have achieved remarkable success in semantic segmentation for traffic scene understanding. Two main problems remain in this field: 1) Repeated pooling and downsampling operations in convolutional networks reduce the resolution of traffic images, which causes a loss of spatial information and degrades segmentation performance. 2) Traffic images contain many objects of different scales, and accurately recognizing and segmenting these multi-scale objects is another key problem. To address these problems, this paper proposes an image semantic segmentation method based on Residual Dilated Attention. The method uses a spatial CNN to extract high-level semantic information, then uses the proposed Residual Dilated Attention model to capture low-level semantic information. Sampling rates are set according to the designed sampling rules, so that multi-scale context information is aggregated effectively while the feature maps keep a high resolution. Finally, a fusion module is designed to fuse the results produced by the spatial CNN and the Residual Dilated Attention. A series of simulation experiments on the CULane and CamVid traffic datasets yields competitive results, which demonstrates the effectiveness of the proposed method.
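
The abstract relies on dilated (atrous) convolution to widen context without downsampling. The following is a minimal illustrative sketch of that general idea in plain Python, not the authors' model: the 1-D setting, the function names, the all-ones kernel, and the sampling rates (1, 2, 5) are all assumptions chosen for illustration, and the paper's actual sampling rules and attention mechanism are not reproduced here.

```python
def dilated_conv1d(signal, kernel, rate):
    """1-D dilated convolution with zero padding.

    The output has the same length as the input, so resolution is preserved.
    `rate` is the sampling (dilation) rate: the spacing between kernel taps.
    """
    assert len(kernel) % 2 == 1, "odd kernel size keeps the output centered"
    half = len(kernel) // 2
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for j, weight in enumerate(kernel):
            idx = i + (j - half) * rate  # taps are spread `rate` samples apart
            if 0 <= idx < len(signal):   # positions outside the signal count as zero
                acc += weight * signal[idx]
        out.append(acc)
    return out


def receptive_field(kernel_size, rates):
    """Receptive field of stride-1 dilated convolutions applied in sequence."""
    field = 1
    for rate in rates:
        field += (kernel_size - 1) * rate
    return field


def residual_multi_rate(signal, kernel, rates):
    """Hypothetical aggregation: sum parallel dilated branches, then add the input back."""
    out = list(signal)  # residual (identity) path
    for rate in rates:
        branch = dilated_conv1d(signal, kernel, rate)
        out = [a + b for a, b in zip(out, branch)]
    return out


impulse = [0.0] * 4 + [1.0] + [0.0] * 4
widened = dilated_conv1d(impulse, [1.0, 1.0, 1.0], rate=2)
print(len(widened) == len(impulse))   # True: no resolution is lost
print(receptive_field(3, [1, 2, 5]))  # 17
```

With three 3-tap layers, the assumed rates (1, 2, 5) cover 17 input positions, whereas undilated layers would cover only 7, and the output length never changes. This is the trade-off the abstract describes: more multi-scale context at full feature-map resolution.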

Keywords: Computer vision; Semantic segmentation; Attention mechanism; Dilated convolutions; Multi-scale context information

Funding:

1. National Natural Science Foundation of China (Grant No. 61976087)

Citation:

Haibo Fan, Zulong Diao, Dafang Zhang. Residual Dilated Attention for Semantic Segmentation of Traffic Scene Understanding [EB/OL]. Beijing: Sciencepaper Online [2020-05-28]. https://www.paper.edu.cn/releasepaper/content/202005-207.

Paper No.: 202005-207