面向密集人群场景的行人检测算法研究

郭尧; 宋晴

0
0
浏览
下载

摘要
关键词
基金信息
论文图表
动态公开评议
相关论文
评论

面向密集人群场景的行人检测算法研究

首发时间：2020-04-17

郭尧 ¹
郭尧（1994-），男，硕士研究生，主要研究方向：目标检测
宋晴 ¹
宋晴（1978-），女，博导，主要研究方向：模式识别与计算机视觉.

1、北京邮电大学自动化学院，北京，100876

摘要：行人检测在计算机视觉领域有着非常广泛的应用，已成为备受关注的热点方向。但由于行人本身外观差异大，遮挡严重等原因，导致传统图像处理方法无法满足实际应用的需求。本文围绕人工智能中的计算机视觉技术，研究基于深度学习的行人检测算法，并将其应用到密集人群场景下完成行人检测任务。本文通过改进基于卷积神经网络的单阶段检测器RetinaNet，采用迁移学习的训练方式，实现了对行人的快速与准确识别，在采集的密集场景行人数据集上达到了识别平均准确率95%以上，mAP达到72.16%，且单张图的识别时间仅0.04秒。在此基础上通过对检测失败样本进行分析，归纳出行人检测中的常见问题，设计了一种带有惩罚项的损失函数，有效解决了密集人群场景下常见的行人目标遮挡问题，进一步将行人检测mAP提升至73.45%，同时查准率95.03%，查全率86.37%，满足了密集人群场景下的行人检测应用需求。

关键词：神经网络损失函数行人检测

For information in English, please click here

Research on Algorithm of Pedestrian Detection in Dense Crowd Scene

Guo Yao ¹
郭尧（1994-），男，硕士研究生，主要研究方向：目标检测
Song Qing ¹
宋晴（1978-），女，博导，主要研究方向：模式识别与计算机视觉.

1、School of Automation,Beijing University of Posts and Telecommunications,Beijing,100876

Abstract：Pedestrian detection has a very wide range of applications in computer vision. It has become a hot topic that has attracted much attention. However, due to the differences between human appearance and serious occlusion of pedestrians, the traditional image processing methods cannot meet the needs of practical applications.This paper focuses on computer vision technology in artificial intelligence. It proposes object detection algorithms based on deep learning and applies them to dense crowd scenes to complete pedestrian localization tasks. This paper improves the single-stage detector RetinaNet and uses the training method of transfer learning to achieve fast and accurate recognition of pedestrians. The final average recognition accuracy rate is more than 95%, with mAP reaching 72.16%. And the recognition time of a single image only needs 0.04 seconds. Then, this paper proposes a loss function with a penalty term, which can effectively solve the problem of pedestrian occlusion commonly encountered in urban scenes, Also it further improves the pedestrian detection mAP to 73.45%, while the precision is 95.03%, and the recall is 86.37%，which can meet the need of application on pedestrian.

Keywords： Neural Networks Loss Function Pedestrian Detection

基金：

论文图表：

引用

导出参考文献

.txt

.ris

.doc

郭尧，宋晴. 面向密集人群场景的行人检测算法研究[EB/OL]. 北京：中国科技论文在线 [2020-04-17]. https://www.paper.edu.cn/releasepaper/content/202004-170.

No.****

动态公开评议

共计0人参与

动态评论进行中

全部评论

0/1000

论文编号	202004-170
论文题目	面向密集人群场景的行人检测算法研究
文献类型
收录期刊	上传封面中文期刊英文期刊期刊名称（中文）期刊名称（英文）年，卷（）上传封面中文专著英文专著书名（中文）书名（英文）出版地出版社出版年上传封面中文译著英文译著书名（中文）书名（英文）出版地出版社出版年上传封面中文论文集英文论文集编者.论文集名称（中文） [c]. 出版地出版社出版年， - 编者.论文集名称（英文） [c]. 出版地出版社出版年，- 上传封面中文文献英文文献期刊名称（中文）期刊名称（英文）日期-- 在线地址http:// 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期-- 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期--
英文作者写法：中外文作者均姓前名后，姓大写，名的第一个字母大写，姓全称写出，名可只写第一个字母，其后不加实心圆点“.”, 作者之间用逗号“，”分隔，最后为实心圆点“.”, 示例1：原姓名写法：Albert Einstein,编入参考文献时写法：Einstein A. 示例2：原姓名写法：李时珍；编入参考文献时写法：LI S Z. 示例3：YELLAND R L,JONES S C,EASTON K S,et al.