面向密集人群场景的行人检测算法研究
首发时间:2020-04-17
摘要:行人检测在计算机视觉领域有着非常广泛的应用,已成为备受关注的热点方向。但由于行人本身外观差异大,遮挡严重等原因,导致传统图像处理方法无法满足实际应用的需求。本文围绕人工智能中的计算机视觉技术,研究基于深度学习的行人检测算法,并将其应用到密集人群场景下完成行人检测任务。本文通过改进基于卷积神经网络的单阶段检测器RetinaNet,采用迁移学习的训练方式,实现了对行人的快速与准确识别,在采集的密集场景行人数据集上达到了识别平均准确率95%以上,mAP达到72.16%,且单张图的识别时间仅0.04秒。在此基础上通过对检测失败样本进行分析,归纳出行人检测中的常见问题,设计了一种带有惩罚项的损失函数,有效解决了密集人群场景下常见的行人目标遮挡问题,进一步将行人检测mAP提升至73.45%,同时查准率95.03%,查全率86.37%,满足了密集人群场景下的行人检测应用需求。
For information in English, please click here
Research on Algorithm of Pedestrian Detection in Dense Crowd Scene
Abstract:Pedestrian detection has a very wide range of applications in computer vision. It has become a hot topic that has attracted much attention. However, due to the differences between human appearance and serious occlusion of pedestrians, the traditional image processing methods cannot meet the needs of practical applications.This paper focuses on computer vision technology in artificial intelligence. It proposes object detection algorithms based on deep learning and applies them to dense crowd scenes to complete pedestrian localization tasks. This paper improves the single-stage detector RetinaNet and uses the training method of transfer learning to achieve fast and accurate recognition of pedestrians. The final average recognition accuracy rate is more than 95%, with mAP reaching 72.16%. And the recognition time of a single image only needs 0.04 seconds. Then, this paper proposes a loss function with a penalty term, which can effectively solve the problem of pedestrian occlusion commonly encountered in urban scenes, Also it further improves the pedestrian detection mAP to 73.45%, while the precision is 95.03%, and the recall is 86.37%,which can meet the need of application on pedestrian.
Keywords: Neural Networks Loss Function Pedestrian Detection
基金:
引用
No.****
动态公开评议
共计0人参与
勘误表
面向密集人群场景的行人检测算法研究
评论
全部评论0/1000