机器视觉对坑洼道路检测识别的最优模型
首发时间:2024-03-27
摘要:坑洼道路检测和识别是一种计算机视觉任务,旨在通过数字图像识别出存在坑洼的道路。这对于地质勘探、航天科学和自然灾害等领域的研究和应用具有重要意义。本文主要研究数据集使用机器学习方法,识别并提取图像样本数据,生成训练模型。以达到用此训练模型检测时能够将陌生图像区分为正常道路还是坑洼道路图像。在此基础上利用迁移学习和时间蒸馏等方法提高识别精度、识别速度、分类准确性。确立VGG-19、ResNet-18及改进VGG-19三种数学模型,同时通过训练模型基础上不断调整并优化参数,并持续提升模型识别率、速度和分类的准确性,根据代码运行结果从准确性、鲁棒性、效率评估、泛化能力、可解释性、对比实验、可用性等等方面对训练模型进行评估。利用Pyhton软件通过VGG-19、ResNet-18及改进VGG-19模型进行时间效率、准确丢失值和模型尺寸的分析最终获得最准确的预测。求解得到:改进VGG-19模型训练时间31176.55s,训练模型文件大小为100MB,训练200次时丢失值能够收敛至2.1×10-4左右,测试模型准确率能够达到98%,在实验中,经过数次实验后平均正确率为97%左右。分析结果得出带有阴影的坑洼区域错误率较低,同时工作效率较高、泛化能力强、可解释性强。改进的VGG-19模型的各项评测结果说明该模型对坑洼道路检测和识别准确度方面指标的准确性最高。
关键词: 最优化 VGG-19 ResNet-18 改进VGG-19
For information in English, please click here
The optimal model of pothole road detection and recognition by machine vision
Abstract:Pothole road detection and identification is a computer vision task designed to identify roads with potholes through digital images. This is of great significance to the research and application of geological exploration, space science and natural disasters. This paper mainly studies the use of machine learning methods in data sets to identify and extract image sample data and generate training models. In order to distinguish the unfamiliar image from the normal road image or the pothole road image when the model is trained by this method.On this basis, transfer learning and time distillation are used to improve the recognition accuracy, recognition speed and classification accuracy. Three mathematical models, VGG-19, ResNet-18 and improved VGG-19, were established, and the parameters were continuously adjusted and optimized based on the training model, and the recognition rate, speed and classification accuracy of the model were continuously improved. According to the results of code operation, the training model is evaluated from the aspects of accuracy, robustness, efficiency evaluation, generalization ability, interpretability, comparison experiment, usability and so on.Pyhton software was used to analyze the time efficiency, accurate loss value and model size through VGG-19, ResNet-18 and improved VGG-19 model, and finally the most accurate prediction was obtained. The results are as follows: the training time of the improved VGG-19 model is 31176.55s, the size of the training model file is 100MB, the lost value can converge to about 2.1×10-4 after 200 training times, and the accuracy rate of the test model can reach 98%. In the experiment, the average accuracy rate is about 97% after several experiments.The analysis results show that the error rate of the shadow pothole area is lower, and the working efficiency is higher, the generalization ability is stronger, and the interpretation is stronger. The evaluation results of the improved VGG-19 model show that the model has the highest accuracy in the pothole road detection and recognition accuracy index.
Keywords: optimization VGG-19 ResNet-18 Improved VGG-19
基金:
引用
No.****
动态公开评议
共计0人参与
勘误表
机器视觉对坑洼道路检测识别的最优模型
评论
全部评论0/1000