• 融合轻量卷积和注意力机制的道路目标检测算法

    Road object detection algorithm integrating lightweight convolution and attention mechanism

    • 目标检测技术作为计算机视觉领域的一项关键技术,已经在多个领域得到广泛应用,尤其在精确解析道路交通场景方面发挥着关键的作用。然而,在车辆密集且路况复杂的道路交通环境中,传统目标检测方法存在准确性不足、误判与漏判等问题。此外,大型模型在资源受限的边缘设备上的部署难题,进一步限制了其在实际场景中的应用。基于YOLOv8目标检测模型,提出了一种用于道路目标检测的方法。首先,采用GhostConv卷积核替代C2f结构中的Bottleneck层的常规卷积,显著降低了模型的参数规模。然后,融合GhostConv与LSK注意力机制,构建了GhostLSK模块,在降低整体参数规模的同时,显著提升了模型对交通车辆和行人的特征提取能力。最后,在检测头上新增一个小目标检测层,加强对小目标的检测能力。在YOLOv8n和YOLOv8s模型上进行的改进实验,通过KITTI和SODA10M数据集的严格验证,证实了所提方法的有效性。改进后的YOLOv8n模型在KITTI数据集上的mAP@0.5达到了92.5%,相较于原始模型提升了1.3%,同时参数量下降了29.2%。YOLOv8s模型在同一数据集上的mAP@0.5达到了94.9%,提升了0.8%,参数量减少了33.7%。

       

      Abstract: Object detection, a pivotal discipline within computer vision, has seen extensive deployment across various sectors, notably in the precise parsing of vehicular traffic contexts, where its critical function is paramount. However, in road traffic environments with dense vehicles and complex road conditions, traditional object detection methods have problems such as insufficient accuracy, misjudgment, and omission. In addition, the deployment challenges of large-scale models on resource constrained edge devices further limit their application in practical scenarios. A method for road object detection based on the YOLOv8 object detection model is proposed. Firstly, the GhostConv convolution kernel is used to replace the conventional convolution of the Bottleneck layer in the C2f structure, significantly reducing the parameter size of the model. Then, by integrating GhostConv and LSK attention mechanism, a GhostLSK module is constructed, which significantly improves the model's feature extraction ability for traffic vehicles and pedestrians while reducing the overall parameter size. Finally, add a small object detection layer on the detection head to enhance the detection capability of small objects.

       

    /

    返回文章
    返回