• 基于SAM多尺度标签优化的半监督学习遥感目标检测

    Semi-supervised remote sensing object detection based on SAM multi-scale label optimization

    • 针对遥感图像中目标分辨率低、背景复杂且获取高质量旋转框标注费用高、耗时长等问题,提出了一种多尺度标签优化的半监督学习遥感目标检测方法。该方法使用SoftTeacher模型能够充分利用大量未标注且多样化的数据,同时还能发现原始数据集中未标注的目标;借助SAM(Segment Anything Model)模型可实现基于深度学习的图像分割,并通过基于掩码的优化生成高质量的标签。通过半监督学习生成伪标注,对伪标注中的标签特征框进行多尺度处理后输入SAM模型进行优化,使用优化后的标注扩充原数据集样本重新用于全监督训练。实验结果表明:所选用的半监督目标检测模型SoftTeacher能够展现出优于全监督目标检测模型的性能,经过优化后的数据集样本能够展现相比原本伪标注数据集更精确的效果。在使用扩充后的数据集进行全监督训练时,原先的平均精度均值(mean Average Precision, mAP, mAP)从51.4%提升到53.5%。此外,全监督训练阶段使用现有的常用目标检测器进行了对比实验,进一步验证了所提方法可以有效提高遥感目标检测在标注不足情况下的准确性。

       

      Abstract: To address the challenges of low target resolution, complex backgrounds, and the high cost and time-consuming nature of obtaining high-quality rotated bounding box annotations in remote sensing images, a multi-scale label optimization method for semi-supervised remote sensing object detection is proposed. The SoftTeacher model effectively leverages large amounts of unlabeled and diverse data, while also identifying previously unlabeled targets in the original dataset. By employing the Segment Anything Model (SAM), deep learning-based image segmentation is achieved, and high-quality labels are generated through mask-based optimization. The proposed method first generates pseudo-labels through semi-supervised learning, then applies multi-scale processing to the label feature boxes before inputting them into the SAM model for optimization. The optimized labels are used to augment the original dataset, which is then employed for fully supervised training. Experimental results demonstrate that the selected semi-supervised object detection model, SoftTeacher, outperforms fully supervised detection models, with the optimized dataset samples showing more accurate results compared to the original pseudo-labeled dataset. When the augmented dataset is used for fully supervised training, the mean Average Precision (mAP) improves from 51.4% to 53.5%. Additionally, comparative experiments with existing common object detectors during the fully supervised training phase further validate that the proposed method effectively enhances the accuracy of remote sensing object detection under conditions of insufficient labeled data.

       

    /

    返回文章
    返回