• 面向自适应协同优化的半监督语义分割方法研究

    Study on semi-supervised semantic segmentation methods for adaptive collaborative optimization

    • 在自动驾驶系统中,语义分割是感知道路关键元素的重要步骤。一致性正则化方法是一种代表性的半监督语义分割方法,可有效应对全监督语义分割对大量高成本标注信息的依赖。传统的一致性正则化方法在参数传递过程中对无标签数据分布特征利用还比较薄弱,泛化性能受到一定制约。因此,提出了一种面向自适应协同优化的半监督语义分割方法。首先,设计了一种随机参数恢复机制,动态注入预训练模型参数以缓解噪声错误积累,增强模型抗干扰能力。其次,提出了基于JS散度的动态指数移动平均(Exponential Moving Average, EMA)策略。通过量化师生模型预测分布差异自适应调节参数更新强度,缓解权重耦合问题。最后,构建了基于交并比(Intersection over Union, IoU)一致性的自适应对称交叉熵损失函数,结合空间对齐约束与双向概率分布度量,缓解学生模型参数过拟合的问题。实验结果表明:在Cityscapes和Pascal VOC数据集上,采用ResNet-50作为骨干网络且有标签数据占比为数据集的1/16时,相较于SOTA方法,UniMatch在平均交并比(mean Intersection over Union, mIoU)指标上分别提升了1.09%和0.86%。

       

      Abstract: In autonomous driving systems, semantic segmentation is a crucial step for perceiving key road elements. Consistency regularization methods, as a representative semi-supervised semantic segmentation approach, can effectively mitigate the reliance of fully supervised semantic segmentation on large-scale high-cost annotated data. Traditional consistency regularization methods inadequately utilize the distribution characteristics of unlabeled data during parameter transfer, thereby constraining their generalization performance. To address these limitations, an adaptive collaborative optimization-oriented semi-supervised semantic segmentation method is proposed. Firstly, a stochastic parameter recovery mechanism that dynamically injects pre-trained model parameters to alleviate noise-induced error accumulation and enhance model robustness is designed. Secondly, a Jensen-Shannon (JS) divergence-based dynamic exponential moving average (EMA) strategy is proposed, which adaptively regulates parameter update intensity by quantifying prediction distribution discrepancies between teacher and student models, effectively addressing weight coupling issues. Finally, an intersection over union (IoU) consistency-based adaptive symmetric cross-entropy loss function is constructed, integrating spatial alignment constraints with bidirectional probability distribution metrics to alleviate overfitting in student model parameters. Experimental results demonstrate that on the Cityscapes and Pascal VOC datasets, using ResNet-50 as the backbone network with only 1/16 of labeled data, our method achieves mean Intersection over Union(mIoU) improvements of 1.09% and 0.86% respectively compared to the state-of-the-art UniMatch method.

       

    /

    返回文章
    返回