• 基于多尺度混合注意力的遥感图像超分辨率重建

    Multi-scale hybrid attention network for remote sensing image super-resolution

    • 现有基于Transformer 的方法在处理复杂遥感场景时表现不佳,容易出现伪影和细节丢失,特别是在局部信息捕捉和空间关系建模方面存在明显局限。为解决上述问题,提出了一种多尺度混合注意力网络(Multi-scale Hybrid Attention Network, MsHAN)。该网络设计了大核多尺度注意力机制(Large Kernel Multi-scale Attention Mechanism, LKMSA)、多尺度动态窗口空洞注意力模块(Multi-scale Dynamic Window Hole Attention Module, MSDWDA)和空间前馈模块(Spatial Feedforward Module, SFM),全面提升了遥感图像超分辨率重建的性能。LKMSA结合大核卷积和多尺度机制,显著提高了对长距离依赖的建模能力和细节恢复效果。MSDWDA通过动态窗口划分和多尺度空洞卷积,有效增强了局部细节捕捉和全局一致性,并抑制了伪影累积。SFM通过优化前馈网络(Feed-Forward Network, FFN)结构,提升空间信息的建模能力,同时降低了计算复杂度。在AID、UCMerced与NWPU-RESISC45数据集上,MsHAN与现有常用、最新超分辨率重建方法(如EDSR、RCAN、MAN等)进行对比实验,结果显示:在各项评价指标上均取得了优异的表现。以PSNR指标为例,MsHAN相较最新的MAN方法在AID、UCMerced数据集上分别提升了0.05 dB与0.11 dB。这些结果表明,所提方法在细节恢复和整体图像质量方面具有显著优势。

       

      Abstract: Existing Transformer-based methods perform poorly when processing complex remote sensing scenes, and are prone to artifacts and detail loss, especially in terms of local information capture and spatial relationship modeling. To solve the above problems, a Multi-scale Hybrid Attention Network (MsHAN) is proposed. The network designs a Large Kernel Multi-scale Attention Mechanism (LKMSA), a Multi-scale Dynamic Window Hole Attention Module (MSDWDA) and a Spatial Feedforward Module (SFM), which comprehensively improves the performance of remote sensing image super-resolution reconstruction. LKMSA combines large kernel convolution and multi-scale mechanism to significantly improve the modeling ability of long-distance dependencies and the effect of detail recovery. MSDWDA effectively enhances local detail capture and global consistency and suppresses artifact accumulation through dynamic window division and multi-scale hole convolution. SFM improves the modeling ability of spatial information while reducing computational complexity by optimizing the Feed-Forward Network (FFN) structure. On the AID, UCMerced and NWPU-RESISC45 datasets, MsHAN is compared with the existing commonly used and latest super-resolution reconstruction methods (such as EDSR, RCAN, MAN, etc.). The results show that it achieved excellent performance in various evaluation indicators. Taking the PSNR indicator as an example, MsHAN improved by 0.05 dB and 0.11 dB on the AID and UCMerced datasets respectively compared to the latest MAN method. These results indicate that the proposed method has significant advantages in detail recovery and overall image quality.

       

    /

    返回文章
    返回