• 基于中心加强的自然场景文本检测算法

    Center-enhanced natural scene text detection algorithm

    • 现有的自然场景文本检测算法模型复杂度大,难以部署在终端嵌入式设备上且对于小尺度文本易漏检。针对以上问题,提出了一种基于中心加强的场景文本检测算法(Center-Enhanced Network, CENet)。为了易于部署在终端嵌入式设备上,CENet使用轻量级的ResNet18作为骨干网络。引入中心加强模块(Center-Enhanced Module, CEM),加强小尺度文本的特征,解决网络中下采样操作造成小尺度文本易淹没在背景中的问题。引入浅层特征增强模块(Shallow Feature-Augmented Module, SFAM),利用高层特征中丰富的语义来指导浅层特征,对浅层特征中小尺度文本的细节信息进行增强。在公开数据集ICDAR2015和MSRA-TD500进行消融实验,验证了CENet算法的有效性。在小尺度文本数据集ICDAR2015进行对比实验,结果表明:CENet的准确率和F值比主流的DBNet算法有一定提升。

       

      Abstract: Existing text detection algorithms for complex natural scenes often suffer from high model complexity, making them challenging to deploy on edge embedded devices, while also tending to miss small-scale text. To address these issues, scene text detection algorithm called Center-enhanced Network (CENet) is proposed. To facilitate deployment on edge devices, CENet utilizes the lightweight ResNet18 as the backbone network. Center-Enhanced Module (CEM) is introduced to enhance the features of small-scale text, addressing the problem of small-scale text being easily overwhelmed by the background due to downsampling operations. Additionally, we incorporate a Shallow Feature-Augmented Module (SFAM) that leverages the rich semantics from high-level features to guide shallow features, thereby enhancing the detailed information of small-scale text in shallow layers. Ablation experiments on public datasets ICDAR2015 and MSRA-TD500 demonstrate the effectiveness of the CENet algorithm. Comparative experiments on the small-scale text dataset ICDAR2015 show that CENet improves accuracy and F-score compared to the mainstream DBNet algorithm.

       

    /

    返回文章
    返回