基于多模态遥感图像的特征融合模型
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

河北省自然科学基金(F2022208002)


Feature fusion model based on multi-modal remote sensing images
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    为了解决传统单分支网络在遥感图像语义分割中存在的模型精度受限于单模态数据、参数规模庞大等问题,提出了一种基于大核卷积的多模态特征融合网络(large-kernel convolution-based multi-modal feature fusion network,LMFNet)模型。采用改进的大核MobileNetV3(GMBNetV3)作为并行主干,通过交互自注意力强化模块(cross-self-attention enhancement module,CSAEM)融合多源特征,并利用门控单元聚合模块(gating unit aggregation module,GUAM)在解码阶段整合抽象与纹理信息。在公共数据集Potsdam和Vaihingen上,将LMFNet与当前先进的多模态图像分割模型进行性能对比,并进行消融实验验证模型各模块功能。结果表明:在Potsdam数据集下,LMFNet在参数量降低29.3%~73.6%的基础上,分割性能mIoU相较于其他先进多模态分割模型提升0.32个百分点~6.50个百分点,推理速度提高1.7%~45.9%。所提模型有效地融合了差异图像特征,能够更清晰地对遥感图像进行语义分割,可为城市管理中的遥感图像实例分割提供有力支持。

    Abstract:

    To address the issues such as limited model accuracy and large parameter scale of traditional single-branch networks in semantic segmentation of remote sensing images, a large-kernel convolution-based multi-modal feature fusion network (LMFNet) module was proposed. An improved large-kernel MobileNetV3 (GMBNetV3) was adopted as the parallel backbone, and multi-source features were fused through cross-self-attention enhancement module. The gated aggregator was utilized to integrate abstract and texture information in the decoding stage. On the public datasets Potsdam and Vaihingen, LMFNet was compared with current advanced multi-modal image segmentation models in terms of performance, and ablation experiments were conducted to verify the functions of each module of the model. The results show that LMFNet improves the segmentation performance of mIoU by approximately 0.32 percentage points~6.50 percentage points compared to other advanced multi-modal segmentation models, while reducing the parameter quantity by 29.3%~73.6%, and the inference speed is increased by 1.7%~45.9% on the Potsdam dataset. The proposed model effectively fuses the differences in image features and can perform semantic segmentation of remote sensing images more clearly, providing strong support for instance segmentation of remote sensing images in urban management.

    参考文献
    相似文献
    引证文献
引用本文

王建霞,仇绍祖,杨春金,吴长莉,张晓明.基于多模态遥感图像的特征融合模型[J].河北工业科技,2025,42(6):499-509

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-03-23
  • 最后修改日期:2025-10-15
  • 录用日期:
  • 在线发布日期: 2025-12-02
  • 出版日期:
文章二维码