基于跨尺度和细粒度编码器的光学遥感图像建筑物检测
DOI:
CSTR:
作者:
作者单位:

嘉兴大学

作者简介:

通讯作者:

中图分类号:

基金项目:


Optical Remote Sensing Image Building Detection Based on Cross-scale and Fine-grained Encoders
Author:
Affiliation:

Jiaxing University

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    目前广泛使用的深度学习模型在提取光学遥感图像建筑物特征时,存在难以全面捕捉建筑物细节特征和语义信息的问题。为解决上述问题,本文提出了一种基于跨尺度和细粒度编码器的光学遥感图像建筑物检测方法。该方法设计了一个基于深度Swin变换器的跨尺度编码器和一个基于ResNeXt的细粒度编码器,从光学遥感图像中提取全局上下文信息和局部细节信息,实现建筑物高精度检测。本文还设计了分支融合模块,通过引入对位置敏感的通道注意力机制,增强模型对通道间关系建模的能力,实现了对目标建筑物的精确检测。在WHU、INRIA和Massachusetts数据集上进行了大量实验,并将提出的模型与多种先进模型进行了比较,实验结果验证了本文方法在光学遥感图像建筑物检测性能上的优越性。

    Abstract:

    Currently widely used deep learning models face challenges in comprehensively capturing detailed features and semantic information of buildings when extracting features from optical remote sensing images. To address these issues, this paper proposes a optical remote sensing image building detection method based on cross-scale and fine-grained encoders. The method designs a cross-scale encoder based on the deep Swin Transformer and a fine-grained encoder based on ResNeXt to extract global contextual information and local detailed information from optical remote sensing images, enabling high-precision building detection. This paper also designs a Branch Fusion Module that enhances the model's ability to model inter-channel relationships by introducing a position-sensitive channel attention mechanism, achieving precise detection of target buildings. Extensive experiments were conducted on the WHU, INRIA, and Massachusetts datasets, and the proposed model was compared with various state-of-the-art models. The experimental results validate the superior performance of the proposed method in building detection in optical remote sensing images.

    参考文献
    相似文献
    引证文献
引用本文
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-09-09
  • 最后修改日期:2025-11-18
  • 录用日期:2025-12-18
  • 在线发布日期:
  • 出版日期:
文章二维码