基于跨模态选择与感知细化的RGB-D显著性目标检测
DOI:
CSTR:
作者:
作者单位:

安徽理工大学 计算机科学与工程学院

作者简介:

通讯作者:

中图分类号:

TP391

基金项目:

国家自然科学基金项目(面上项目,重点项目,重大项目)


RGB-D Salient Object Detection Based on Cross-Modal Selection and Perceptual Refinement
Author:
Affiliation:

School of Computer Science and Engineering,Anhui University of Science and Technology,Huainan

Fund Project:

The National Natural Science Foundation of China (General Program, Key Program, Major Research Plan)

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    RGB-D显著性目标检测旨在融合RGB外观信息与深度结构信息,实现对显著区域的准确定位。针对现有方法在跨模态互补建模与多层特征融合方面存在的不足,本文提出一种跨模态选择与感知细化网络。该方法通过跨模态选择融合机制,在通道、空间与频域层面自适应建模RGB与深度特征的差异性与互补性;同时在高层语义阶段引入调制融合策略,增强模态融合的选择性与稳定性;在解码阶段利用感知细化机制逐层融合高、低层特征,以提升显著目标的结构完整性与边界表达能力。在六个主流RGB-D显著性目标检测数据集上的实验结果表明,所提出的方法在检测精度、边界保持性及复杂场景鲁棒性方面均优于现有方法。

    Abstract:

    RGB-D salient object detection aims to accurately locate salient regions by jointly exploiting RGB appearance information and depth structural cues. To address the shortcomings of existing methods in modeling cross-modal complementarity and multi-level feature fusion, this paper proposes a cross-modal selection and perceptual refinement network. Specifically, a cross-modal selective fusion strategy adaptively models RGB–depth differences and complementarities at the channel, spatial, and frequency levels, while a modulated fusion mechanism enhances the selectivity and stability of high-level semantic fusion. Furthermore, a perceptual refinement scheme is incorporated into the decoding stage to progressively integrate high-level semantics with low-level structural details, thereby improving the structural integrity and boundary representation of salient objects. Experiments on six widely used RGB-D salient object detection benchmarks demonstrate that the proposed method consistently outperforms existing approaches in detection accuracy, boundary preservation, and robustness in complex scenes.

    参考文献
    相似文献
    引证文献
引用本文
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-12-17
  • 最后修改日期:2026-02-08
  • 录用日期:2026-03-16
  • 在线发布日期:
  • 出版日期:
文章二维码