基于空间频域特征细化与聚集的单目深度估计方法
DOI:
CSTR:
作者:
作者单位:

安徽理工大学

作者简介:

通讯作者:

中图分类号:

TP391

基金项目:

国家自然科学基金(62102003)、安徽省博士后科学基金(2022B623)、安徽省高校协同创新项目(GXXT-2022-038)、淮南市科技计划项目(2023A316)


Monocular Depth Estimation Method Based on Spatial Frequency Feature Refinement and Aggregation
Author:
Affiliation:

Anhui University of Science and Technology

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    自监督单目深度估计旨在仅从单张RGB图像预测像素级密集深度图,无需真值监督。针对现有方法在场景结构感知与局部细节处理的不足,本文提出基于空间频域特征细化与聚集的新方法。该方法核心包含空间频域特征细化模块与双流动态聚集模块。具体来说,空间频域特征细化模块通过空间细化单元提取并处理多尺度细粒度局部特征,同时结合频域细化单元利用离散余弦变换与多角度通道注意力机制增强场景结构感知能力,并有效抑制噪声与冗余信息;其次,双流动态聚集模块通过双流卷积注意力机制自适应融合全局与局部深度线索。实验表明,本方法在主流数据集上性能显著优于先进模型,实现精度与参数量的高效平衡,并展现出优异跨数据集泛化能力。

    Abstract:

    Self-supervised monocular depth estimation aims to predict pixel-wise dense depth maps from a single RGB image without relying on ground-truth supervision. To address the limitations of existing methods in scene structure perception and local detail handling, this paper proposes a new method based on spatial frequency domain feature refinement and aggregation., which includes a spatial frequency feature refinement module and a dual-stream dynamic aggregation module. Specifically, the spatial frequency feature refinement module extracts and processes multi-scale fine-grained local features through the spatial refinement unit, and combines the frequency refinement unit to enhance the scene structure perception ability and suppress noise and redundant information by using discrete cosine transform and multi-angle channel attention mechanism. Secondly, the dual-stream dynamic aggregation module adaptively fuses global and local depth cues through the dual-stream convolutional attention mechanism. Extensive experimental results show that the performance of the proposed method is significantly better than the existing advanced models on the mainstream datasets, which achieves a good balance between accuracy and parameter quantity, and shows excellent cross-dataset generalization ability.

    参考文献
    相似文献
    引证文献
引用本文
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-04-01
  • 最后修改日期:2025-06-26
  • 录用日期:2025-07-16
  • 在线发布日期:
  • 出版日期:
文章二维码