基于HMF-YOLO模型的钢材表面缺陷检测
DOI:
CSTR:
作者:
作者单位:

安徽理工大学计算机科学与工程学院

作者简介:

通讯作者:

中图分类号:

TP391.4

基金项目:

国家自然科学基金面上项目


Steel surface defect detection based on HMF-YOLO model
Author:
Affiliation:

School of Computer Science and Engineering, Anhui University of Science and Technology

Fund Project:

National Natural Science Foundation of China

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对现有钢材表面缺陷检测中由于特征提取和融合性不足,易受背景干扰导致模型性能受限等问题,提出一种基于YOLOv8s的改进模型HMF-YOLO。首先,采用HGNetV2作为主干网络,并结合多尺度并行卷积模块(Multi-Scale Parallel Convolution Module, MSPC)提高特征提取能力;其次,构建多层交互式融合网络(Multi-Layer Interactive Fusion Network, MIFN)作为颈部结构,实现不同层级语义信息的高效交互与融合,从而提升模型的多尺度表达能力;最后,设计了一种频域自适应增强模块(Frequency Domain Adaptive Enhancement Module, FAEM),结合小波变换卷积,充分学习特征图频域信息,增强模型的缺陷感知能力与鲁棒性。在NEU-DET和GC10-DET数据集上,HMF-YOLO的mAP分别达到80.3%和73.6%,较YOLOv8s提升5.0%和4.0%,参数量减少50.5%,计算量降低35.2%。结果表明,该方法在保证轻量化的同时显著提升检测性能,满足工业场景需求。

    Abstract:

    To address the performance limitations in steel surface defect detection caused by insufficient feature extraction and fusion, as well as susceptibility to background interference, an improved model named HMF-YOLO based on YOLOv8s is proposed. First, HGNetV2 is adopted as the backbone network, and a Multi-Scale Parallel Convolution Module (MSPC) is introduced to enhance feature extraction capability. Second, a Multi-Layer Interactive Fusion Network (MIFN) is constructed as the neck structure to enable efficient interaction and fusion of semantic information across different feature levels, thereby improving the model’s multi-scale representation ability. Finally, a Frequency Domain Adaptive Enhancement Module (FAEM) is designed, which incorporates wavelet transform convolution to effectively learn frequency-domain information from feature maps, enhancing the model’s defect perception capability and robustness. Experimental results on the NEU-DET and GC10-DET datasets show that the mAP of HMF-YOLO reaches 80.3% and 73.6%, achieving improvements of 5.0% and 4.0% over YOLOv8s, respectively, while reducing the number of parameters by 50.5% and computational cost by 35.2%. These results demonstrate that the proposed method significantly improves detection performance while maintaining a lightweight design, meeting the requirements of industrial applications.

    参考文献
    相似文献
    引证文献
引用本文
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2026-01-25
  • 最后修改日期:2026-04-22
  • 录用日期:2026-04-24
  • 在线发布日期:
  • 出版日期:
文章二维码