基于AMD-YOLOv12n的钢材表面缺陷检测算法
DOI:
CSTR:
作者:
作者单位:

1.华北理工大学电气工程学院;2.华北理工大学冶金与能源学院;3.唐山市钢铁企业流程控制与优化技术创新中心唐山阿诺达自动化有限公司

作者简介:

通讯作者:

中图分类号:

TP391.4

基金项目:

国家自然科学基金(51904107)、中央引导地方科技发展资金项目(236Z1017G)和唐山市市级科技计划项目(No.22130220G,No.22130204G)


Steel Surface Defect Detection Algorithm Based on AMD-YOLOv12n
Author:
Affiliation:

1.North China University of Science and Technology,College of Electrical Engineering,Hebei;2.Tangshan Iron and Steel Enterprise Process Control and Optimization Technology Innovation Center Tangshan ANODE Automation Co,Ltd,Hebei;3.North China University of Science and Technology,College of Metallurgy and Energy,Hebei

Fund Project:

National Natural Science Foundation of China (51904107), Central Government-Guided Local Science and Technology Development Fund Project (236Z1017G), and Tangshan Municipal Science and Technology Plan Project (No.22130220G, No.22130204G)

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对现有钢材缺陷检测模型结构复杂、检测精度和实时性较差等问题,提出一种基于YOLOv12n轻量高效的钢材缺陷检测算法AMD-YOLOv12n。该方法首先通过引用自适应矩形卷积(Adaptive Rectangular Convolution,ARConv),动态学习卷积核的高度和宽度生成矩形卷积核,有效捕捉图像中不同尺度缺陷的特征;接着引入多感受野特征交互模块(Multi-Receptive Field Feature Interaction,MRFFI),通过通道维度的异构处理实现多尺度特征融合;最后重构特征交互模块C3K2_DIA(Deformable Interactive Attention,DIA),在高效提取特征的同时降低参数量和计算复杂度。在NEU-DET钢材缺陷数据集上进行实验验证,相较于YOLOv12n算法,AMD-YOLOv12n算法的mAP值和召回率分别提升3.2%和3.6%、检测速度达到260.4帧、参数量和计算量分别降低0.59M和1.4G,满足工业质检场景对高精度与实时性的双重需求。

    Abstract:

    To address the issues of complex structure, low detection accuracy, and poor real-time performance in existing steel defect detection models, the authors proposed a lightweight and efficient steel defect detection algorithm based on YOLOv12n, dubbed AMD-YOLOv12n. The method first introduced Adaptive Rectangular Convolution (ARConv), which dynamically learned the height and width of convolution kernels to generate rectangular kernels, thereby effectively capturing features of defects at different scales in images. Subsequently, a Multi-Receptive Field Feature Interaction (MRFFI) module was introduced to achieve multi-scale feature fusion through heterogeneous processing of channel dimensions. Finally, the feature interaction module C3K2_DIA (Deformable Interactive Attention, DIA) was reconstructed, reducing the number of parameters and computational load while efficiently extracting features. Experiments were conducted on the NEU-DET steel defect dataset. Compared with the YOLOv12n algorithm, the AMD-YOLOv12n algorithm increased mAP and recall rate by 3.2% and 3.6%, respectively, achieved a detection speed of 260.4 frames per second, and reduced the number of parameters and computational load by 0.59M and 1.4G, respectively. This meets the dual requirements of high precision and real-time performance in industrial quality inspection scenarios.

    参考文献
    相似文献
    引证文献
引用本文
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-09-18
  • 最后修改日期:2025-11-10
  • 录用日期:2025-12-04
  • 在线发布日期:
  • 出版日期:
文章二维码