基于超自适应策略的轻量化小目标检测网络
DOI:
CSTR:
作者:
作者单位:

1.无锡商业职业技术学院数字商务学院;2.江西师范大学计算机信息工程学院;3.南昌工学院信息与人工智能学院

作者简介:

通讯作者:

中图分类号:

TP183

基金项目:

国家自然科学(62462046);江苏省高校哲学社会科学研究一般项目(2024SJYB0708);江西省杰出青年(20252BAC220007)


A Lightweight Small Target Detection Network Based on Ultra Adaptive Strategy
Author:
Affiliation:

1.Digital Business College,WuXi Vocational Institute of Commerce,Wuxi;2.School of Computer and Information Engineering,Jiangxi Normal University,Nanchang;3.School of Information and Artificial Intelligence,Nanchang Institute of Science Technology,Nanchang

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    无人机航拍图像中小目标像素占比小、特征稀疏,通用检测方法精度受限。为此,本文提出基于YOLOv8的小目标检测网络UAST-YOLOv8。针对传统交并比(Intersection over Union, IoU)损失函数对小目标学习不充分的问题,设计动态自适应聚焦损失函数DAFIoU,通过自适应尺度感知权重和动态聚焦机制增强小目标边界框回归精度。针对检测头参数量大的问题,设计轻量级共享卷积解耦检测头(Lightweight Shared Convolutional Decoupled Detection Head, LSCDD),利用参数共享策略降低模型复杂度。为增强网络对关键区域的关注能力,提出多维联合注意力模块(Multi-Dimensional Joint At-tention, MDJA),融合位置编码与头间信息交互机制实现空间和通道维度的联合特征提取。在VisDrone2019和RSOD数据集上,UAST-YOLOv8的mAP@50分别达到40.7%和94.4%,较基线YOLOv8n分别提升7.8和8.7个百分点,参数量仅为2.3M,验证了该方法的有效性。

    Abstract:

    Small targets in UAV aerial images occupy few pixels and exhibit sparse features, limiting the accuracy of general detection methods. This paper proposes UAST-YOLOv8, a small target detection network based on YOLOv8. To address the insufficient learning of small targets by conventional IoU loss functions, a Dynamic Adaptive Focusing IoU loss function (DAFIoU) is designed. DAFIoU employs adaptive scale-aware weighting and a dynamic focusing mecha-nism to improve bounding box regression accuracy for small targets. To reduce the high pa-rameter count of the detection head, a Lightweight Shared Convolutional Decoupled Detec-tion Head (LSCDD) is developed, which adopts parameter sharing strategies to lower model complexity. To enhance the network's focus on key target regions, a Multi-Dimensional Joint Attention module (MDJA) is proposed. MDJA integrates positional encoding with inter-head information interaction to achieve joint feature extraction across spatial and channel dimen-sions. Experiments on the VisDrone2019 and RSOD datasets show that UAST-YOLOv8 achieves mAP@50 of 40.7% and 94.4%, surpassing the baseline YOLOv8n by 7.8 and 8.7 percentage points, respectively, with only 2.3M parameters. These results validate the effec-tiveness of the proposed method for small target detection in UAV aerial imagery.

    参考文献
    相似文献
    引证文献
引用本文
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2026-02-04
  • 最后修改日期:2026-04-07
  • 录用日期:2026-04-17
  • 在线发布日期:
  • 出版日期:
文章二维码