面向复杂路况的车辆行人检测模型研究与部署
DOI:
CSTR:
作者:
作者单位:

呼伦贝尔学院

作者简介:

通讯作者:

中图分类号:

基金项目:

平安边境建设下呼伦贝尔边境沿线智能化设备数据整合管理的研究和思考


Research and deployment of vehicle pedestrian detection model for complex road conditions
Author:
Affiliation:

人工智能与大数据学院

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    为解决复杂路况下车辆与行人检测中小目标特征提取不足、目标遮挡及边缘设备部署适配性差等问题,本研究提出一种车辆行人检测模型AMM-YOLO。首先,设计自适应双分支注意力模块(Adaptive Dual-branch Attention,ADA),动态平衡通道与空间特征表达,增强小目标与被遮挡目标的表征能力;其次,构建多尺度加权拼接特征融合模块(Multi-scale Weighted Concatenation Fusion,MWCF),最大程度保留通道信息,实现多尺度语义高效融合;最后,引入MPDIoU(Minimum Point Distance Intersection over Union)损失函数,优化边界框定位精度并加快收敛。实验结果表明,AMM-YOLO在SODA10M数据集上mAP@0.5达63.3%,较基准模型提升5.2个百分点;在VisDrone2019数据集上mAP@0.5达45.2%,优于其他主流模型。经INT8量化后,模型大小仅3.7MB,FPS达29.5,满足轻量化与实时性需求,可为智能交通监控与自动驾驶提供技术支持。

    Abstract:

    To address the problems of insufficient feature extraction for small targets, target occlusion, and poor adaptability for edge device deployment in vehicle and pedestrian detection under complex road conditions, this paper proposes a vehicle and pedestrian detection model named AMM-YOLO. First, an Adaptive Dual-branch Attention (ADA) module is designed to dynamically balance channel and spatial feature expressions, enhancing the representation capability of small and occluded targets. Second, a Multi-scale Weighted Concatenation Fusion (MWCF) module is constructed to preserve channel information to the maximum extent and achieve efficient multi-scale semantic fusion. Finally, the Minimum Point Distance Intersection over Union (MPDIoU) loss function is introduced to optimize bounding box localization accuracy and accelerate convergence. Experimental results demonstrate that AMM-YOLO achieves a mAP@0.5 of 63.3% on the SODA10M dataset, representing a 5.2 percentage point improvement over the baseline model, and a mAP@0.5 of 45.2% on the VisDrone2019 dataset, outperforming other mainstream models. After INT8 quantization, the model size is only 3.7 MB with an FPS of 29.5, satisfying lightweight and real-time requirements and providing technical support for intelligent traffic monitoring and autonomous driving.

    参考文献
    相似文献
    引证文献
引用本文
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-12-16
  • 最后修改日期:2026-01-31
  • 录用日期:2026-02-08
  • 在线发布日期:
  • 出版日期:
文章二维码