Cross-modality person re-identification based on dual enhancement network

doi:10.16136/i.joel.2024.07.0783

Published in: 2024, Vol. 35, No. 7, pp. 745-752

Authors: Chen Mengdie (陈梦蝶), Lu Jian (卢健), Zhang Qi (张奇)
Affiliation: School of Electronics and Information, Xi'an Polytechnic University, Xi'an, Shaanxi 710600, China
About the author: Lu Jian (born 1978), male, Ph.D., associate professor and master's supervisor. His research covers robot navigation and intelligent decision-making, artificial intelligence, and machine vision.
Funding: Supported by the Applied Technology R&D Program of Beilin District, Xi'an (GX2007)

Abstract:

To address the low accuracy of cross-modality person re-identification (ReID) caused by heterogeneous sample differences, pedestrian occlusion, and background interference, this paper proposes a dual enhanced network (DEN) based on channel and feature learning. At the channel level, the visible-light channels are randomly exchanged to mine the relationship between visible and infrared channels, which makes the model more robust to variation across multimodal samples. At the feature level, a normalization-based attention module (NAM) is introduced before the modality-shared network; by penalizing weights with small contribution factors, it keeps noise from interfering with the learning of modality-invariant information. A feature separation module (FSM) also separates identity-related features from identity-unrelated features, improving the model's ability to recognize heterogeneous samples. Finally, the network is trained under the joint supervision of a hard-sample triplet loss and a weighted regularization loss, which constrains pedestrian feature learning. On the RegDB dataset, DEN reaches a Rank-1 accuracy of 94.86% and an mAP of 90.10%.
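As an illustration of two ideas named in the abstract, the sketch below shows one plausible reading of channel-level random exchange (permuting the R, G, B channels of a visible image) and a standard batch-hard triplet loss. The abstract does not specify the exact exchange scheme, the distance metric, or the margin, so the function names `random_channel_exchange` and `batch_hard_triplet_loss`, the uniform random permutation, the Euclidean distance, and the margin of 0.3 are all assumptions made here for illustration, not the authors' implementation.

```python
import random


def random_channel_exchange(image, rng=random):
    """Return a copy of an RGB image with its three channels randomly permuted.

    `image` is a list of rows; each row is a list of (r, g, b) tuples.
    Assumption: the same permutation is applied to every pixel of the image.
    """
    order = [0, 1, 2]
    rng.shuffle(order)  # output channel k takes its values from input channel order[k]
    return [[tuple(pixel[c] for c in order) for pixel in row] for row in image]


def batch_hard_triplet_loss(features, labels, margin=0.3):
    """Mean batch-hard triplet loss with Euclidean distance.

    For each anchor, the hardest positive is the farthest sample with the same
    label and the hardest negative is the closest sample with a different label.
    Anchors lacking either a positive or a negative are skipped.
    """
    def dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

    losses = []
    for i, (anchor, anchor_label) in enumerate(zip(features, labels)):
        positives = [dist(anchor, f) for j, (f, l) in enumerate(zip(features, labels))
                     if j != i and l == anchor_label]
        negatives = [dist(anchor, f) for f, l in zip(features, labels)
                     if l != anchor_label]
        if positives and negatives:
            losses.append(max(0.0, margin + max(positives) - min(negatives)))
    return sum(losses) / len(losses) if losses else 0.0
```

In a real pipeline the exchange would be applied to visible images as a data augmentation step before they enter the network, and the loss would be computed on feature vectors from both modalities within one mini-batch; the pure-Python lists here only keep the sketch free of external dependencies.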

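Cite this article

Chen Mengdie, Lu Jian, Zhang Qi. Cross-modality person re-identification based on dual enhancement network [J]. Journal of Optoelectronics·Laser (光电子激光), 2024, 35(7): 745-752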

History

Received: 2022-11-17
Last revised: 2023-03-12
Published online: 2024-06-05
