(1.School of Information Science and Engineering, Yunnan University, Kunming 650000, China; 2.Yunnan Communications Investment and Construction Group Co., Ltd., Kunming 650000, China)
To address the shortcomings of existing target tracking algorithms, namely the inability to extract deep-level features, the failure to fully exploit cross-modal information, and the weak representation of target features, a feature fusion shift Siamese network for RGB-T target tracking is proposed. First, a target tracking framework based on the visible-modality SiameseRPN++ is extended with an infrared-modality branch to obtain a multimodal target tracking framework, and an improved ResNet50 with adjusted stride is adopted as the feature extraction network to capture deep-level features of the target. Subsequently, a multimodal feature interactive learning module (FIM) is designed, which leverages the discriminative information of one modality to guide the learning of target appearance features in the other; by mining cross-modal information in the feature space and channels, the module strengthens the network's attention to foreground information. Thereafter, a multimodal feature fusion module (FAM) is designed, which computes the degree of fusion between the input visible and infrared images, spatially fuses the salient features of the different modalities to effectively eliminate redundant information, and reconstructs the multimodal image via a cascade fusion strategy. Finally, a feature space shift module (FSM) is designed, which partitions the feature maps of the infrared branch and shifts the partitions in four different directions to enhance the edge representation of the heat source target. Extensive experiments on two RGB-T datasets validate the effectiveness of the proposed algorithm, and ablation studies demonstrate the effectiveness of each designed module.
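To illustrate the kind of cross-modal guidance the FIM describes, the following is a minimal sketch (not the paper's implementation): each modality's global channel statistics gate the other modality's features, so that discriminative channels in one branch guide appearance learning in the other. The function name and the residual re-weighting scheme are assumptions for illustration only.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cross_modal_guidance(feat_rgb, feat_ir):
    """Hypothetical sketch of cross-modal interactive learning:
    per-channel descriptors from one modality re-weight the other
    modality's feature map (channel-wise attention with a residual)."""
    # global average pooling -> per-channel descriptor, shape (C, 1, 1)
    w_rgb = sigmoid(feat_rgb.mean(axis=(1, 2), keepdims=True))
    w_ir = sigmoid(feat_ir.mean(axis=(1, 2), keepdims=True))
    # cross-guided re-weighting: each branch is gated by the other
    out_rgb = feat_rgb + feat_rgb * w_ir
    out_ir = feat_ir + feat_ir * w_rgb
    return out_rgb, out_ir
```

The residual form keeps the original features intact while letting the other modality emphasize foreground channels, matching the abstract's goal of guiding rather than replacing each branch's representation.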
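The feature space shift operation in the FSM can be sketched as follows. This is an assumed reading of the abstract, not the authors' code: the channel dimension is split into four groups, each group is shifted one step in a different spatial direction, and the vacated positions are zero-padded, which tends to sharpen responses at heat-source edges.

```python
import numpy as np

def feature_space_shift(feat, shift=1):
    """Hypothetical FSM sketch: split channels of a (C, H, W) feature
    map into four groups and shift each group spatially (up, down,
    left, right), zero-padding the vacated border positions."""
    c, h, w = feat.shape
    out = np.zeros_like(feat)
    g = c // 4
    # group 0: shift up
    out[:g, :-shift, :] = feat[:g, shift:, :]
    # group 1: shift down
    out[g:2 * g, shift:, :] = feat[g:2 * g, :-shift, :]
    # group 2: shift left
    out[2 * g:3 * g, :, :-shift] = feat[2 * g:3 * g, :, shift:]
    # group 3: shift right
    out[3 * g:4 * g, :, shift:] = feat[3 * g:4 * g, :, :-shift]
    # any leftover channels (c % 4) pass through unchanged
    out[4 * g:] = feat[4 * g:]
    return out
```

Because each group sees its neighborhood from a different offset, subsequent convolutions can compare shifted and unshifted channels, which is one plausible way a shift module enhances edge representation at negligible computational cost.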