域不变多模态多源数据行人重识别协同优化架构

doi:10.13195/j.kzyjc.2024.0584

首页 > 过刊浏览>2025年第40卷第4期 >1276-1284. DOI:10.13195/j.kzyjc.2024.0584

域不变多模态多源数据行人重识别协同优化架构
DOI:
                        10.13195/j.kzyjc.2024.0584
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:TP391
基金项目:国家自然科学基金项目(62173160).

Domain-invariant collaborative optimization architecture for multi-modal multi-source person re-identification

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

为应对24小时全时段视频监控需求, 行人重识别任务必须要同时应对单模态与跨模态重识别的挑战, 但目前的行人重识别任务通常将单模态与跨模态作为两个分支独立研究, 忽视了这两个任务互补的应用价值. 为应对这一挑战, 提出一种可协同训练单模态和跨模态行人数据集的可迁移网络架构, 通过三路分支网络深入挖掘每个模态的行人信息, 并深入探讨协同优化架构中的域间隙和模态样本量不平衡问题, 有效实现多任务的优化问题. 针对域间隙的问题提出一种低层级特征拉近策略, 显著减少了跨域样本特征间的差异, 使得模型能够学习并提取域不变的语义特征. 同时, 为了解决模态样本量不平衡的问题, 设计一种弱模态特征挖掘策略, 通过灵活调整训练权重, 使模型更加关注弱模态的优化. 实验结果表明, 所提出的框架可以迁移到使用ResNet作为主干网络的众多主流方法上, 其中在经典方法AGW的基础上Rank1和mAP分别提高了23.79 %和17.78 %.

Abstract:

To meet the demands of 24-hour video surveillance, the person re-identification task must tackle the challenges of single-mode and cross-mode re-identification at the same time. However, current person re-identification tasks often treat single-mode and cross-mode as separate branches, neglecting their complementary application values. To address this challenge, this paper introduces a transferable network architecture that can collaboratively train datasets for both single-mode and cross-mode person re-identification. The network deeply excavates person information from each modality through a three-way branch, and thoroughly examines the issues of domain gap and imbalanced modal samples within the collaborative optimization architecture, effectively addressing multi-task optimization issues. To narrow the domain gap, the paper introduces a low-level feature alignment strategy that significantly reduces differences in cross-domain sample features, enabling the model to learn domain-invariant semantic features, thus enhancing generalization. Moreover, the paper devises a weak modality feature mining strategy to handle modality sample imbalances, dynamically adjusting training weights to prioritize optimizing the weaker modality. The experimental results show that the proposed framework can be transferred to a large number of mainstream methods that employ ResNet as the backbone network. Compared to the method AGW, Rank1 and mAP are increased by 23.79 % and 17.78 % , respectively.

参考文献

相似文献

引证文献

引用本文

邹子琳,陈莹.域不变多模态多源数据行人重识别协同优化架构[J].控制与决策,2025,40(4):1276-1284

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2024-05-16
最后修改日期:
录用日期:
在线发布日期: 2025-03-21
出版日期: 2025-04-20

首页

期刊简介

编委会

作者中心

精选专辑

品牌联动

引用本文

相关视频

分享

文章指标

历史

文章二维码