基于动态卷积和超图交互的多实例人体解析方法
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP391

基金项目:

国家自然科学基金项目(62001099);中央高校基本科研业务费专项自由探索项目(2232023D-30).


A multi-instance human parsing method based on dynamic convolution and hypergraph interaction
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    多实例人体解析旨在分割自然场景图像中的多个人体实例及其部件. 现有方法通常依赖静态卷积核并行地分割部件和实例, 导致部件与实例特征缺乏关联难以适应人体姿态和服装外观的多样性. 针对该问题, 提出一种基于动态卷积与超图交互的多实例人体解析方法. 首先, 将分割目标划分为部件、半身、实例3种层次, 并对应地配置可学习的动态卷积核; 同时, 设计多尺度掩码注意力机制来引导各层次动态卷积核聚合图像特征, 以适应人体姿态和服装外观的多样性. 然后, 提出超图交互模块, 将部件动态卷积核作为节点, 实例和半身动态卷积核作为超边, 以刻画人体结构先验. 最后, 通过超图上的消息传递来实现部件与实例间的特征交互. 实验结果表明, 所提出方法在MHP-v2.0、CIHP和Densepose数据集上可超越多种基线方法, 在$ {\rm AP}_{50}^p$、$ {\rm AP}_{\rm vol}^p $和$ {\rm PCP}_{50} $三个指标上分别平均地提升了14.6%、5.8%和10.7%. 进一步地, 消融和可视化实验结果验证了动态卷积核和超图交互模块的有效性.

    Abstract:

    Multi-instance human parsing aims to segment multiple human instances and their corresponding parts in natural scene images. Existing methods typically rely on static convolution kernels to segment parts and instances in parallel, resulting in a lack of correlation between part and instance features, and thus limiting adaptability to the diversity of human poses and clothing appearances. To address this issue, this paper proposes a multi-instance human parsing method based on dynamic convolution and hypergraph interaction. Segmentation targets are hierarchically divided into three levels: parts, half-body, and instances, with corresponding learnable dynamic convolution kernels configured for each target. Meanwhile, a multi-scale mask attention mechanism is designed to guide the dynamic convolution kernels in aggregating image features across different levels, thereby adapting to the diversity of human poses and clothing appearances. A hypergraph interaction module is proposed, where part dynamic convolution kernels serve as nodes, and instance and half-body dynamic convolution kernels are treated as hyperedges, to model structural priors of the human body. Feature interaction between parts and instances is achieved through message passing on the hypergraph. Experimental results demonstrate that the proposed method outperforms various baseline methods on the MHP-v2.0, CIHP, and Densepose datasets, achieving average improvements of 14.6%, 5.8%, and 10.7% in $ {\rm AP}_{50}^p $, $ {\rm AP}_{\rm vol}^p $, and $ {\rm PCP}_{50} $ metrics, respectively. Furthermore, ablation and visualization experiments validate the effectiveness of the dynamic convolution kernels and the hypergraph interaction module.

    参考文献
    相似文献
    引证文献
引用本文

黄荣,袁家奇,刘浩,等.基于动态卷积和超图交互的多实例人体解析方法[J].控制与决策,2026,41(1):276-288

复制
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-03-25
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2025-12-30
  • 出版日期: 2026-01-10
文章二维码