基于融合特征嵌入与自适应特征重组的无训练图像合成方法
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TN911.73;TP183

基金项目:

国家自然科学基金项目(62166025).


Training-free image composition method based on feature fusion embedding and adaptive feature reorganization
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    现有基于扩散模型的无训练图像合成方法通常在背景图像指定区域嵌入前景图像特征信息, 引导图像合成过程. 然而, 这种嵌入方式会干扰扩散模型去噪过程, 导致前景与背景不一致、语义对齐不佳等问题. 为此, 提出一种新颖的无训练图像合成方法, 包括互补融合特征嵌入和自适应特征重组两个模块. 首先, 互补融合特征嵌入引入由U-Net自注意力机制提取的组合图像特征, 该特征由前景与背景图像特征构成, 能够在保留前景信息的同时, 补偿传统嵌入方式所丢失的背景语义信息; 随后, 嵌入组合图像与前景图像的融合特征以引导合成过程, 并调控嵌入特征的数量以降低合成偏差; 同时, 为解决特征嵌入带来的图像过渡区域伪影问题, 引入自适应特征重组策略, 该策略通过分析相邻特征协方差关系, 识别并替换导致不连贯伪影的异常特征, 从而提升图像的连贯性. 实验表明, 所提出方法提升了语义对齐、背景与前景一致性, 实现了更协调的合成效果, 为无训练合成任务提供了解决方案.

    Abstract:

    Existing training-free image composition methods based on diffusion models typically embed the feature information of foreground images into specified regions of the background image to guide the composition process. However, this embedding method often disrupts the denoising process of diffusion models, leading to inconsistency between foreground and background, as well as poor semantic alignment. To address these issues, we propose a novel training-free image composition method that incorporates two modules: complementary feature fusion embedding (CFFE) and adaptive feature reorganization (AFR). First, CFFE introduces composite image features extracted by the U-Net self-attention mechanism. These features are constructed from both foreground and background image information, enabling the preservation of foreground guidance while compensating for the loss of background semantic information in traditional embedding methods. Subsequently, the composite image and foreground fusion features are embedded to guide the composition process, while the number of embedded features is regulated to reduce potential composition biases. Furthermore, to address artifacts in the image transition regions caused by feature embedding, we introduce the AFR strategy. This strategy analyzes the covariance relationships of adjacent features to identify and replace abnormal features that cause discontinuous artifacts for enhancing image coherence. Experimental results demonstrate that the proposed method effectively improves foreground-background consistency and semantic alignment in synthesized images while achieving more harmonious image composition. This work also provides an effective solution for training-free image composition tasks.

    参考文献
    相似文献
    引证文献
引用本文

赵宏,郑狄威.基于融合特征嵌入与自适应特征重组的无训练图像合成方法[J].控制与决策,2026,41(3):788-800

复制
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-05-14
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-03-04
  • 出版日期: 2026-03-10
文章二维码