Back to Search Start Over

Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer

Authors :
Ge, Yanqi
Liu, Jiaqi
Fan, Qingnan
Jiang, Xi
Huang, Ye
Qin, Shuai
Gu, Hong
Li, Wen
Duan, Lixin
Publication Year :
2024

Abstract

In this work, we target the task of text-driven style transfer in the context of text-to-image (T2I) diffusion models. The main challenge is consistent structure preservation while enabling effective style transfer effects. The past approaches in this field directly concatenate the content and style prompts for a prompt-level style injection, leading to unavoidable structure distortions. In this work, we propose a novel solution to the text-driven style transfer task, namely, Adaptive Style Incorporation~(ASI), to achieve fine-grained feature-level style incorporation. It consists of the Siamese Cross-Attention~(SiCA) to decouple the single-track cross-attention to a dual-track structure to obtain separate content and style features, and the Adaptive Content-Style Blending (AdaBlending) module to couple the content and style information from a structure-consistent manner. Experimentally, our method exhibits much better performance in both structure preservation and stylized effects.

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2404.06835
Document Type :
Working Paper