"cross-attention maps" Papers

10 papers found

Scale Your Instructions: Enhance the Instruction-Following Fidelity of Unified Image Generation Model by Self-Adaptive Attention Scaling

Chao Zhou, Tianyi Wei, Nenghai Yu

ICCV 2025, arXiv:2507.16240, 3 citations

Self-Cross Diffusion Guidance for Text-to-Image Synthesis of Similar Subjects

Weimin Qiu, Jieke Wang, Meng Tang

CVPR 2025, arXiv:2411.18936, 8 citations

T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting

Yifei Qian, Zhongliang Guo, Bowen Deng et al.

CVPR 2025 (highlight), arXiv:2502.20625, 9 citations

Directed Diffusion: Direct Control of Object Placement through Attention Guidance

Wan-Duo Ma, Avisek Lahiri, J. P. Lewis et al.

AAAI 2024, arXiv:2302.13153, 83 citations

Easing Concept Bleeding in Diffusion via Entity Localization and Anchoring

Jiewei Zhang, Song Guo, Peiran Dong et al.

ICML 2024

Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation

guo, Tianwei Lin

CVPR 2024, arXiv:2312.10113, 64 citations

Prompt-guided Precise Audio Editing with Diffusion Models

Manjie Xu, Chenxing Li, Duzhen Zhang et al.

ICML 2024, arXiv:2406.04350, 14 citations

Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance

Dazhong Shen, Guanglu Song, Zeyue Xue et al.

CVPR 2024, arXiv:2404.05384, 38 citations

T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models

Zhongqi Wang, Jie Zhang, Shiguang Shan et al.

ECCV 2024, arXiv:2407.04215, 28 citations

Unsupervised Keypoints from Pretrained Diffusion Models

Eric Hedlin, Gopal Sharma, Shweta Mahajan et al.

CVPR 2024 (highlight), arXiv:2312.00065, 19 citations