"controllable generation" Papers

18 papers found

Causal Composition Diffusion Model for Closed-loop Traffic Generation

Haohong Lin, Xin Huang, Tung Phan-Minh et al.

CVPR 2025arXiv:2412.17920
13
citations

InCoDe: Interpretable Compressed Descriptions For Image Generation

Armand Comas, Aditya Chattopadhyay, Feliu Formosa et al.

ICLR 2025
1
citations

LaViDa: A Large Diffusion Model for Vision-Language Understanding

Shufan Li, Konstantinos Kallidromitis, Hritik Bansal et al.

NEURIPS 2025spotlight

MallowsPO: Fine-Tune Your LLM with Preference Dispersions

Haoxian Chen, Hanyang Zhao, Henry Lam et al.

ICLR 2025arXiv:2405.14953
15
citations

MaRS: A Fast Sampler for Mean Reverting Diffusion based on ODE and SDE Solvers

Ao Li, Wei Fang, Hongbo Zhao et al.

ICLR 2025arXiv:2502.07856
5
citations

Matryoshka Pilot: Learning to Drive Black-Box LLMs with LLMs

ChangHao Li, Yuchen Zhuang, Rushi Qiang et al.

NEURIPS 2025arXiv:2410.20749
6
citations

MIDI-GPT: A Controllable Generative Model for Computer-Assisted Multitrack Music Composition

Philippe Pasquier, Jeff Ens, Nathan Fradet et al.

AAAI 2025paperarXiv:2501.17011
11
citations

Simple Guidance Mechanisms for Discrete Diffusion Models

Yair Schiff, Subham Sahoo, Hao Phung et al.

ICLR 2025arXiv:2412.10193
76
citations

Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models

Zerui Tao, Yuhta Takida, Naoki Murata et al.

ICCV 2025arXiv:2501.08727
3
citations

X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenarios

Yichen Xie, Chenfeng Xu, Chensheng Peng et al.

ICLR 2025arXiv:2411.01123
8
citations

Audio Generation with Multiple Conditional Diffusion Model

Zhifang Guo, Jianguo Mao, Tao Rui et al.

AAAI 2024paperarXiv:2308.11940
32
citations

CAGE: Controllable Articulation GEneration

Jiayi Liu, Hou In Ivan Tam, Ali Mahdavi Amiri et al.

CVPR 2024arXiv:2312.09570
42
citations

ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

Ming Li, Taojiannan Yang, Huafeng Kuang et al.

ECCV 2024arXiv:2404.07987
153
citations

DiNADO: Norm-Disentangled Neurally-Decomposed Oracles for Controlling Language Models

Sidi Lu, Wenbo Zhao, Chenyang Tao et al.

ICML 2024arXiv:2306.11825
1
citations

Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models

Shengqu Cai, Duygu Ceylan, Matheus Gadelha et al.

CVPR 2024arXiv:2312.01409
26
citations

Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation

Yixiao Wang, Chen Tang, Lingfeng Sun et al.

ECCV 2024arXiv:2408.00766
17
citations

T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion

Chong Mou, Xintao Wang, Liangbin Xie et al.

AAAI 2024paperarXiv:2302.08453
1460
citations

Text-Anchored Score Composition: Tackling Condition Misalignment in Text-to-Image Diffusion Models

Luozhou Wang, Guibao Shen, Wenhang Ge et al.

ECCV 2024arXiv:2306.14408
5
citations