"diffusion transformer architecture" Papers
7 papers found
Conference
Frame In-N-Out: Unbounded Controllable Image-to-Video Generation
Boyang Wang, Xuweiyi Chen, Matheus Gadelha et al.
NEURIPS 2025oralarXiv:2505.21491
5
citations
Image Editing As Programs with Diffusion Models
Yujia Hu, Songhua Liu, Zhenxiong Tan et al.
NEURIPS 2025arXiv:2506.04158
2
citations
Mask^2DiT: Dual Mask-based Diffusion Transformer for Multi-Scene Long Video Generation
Tianhao Qi, Jianlong Yuan, Wanquan Feng et al.
CVPR 2025
8
citations
RayPose: Ray Bundling Diffusion for Template Views in Unseen 6D Object Pose Estimation
Junwen Huang, Shishir Reddy Vutukur, Peter Yu et al.
ICCV 2025arXiv:2510.18521
ReSim: Reliable World Simulation for Autonomous Driving
Jiazhi Yang, Kashyap Chitta, Shenyuan Gao et al.
NEURIPS 2025spotlightarXiv:2506.09981
18
citations
ShoeFit: A New Dataset and Dual-image-stream DiT Framework for Virtual Footwear Try-On
Yuhan Li, Zhiyu Jin, Yifan Tong et al.
NEURIPS 2025
TextToucher: Fine-Grained Text-to-Touch Generation
Jiahang Tu, Hao Fu, Fengyu Yang et al.
AAAI 2025paperarXiv:2409.05427
14
citations