α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Yuki Mitsufuji
Yuki Mitsufuji
1
Affiliations
Affiliations
Sony Group Corporation
16
papers
690
total citations
papers (16)
Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion
ICLR 2024
arXiv
333
citations
Manifold Preserving Guided Diffusion
ICLR 2024
arXiv
129
citations
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
NEURIPS 2023
arXiv
90
citations
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
CVPR 2025
arXiv
79
citations
MMDisCo: Multi-Modal Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation
ICLR 2025
arXiv
18
citations
SoundCTM: Unifying Score-based and Consistency Models for Full-band Text-to-Sound Generation
ICLR 2025
arXiv
10
citations
Enhancing 3D Reconstruction for Dynamic Scenes
NEURIPS 2025
arXiv
7
citations
Classifier-Free Guidance Inside the Attraction Basin May Cause Memorization
CVPR 2025
arXiv
7
citations
HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning
ICLR 2025
arXiv
3
citations
Weighted Point Set Embedding for Multimodal Contrastive Learning Toward Optimal Similarity Metric
ICLR 2025
arXiv
3
citations
Transformed Low-rank Adaptation via Tensor Decomposition and Its Applications to Text-to-image Models
ICCV 2025
arXiv
3
citations
TalkCuts: A Large-Scale Dataset for Multi-Shot Human Speech Video Generation
NEURIPS 2025
arXiv
3
citations
Mining your own secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models
ICLR 2025
arXiv
2
citations
VinaBench: Benchmark for Faithful and Consistent Visual Narratives
CVPR 2025
arXiv
2
citations
TITAN-Guide: Taming Inference-Time Alignment for Guided Text-to-Video Diffusion Models
ICCV 2025
arXiv
1
citations
Densely Connected Multi-Dilated Convolutional Networks for Dense Prediction Tasks
CVPR 2021
0
citations