α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Shentong Mo
Shentong Mo
17
papers
536
total citations
papers (17)
DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation
NEURIPS 2023
arXiv
116
citations
Localizing Visual Sounds the Easy Way
ECCV 2022
arXiv
99
citations
A Closer Look at Weakly-Supervised Audio-Visual Source Localization
NEURIPS 2022
arXiv
80
citations
Audio-Visual Grouping Network for Sound Localization From Mixtures
CVPR 2023
arXiv
64
citations
DiffComplete: Diffusion-based Generative 3D Shape Completion
NEURIPS 2023
arXiv
41
citations
Audio-Visual Class-Incremental Learning
ICCV 2023
arXiv
35
citations
Class-Incremental Grouping Network for Continual Audio-Visual Learning
ICCV 2023
arXiv
31
citations
Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling
CVPR 2024
arXiv
31
citations
Weakly-Supervised Audio-Visual Segmentation
NEURIPS 2023
arXiv
20
citations
Audio-visual Generalized Zero-shot Learning the Easy Way
ECCV 2024
arXiv
8
citations
Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation
ECCV 2024
arXiv
7
citations
Scaling Diffusion Mamba with Bidirectional SSMs for Efficient 3D Shape Generation
AAAI 2025
3
citations
The Dynamic Duo of Collaborative Masking and Target for Advanced Masked Autoencoder Learning
AAAI 2025
arXiv
1
citations
"Unitail: Detecting, Reading, and Matching in Retail Scene"
ECCV 2022
0
citations
GMAIL: Generative Modality Alignment for generated Image Learning
ICML 2025
0
citations
Foley-Flow: Coordinated Video-to-Audio Generation with Masked Audio-Visual Alignment and Dynamic Conditional Flows
CVPR 2025
0
citations
Multi-modal Grouping Network for Weakly-Supervised Audio-Visual Video Parsing
NEURIPS 2022
0
citations