Po-Yao Huang
10 papers, 835 total citations
papers (10)
Masked Autoencoders that Listen
NEURIPS 2022 | arXiv | 395 citations

Perception Encoder: The best visual embeddings are not at the output of the network
NEURIPS 2025 | arXiv | 129 citations

MAViL: Masked Audio-Video Learners
NEURIPS 2023 | arXiv | 75 citations

Diffusion Models as Masked Autoencoders
ICCV 2023 | arXiv | 75 citations

PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding
NEURIPS 2025 | arXiv | 47 citations

Space-Time Crop & Attend: Improving Cross-Modal Video Representation Learning
ICCV 2021 | arXiv | 36 citations

CiT: Curation in Training for Effective Vision-Language Data
ICCV 2023 | arXiv | 31 citations

MoDE: CLIP Data Experts via Clustering
CVPR 2024 | arXiv | 25 citations

STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition
CVPR 2023 | arXiv | 15 citations

Self-Supervised Audio-Visual Soundscape Stylization
ECCV 2024 | arXiv | 7 citations