by Ethan Chen Papers
2 papers found
Conference
MUVR: A Multi-Modal Untrimmed Video Retrieval Benchmark with Multi-Level Visual Correspondence
Yue Feng, Jinwei Hu, Qijia Lu et al.
NEURIPS 2025arXiv:2510.21406
1
citations
Sylber: Syllabic Embedding Representation of Speech from Raw Audio
Cheol Jun Cho, Nicholas Lee, Akshat Gupta et al.
ICLR 2025arXiv:2410.07168
15
citations