Xize Cheng
7 papers, 217 total citations
papers (7)
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling
ICLR 2025, arXiv, 132 citations
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding
ICCV 2023, arXiv, 29 citations
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition
ICCV 2023, arXiv, 29 citations
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup
ICLR 2025, arXiv, 13 citations
SpatialCLIP: Learning 3D-aware Image Representations from Spatially Discriminative Language
CVPR 2025, 11 citations
A Wander Through the Multimodal Landscape: Efficient Transfer Learning via Low-rank Sequence Multimodal Adapter
AAAI 2025, arXiv, 3 citations
Exploring Group Video Captioning with Efficient Relational Approximation
ICCV 2023, 0 citations