ResearchAlpha Leak

Conferences Topics Top Authors Rankings Browse All

Home/Authors/Xize Cheng

Xize Cheng

Topic trends: 32,543 papers · similarity ≥ 0.4 · year ≥ 2024 · Data sourced from Semantic Scholar

34,598 papers | Abstracts: 31,650 (91.5%) | Citations: 34,598 (100.0%) | arXiv: 26,074 (75.4%)

Built: Feb 15, 2026, 4:04 AM AMS

7

papers

217

total citations

papers (7)

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding

MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition

OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup

SpatialCLIP: Learning 3D-aware Image Representations from Spatially Discriminative Language

A Wander Through the Multimodal Landscape: Efficient Transfer Learning via Low-rank Sequence Multimodal Adapter

Exploring Group Video Captioning with Efficient Relational Approximation