α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Tao Jin
Tao Jin
23
papers
242
total citations
papers (23)
Gloss Attention for Gloss-Free Sign Language Translation
CVPR 2023
arXiv
65
citations
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition
ICCV 2023
arXiv
29
citations
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion
ICML 2024
arXiv
19
citations
Non-confusing Generation of Customized Concepts in Diffusion Models
ICML 2024
arXiv
18
citations
DART: Implicit Doppler Tomography for Radar Novel View Synthesis
CVPR 2024
arXiv
16
citations
Borda Regret Minimization for Generalized Linear Dueling Bandits
ICML 2024
arXiv
15
citations
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup
ICLR 2025
arXiv
13
citations
Bridging the Gap for Test-Time Multimodal Sentiment Analysis
AAAI 2025
arXiv
11
citations
SpatialCLIP: Learning 3D-aware Image Representations from Spatially Discriminative Language
CVPR 2025
11
citations
Towards Transformer-Based Aligned Generation with Self-Coherence Guidance
CVPR 2025
arXiv
10
citations
Smoothing the Shift: Towards Stable Test-Time Adaptation under Complex Multimodal Noises
ICLR 2025
arXiv
10
citations
Diff-Prompt: Diffusion-driven Prompt Generator with Mask Supervision
ICLR 2025
arXiv
7
citations
DATE: Domain Adaptive Product Seeker for E-Commerce
CVPR 2023
arXiv
6
citations
ConceptGuard: Continual Personalized Text-to-Image Generation with Forgetting and Confusion Mitigation
CVPR 2025
arXiv
5
citations
Open-set Cross Modal Generalization via Multimodal Unified Representation
ICCV 2025
arXiv
3
citations
A Wander Through the Multimodal Landscape: Efficient Transfer Learning via Low-rank Sequence Multimodal Adapter
AAAI 2025
arXiv
3
citations
OpenFLAME: Federated Visual Positioning System to Enable Large-Scale Augmented Reality Applications
ISMAR 2025
arXiv
1
citations
Active Ranking without Strong Stochastic Transitivity
NEURIPS 2022
0
citations
MPOD123: One Image to 3D Content Generation Using Mask-enhanced Progressive Outline-to-Detail Optimization
CVPR 2024
0
citations
Speech Watermarking with Discrete Intermediate Representations
AAAI 2025
arXiv
0
citations
Generalizable Multi-linear Attention Network
NEURIPS 2021
0
citations
Exploring Group Video Captioning with Efficient Relational Approximation
ICCV 2023
0
citations
Non-Natural Image Understanding with Advancing Frequency-based Vision Encoders
CVPR 2025
0
citations