α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Dan Guo
Dan Guo
22
papers
410
total citations
papers (22)
Iterative Context-Aware Graph Inference for Visual Dialog
CVPR 2020
arXiv
52
citations
Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition
AAAI 2025
arXiv
44
citations
EulerMormer: Robust Eulerian Motion Magnification via Dynamic Filtering within Transformer
AAAI 2024
arXiv
42
citations
Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic Architecture
CVPR 2024
arXiv
35
citations
EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
CVPR 2025
arXiv
30
citations
Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering
AAAI 2024
arXiv
27
citations
Text-Based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning
AAAI 2024
26
citations
Towards Open-Vocabulary Audio-Visual Event Localization
CVPR 2025
arXiv
25
citations
MMAD: Multi-label Micro-Action Detection in Videos
ICCV 2025
arXiv
21
citations
Dense Audio-Visual Event Localization Under Cross-Modal Consistency and Multi-Temporal Granularity Collaboration
AAAI 2025
arXiv
20
citations
Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observations
CVPR 2025
arXiv
17
citations
Patch-level Sounding Object Tracking for Audio-Visual Question Answering
AAAI 2025
arXiv
16
citations
Multimodal Class-aware Semantic Enhancement Network for Audio-Visual Video Parsing
AAAI 2025
arXiv
15
citations
Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production
AAAI 2025
arXiv
14
citations
ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding
CVPR 2025
arXiv
11
citations
MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights
AAAI 2025
arXiv
6
citations
AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring
AAAI 2025
arXiv
6
citations
Moderating the Generalization of Score-based Generative Model
ICCV 2025
arXiv
3
citations
Audio—Visual Segmentation
ECCV 2022
0
citations
PhysDiff: Physiology-based Dynamicity Disentangled Diffusion Model for Remote Physiological Measurement
AAAI 2025
0
citations
KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking
AAAI 2024
0
citations
Data-Free Quantization via Pseudo-label Filtering
CVPR 2024
0
citations