α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Qi Zhao
Qi Zhao
1
Affiliations
Affiliations
Karlsruhe Institute of Technology (KIT)
27
papers
389
total citations
papers (27)
AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?
ICLR 2024
arXiv
84
citations
SwitchTab: Switched Autoencoders Are Effective Tabular Learners
AAAI 2024
arXiv
56
citations
DNeRV: Modeling Inherent Dynamics via Difference Neural Representation for Videos
CVPR 2023
arXiv
48
citations
CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models
ICCV 2025
arXiv
44
citations
AiR: Attention with Reasoning Capability
ECCV 2020
arXiv
44
citations
REX: Reasoning-Aware and Grounded Explanation
CVPR 2022
arXiv
24
citations
Beyond Average: Individualized Visual Scanpath Prediction
CVPR 2024
arXiv
18
citations
Learning to Predict Trustworthiness with Steep Slope Loss
NEURIPS 2021
arXiv
13
citations
Synthetic Video Enhances Physical Fidelity in Video Synthesis
ICCV 2025
arXiv
11
citations
PNeRV: Enhancing Spatial Consistency via Pyramidal Neural Representation for Videos
CVPR 2024
arXiv
10
citations
What Do Deep Saliency Models Learn about Visual Attention?
NEURIPS 2023
arXiv
9
citations
GazeXplain: Learning to Predict Natural Language Explanations of Visual Scanpaths
ECCV 2024
arXiv
8
citations
Divide and Conquer: Answering Questions With Object Factorization and Compositional Reasoning
CVPR 2023
arXiv
8
citations
n-Reference Transfer Learning for Saliency Prediction
ECCV 2020
arXiv
6
citations
Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
ICML 2025
arXiv
6
citations
Predicting Human Scanpaths in Visual Question Answering
CVPR 2021
0
citations
Model Lineage Closeness Analysis
AAAI 2025
0
citations
Fantastic Answers and Where to Find Them: Immersive Question-Directed Visual Attention
CVPR 2020
0
citations
Explicit Knowledge Incorporation for Visual Reasoning
CVPR 2021
0
citations
VisualHow: Multimodal Problem Solving
CVPR 2022
0
citations
New Datasets and Models for Contextual Reasoning in Visual Dialog
ECCV 2022
0
citations
Two Sides of the Same Coin: Learning the Backdoor to Remove the Backdoor
AAAI 2025
0
citations
NN-Baker: A Neural-network Infused Algorithmic Framework for Optimization Problems on Geometric Intersection Graphs
NEURIPS 2021
0
citations
Query and Attention Augmentation for Knowledge-Based Explainable Reasoning
CVPR 2022
0
citations
ROME is Forged in Adversity: Robust Distilled Datasets via Information Bottleneck
ICML 2025
0
citations
Toward Multi-Granularity Decision-Making: Explicit Visual Reasoning with Hierarchical Knowledge
ICCV 2023
0
citations
Explainable Saliency: Articulating Reasoning with Contextual Prioritization
CVPR 2025
0
citations