Poster "egocentric video understanding" Papers
13 papers found
Conference
ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark
Ronghao Dang, Yuqian Yuan, Wenqi Zhang et al.
CVPR 2025arXiv:2501.05031
16
citations
EgoLM: Multi-Modal Language Model of Egocentric Motions
Fangzhou Hong, Vladimir Guzov, Hyo Jin Kim et al.
CVPR 2025arXiv:2409.18127
12
citations
Fine-grained Spatiotemporal Grounding on Egocentric Videos
Shuo LIANG, Yiwu Zhong, Zi-Yuan Hu et al.
ICCV 2025arXiv:2508.00518
5
citations
HiERO: Understanding the Hierarchy of Human Behavior Enhances Reasoning on Egocentric Videos
Simone Alberto Peirone, Francesca Pistilli, Giuseppe Averta
ICCV 2025arXiv:2505.12911
1
citations
MMEgo: Towards Building Egocentric Multimodal LLMs for Video QA
Hanrong Ye, Haotian Zhang, Erik Daxberger et al.
ICLR 2025
12
citations
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning
Baoqi Pei, Yifei Huang, Jilan Xu et al.
ICLR 2025arXiv:2503.00986
12
citations
PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement
Tewodros W. Ayalew, Xiao Zhang, Kevin Y Wu et al.
ICCV 2025arXiv:2411.17764
2
citations
Robust Egocentric Referring Video Object Segmentation via Dual-Modal Causal Intervention
Haijing Liu, Zhiyuan Song, Hefeng Wu et al.
NEURIPS 2025arXiv:2512.24323
A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives
Simone Alberto Peirone, Francesca Pistilli, Antonio Alliegro et al.
CVPR 2024arXiv:2403.03037
11
citations
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos
Kang Hyolim, Jeongseok Hyun, Joungbin An et al.
ECCV 2024arXiv:2407.12987
1
citations
AMEGO: Active Memory from long EGOcentric videos
Gabriele Goletto, Tushar Nagarajan, Giuseppe Averta et al.
ECCV 2024arXiv:2409.10917
21
citations
Ex2Eg-MAE: A Framework for Adaptation of Exocentric Video Masked Autoencoders for Egocentric Social Role Understanding
Minh Tran, Yelin Kim, Che-Chun Su et al.
ECCV 2024
Grounded Question-Answering in Long Egocentric Videos
Shangzhe Di, Weidi Xie
CVPR 2024arXiv:2312.06505
48
citations