Poster "web agent tasks" Papers
2 papers found
Harnessing Webpage UIs for Text-Rich Visual Understanding
Junpeng Liu, Tianyue Ou, Yifan Song et al.
ICLR 2025, arXiv:2410.13824
22 citations
VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks
Lawrence Jang, Yinheng Li, Dan Zhao et al.
ICLR 2025, arXiv:2410.19100
26 citations