Papers by Trevor Darrell
11 papers found
Conference
A Coefficient Makes SVRG Effective
Yida Yin, Zhiqiu Xu, Zhiyuan Li et al.
ICLR 2025, arXiv:2311.05589
5 citations
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion
Junyi Zhang, Charles Herrmann, Junhwa Hur et al.
ICLR 2025, arXiv:2410.03825
276 citations
Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs)
Leander Girrbach, Stephan Alaniz, Yiran Huang et al.
ICLR 2025, arXiv:2410.19314
9 citations
SegLLM: Multi-round Reasoning Segmentation with Large Language Models
Xudong Wang, Shaolun Zhang, Shufan Li et al.
ICLR 2025
9 citations
VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models
Lisa Dunlap, Krishna Mandal, Trevor Darrell et al.
ICLR 2025, arXiv:2410.12851
13 citations
Video Action Differencing
James Burgess, Xiaohan Wang, Yuhui Zhang et al.
ICLR 2025, arXiv:2503.07860
8 citations
Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark
Tsung-Han Wu, Giscard Biamby, Jerome Quenum et al.
ICLR 2025, arXiv:2407.13766
30 citations
Initializing Models with Larger Ones
Zhiqiu Xu, Yanjie Chen, Kirill Vishniakov et al.
ICLR 2024 (spotlight), arXiv:2311.18823
31 citations
LLM-grounded Video Diffusion Models
Long Lian, Baifeng Shi, Adam Yala et al.
ICLR 2024 (oral), arXiv:2309.17444
77 citations
Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models
Sheng Shen, Le Hou, Yanqi Zhou et al.
ICLR 2024
Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game
Sam Toyer, Olivia Watkins, Ethan Mendes et al.
ICLR 2024 (spotlight), arXiv:2311.01011
106 citations