Papers by Carlos E Jimenez
3 papers found
Conference
IMPersona: Evaluating Individual Level LLM Impersonation
Quan Shi, Carlos E Jimenez, Stephen Dong et al.
COLM 2025
SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
John Yang, Carlos E Jimenez, Alex Zhang et al.
ICLR 2025, arXiv:2410.03859
108 citations
SWE-bench: Can Language Models Resolve Real-world GitHub Issues?
Carlos E Jimenez, John Yang, Alexander Wettig et al.
ICLR 2024, arXiv:2310.06770
1485 citations