Poster "measurement theory" Papers
2 papers found
Conference
Neither Valid nor Reliable? Investigating the Use of LLMs as Judges
Khaoula Chehbouni, Mohammed Haddou, Jackie CK Cheung et al.
NEURIPS 2025arXiv:2508.18076
11
citations
Position: Measure Dataset Diversity, Don't Just Claim It
Dora Zhao, Jerone Andrews, Orestis Papakyriakopoulos et al.
ICML 2024arXiv:2407.08188
32
citations