"fairness evaluation" Papers
5 papers found
Conference
DPA: A one-stop metric to measure bias amplification in classification datasets
Bhanu Tokas, Rahul Nair, Hannah Kerner
NEURIPS 2025arXiv:2412.11060
1
citations
FairMT-Bench: Benchmarking Fairness for Multi-turn Dialogue in Conversational LLMs
Zhiting Fan, Ruizhe Chen, Tianxiang Hu et al.
ICLR 2025arXiv:2410.19317
37
citations
Trans-EnV: A Framework for Evaluating the Linguistic Robustness of LLMs Against English Varieties
Jiyoung Lee, Seungho Kim, Jieun Han et al.
NEURIPS 2025arXiv:2505.20875
3
citations
Large Language Models are Geographically Biased
Rohin Manvi, Samar Khanna, Marshall Burke et al.
ICML 2024oralarXiv:2402.02680
93
citations
Position: TrustLLM: Trustworthiness in Large Language Models
Yue Huang, Lichao Sun, Haoran Wang et al.
ICML 2024