ARTICLE: Annotator Reliability Through In-Context Learning

6 citations

arXiv:2409.12218

#733 of 3028 papers in AAAI 2025

Top Authors


Sujan Dutta, Deepak Pandita, Tharindu Cyril Weerasooriya, Marcos Zampieri, Christopher M. Homan, Ashiqur R. KhudaBukhsh

Topics

annotator reliability, in-context learning, sentiment analysis, offensive speech detection, annotation quality, data quality assessment, self-consistency estimation, LLM evaluation

Abstract

Ensuring annotator quality in training and evaluation data is a key piece of machine learning in NLP. Tasks such as sentiment analysis and offensive speech detection are intrinsically subjective, creating a challenging scenario for traditional quality assessment approaches because it is hard to distinguish disagreement due to poor work from disagreement due to differences of opinion between sincere annotators. With the goal of increasing diverse perspectives in annotation while ensuring consistency, we propose ARTICLE, an in-context learning (ICL) framework to estimate annotation quality through self-consistency. We evaluate this framework on two offensive speech datasets using multiple LLMs and compare its performance with traditional methods. Our findings indicate that ARTICLE can be used as a robust method for identifying reliable annotators, hence improving data quality.
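The abstract does not spell out how self-consistency is computed, so the following is only a minimal sketch of one plausible reading: each of an annotator's labels is held out in turn, the remaining labels are formatted as in-context demonstrations, and the share of held-out labels that a model recovers is taken as the annotator's score. The names `build_prompt`, `self_consistency`, and `toy_predict` are hypothetical, the prompt format is an assumption, and `toy_predict` is a keyword stand-in for an LLM call rather than anything the paper describes.

```python
from typing import Callable, List, Tuple

Example = Tuple[str, int]  # (text, label) pair from a single annotator


def build_prompt(demos: List[Example], query: str) -> str:
    """Format an annotator's own labels as few-shot demonstrations (assumed format)."""
    blocks = [f"Text: {text}\nLabel: {label}" for text, label in demos]
    blocks.append(f"Text: {query}\nLabel:")
    return "\n\n".join(blocks)


def self_consistency(examples: List[Example],
                     predict: Callable[[str], int]) -> float:
    """Leave-one-out agreement between an annotator's labels and the labels
    predicted from a prompt built out of that annotator's other examples."""
    if len(examples) < 2:
        raise ValueError("need at least two labelled examples")
    hits = 0
    for i, (text, label) in enumerate(examples):
        demos = examples[:i] + examples[i + 1:]  # hold out example i
        hits += int(predict(build_prompt(demos, text)) == label)
    return hits / len(examples)


def toy_predict(prompt: str) -> int:
    """Hypothetical stand-in for an LLM: ignores the demonstrations and
    returns 1 when the query text contains the word 'idiot'."""
    query = prompt.rsplit("Text: ", 1)[1].rsplit("\nLabel:", 1)[0]
    return int("idiot" in query.lower())


consistent = [("you idiot", 1), ("nice day", 0),
              ("what an idiot", 1), ("thanks a lot", 0)]
erratic = [("you idiot", 0), ("nice day", 1),
           ("what an idiot", 1), ("thanks a lot", 0)]

print(self_consistency(consistent, toy_predict))
print(self_consistency(erratic, toy_predict))
```

In practice `predict` would wrap a call to an LLM that actually conditions on the demonstrations, and annotators scoring below some chosen threshold would be flagged for review. Both the threshold and the use of plain accuracy as the score are assumptions of this sketch.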

Citation History

Feb 4, 2026: 5 (+1)

Feb 13, 2026: 6 (+1)