DiPietro-Hazari Kappa: A Novel Metric for Assessing Labeling Quality via Annotation
From MaRDI portal
Publication:6411058
arXiv2209.08243MaRDI QIDQ6411058
Author name not available (Why is that?)
Publication date: 17 September 2022
Abstract: Data is a key component of modern machine learning, but statistics for assessing data label quality remain sparse in literature. Here, we introduce DiPietro-Hazari Kappa, a novel statistical metric for assessing the quality of suggested dataset labels in the context of human annotation. Rooted in the classical Fleiss's Kappa measure of inter-annotator agreement, the DiPietro-Hazari Kappa quantifies the the empirical annotator agreement differential that was attained above random chance. We offer a thorough theoretical examination of Fleiss's Kappa before turning to our derivation of DiPietro-Hazari Kappa. Finally, we conclude with a matrix formulation and set of procedural instructions for easy computational implementation.
Has companion code repository: https://github.com/dandip/dh_kappa
This page was built for publication: DiPietro-Hazari Kappa: A Novel Metric for Assessing Labeling Quality via Annotation
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6411058)