We want to study inter-rater agreement comparing groups of observers who express their ratings on a discrete or ordinal scale. The starting point is to define what we mean by "agreement". Given d observers, let the scores they assign to a given statistical unit be expressed as a d-vector in the real space. We define a deterministic ordering among such vectors which expresses the degree of the raters' agreement. The overall scoring of the raters on the sample space will be a d-dimensional random vector. We then define an associated partial ordering among the random vectors of the ratings, show some of its properties, and look at order-preserving functions (agreement measures). In this paper we also show how to test the hypothesis of greater agreement against the unrestricted hypothesis, and the hypothesis of equal agreement against the hypothesis that an agreement ordering holds. The test is applied to real data on two medical observers rating clinical guidelines.
A. Giovagnoli, J. Marzialetti, H.P. Wynn (2008). A new approach to inter-rater agreement through stochastic orderings: the discrete case. METRIKA, 67(3), 349-370 [10.1007/s00184-007-0137-4].
A new approach to inter-rater agreement through stochastic orderings: the discrete case
GIOVAGNOLI, ALESSANDRA;MARZIALETTI, JOHNNY;
2008
Abstract
We want to study inter-rater agreement comparing groups of observers who express their ratings on a discrete or ordinal scale. The starting point is to define what we mean by "agreement". Given d observers, let the scores they assign to a given statistical unit be expressed as a d-vector in the real space. We define a deterministic ordering among such vectors which expresses the degree of the raters' agreement. The overall scoring of the raters on the sample space will be a d-dimensional random vector. We then define an associated partial ordering among the random vectors of the ratings, show some of its properties, and look at order-preserving functions (agreement measures). In this paper we also show how to test the hypothesis of greater agreement against the unrestricted hypothesis, and the hypothesis of equal agreement against the hypothesis that an agreement ordering holds. The test is applied to real data on two medical observers rating clinical guidelines.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.