Cargando…

Robustness of [Formula: see text] ‐type coefficients for clinical agreement

The degree of inter‐rater agreement is usually assessed through [Formula: see text] ‐type coefficients and the extent of agreement is then characterized by comparing the value of the adopted coefficient against a benchmark scale. Through two motivating examples, it is displayed the different behavio...

Descripción completa

Detalles Bibliográficos
Autores principales:	Vanacore, Amalia, Pellegrino, Maria Sole
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	John Wiley and Sons Inc. 2022
Materias:	Research Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303799/ https://www.ncbi.nlm.nih.gov/pubmed/35124830 http://dx.doi.org/10.1002/sim.9341

_version_	1784751956406829056
author	Vanacore, Amalia Pellegrino, Maria Sole
author_facet	Vanacore, Amalia Pellegrino, Maria Sole
author_sort	Vanacore, Amalia
collection	PubMed
description	The degree of inter‐rater agreement is usually assessed through [Formula: see text] ‐type coefficients and the extent of agreement is then characterized by comparing the value of the adopted coefficient against a benchmark scale. Through two motivating examples, it is displayed the different behavior of some [Formula: see text] ‐type coefficients due to asymmetric distribution of marginal frequencies over categories. In order to investigate the robustness of four [Formula: see text] ‐type coefficients for nominal and ordinal classifications and of an inferential benchmarking procedure that, differently from straightforward benchmarking, does not neglect the influence of the experimental conditions, an extensive Monte Carlo simulation study has been conducted. The robustness has been investigated for several scenarios, differing for sample size, rating scale dimension, number of raters, frequency distribution of rater classifications, pattern of agreement across raters. Simulation results reveal an higher paradoxical behavior of Fleiss kappa and Conger kappa with ordinal rather than nominal classifications; the coefficients robustness improves with increasing sample size and number of raters for both nominal and ordinal classifications whereas robustness improves with rating scale dimension only for nominal classifications. By identifying the scenarios (ie, minimum sample size, number of raters, rating scale dimension) with acceptable robustness, this study provides guidelines about the design of robust agreement studies.
format	Online Article Text
id	pubmed-9303799
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	John Wiley and Sons Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-93037992022-07-28 Robustness of [Formula: see text] ‐type coefficients for clinical agreement Vanacore, Amalia Pellegrino, Maria Sole Stat Med Research Articles The degree of inter‐rater agreement is usually assessed through [Formula: see text] ‐type coefficients and the extent of agreement is then characterized by comparing the value of the adopted coefficient against a benchmark scale. Through two motivating examples, it is displayed the different behavior of some [Formula: see text] ‐type coefficients due to asymmetric distribution of marginal frequencies over categories. In order to investigate the robustness of four [Formula: see text] ‐type coefficients for nominal and ordinal classifications and of an inferential benchmarking procedure that, differently from straightforward benchmarking, does not neglect the influence of the experimental conditions, an extensive Monte Carlo simulation study has been conducted. The robustness has been investigated for several scenarios, differing for sample size, rating scale dimension, number of raters, frequency distribution of rater classifications, pattern of agreement across raters. Simulation results reveal an higher paradoxical behavior of Fleiss kappa and Conger kappa with ordinal rather than nominal classifications; the coefficients robustness improves with increasing sample size and number of raters for both nominal and ordinal classifications whereas robustness improves with rating scale dimension only for nominal classifications. By identifying the scenarios (ie, minimum sample size, number of raters, rating scale dimension) with acceptable robustness, this study provides guidelines about the design of robust agreement studies. John Wiley and Sons Inc. 2022-02-06 2022-05-20 /pmc/articles/PMC9303799/ /pubmed/35124830 http://dx.doi.org/10.1002/sim.9341 Text en © 2022 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle	Research Articles Vanacore, Amalia Pellegrino, Maria Sole Robustness of [Formula: see text] ‐type coefficients for clinical agreement
title	Robustness of [Formula: see text] ‐type coefficients for clinical agreement
title_full	Robustness of [Formula: see text] ‐type coefficients for clinical agreement
title_fullStr	Robustness of [Formula: see text] ‐type coefficients for clinical agreement
title_full_unstemmed	Robustness of [Formula: see text] ‐type coefficients for clinical agreement
title_short	Robustness of [Formula: see text] ‐type coefficients for clinical agreement
title_sort	robustness of [formula: see text] ‐type coefficients for clinical agreement
topic	Research Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303799/ https://www.ncbi.nlm.nih.gov/pubmed/35124830 http://dx.doi.org/10.1002/sim.9341
work_keys_str_mv	AT vanacoreamalia robustnessofformulaseetexttypecoefficientsforclinicalagreement AT pellegrinomariasole robustnessofformulaseetexttypecoefficientsforclinicalagreement

Robustness of [Formula: see text] ‐type coefficients for clinical agreement

Ejemplares similares