Cargando…

Performance of intraclass correlation coefficient (ICC) as a reliability index under various distributions in scale reliability studies

Many published scale validation studies determine inter‐rater reliability using the intra‐class correlation coefficient (ICC). However, the use of this statistic must consider its advantages, limitations, and applicability. This paper evaluates how interaction of subject distribution, sample size, a...

Descripción completa

Detalles Bibliográficos
Autores principales:	Mehta, Shraddha, Bastero‐Caballero, Rowena F., Sun, Yijun, Zhu, Ray, Murphy, Diane K., Hardas, Bhushan, Koch, Gary
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	John Wiley and Sons Inc. 2018
Materias:	Research Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6174967/ https://www.ncbi.nlm.nih.gov/pubmed/29707825 http://dx.doi.org/10.1002/sim.7679

_version_	1783361398132178944
author	Mehta, Shraddha Bastero‐Caballero, Rowena F. Sun, Yijun Zhu, Ray Murphy, Diane K. Hardas, Bhushan Koch, Gary
author_facet	Mehta, Shraddha Bastero‐Caballero, Rowena F. Sun, Yijun Zhu, Ray Murphy, Diane K. Hardas, Bhushan Koch, Gary
author_sort	Mehta, Shraddha
collection	PubMed
description	Many published scale validation studies determine inter‐rater reliability using the intra‐class correlation coefficient (ICC). However, the use of this statistic must consider its advantages, limitations, and applicability. This paper evaluates how interaction of subject distribution, sample size, and levels of rater disagreement affects ICC and provides an approach for obtaining relevant ICC estimates under suboptimal conditions. Simulation results suggest that for a fixed number of subjects, ICC from the convex distribution is smaller than ICC for the uniform distribution, which in turn is smaller than ICC for the concave distribution. The variance component estimates also show that the dissimilarity of ICC among distributions is attributed to the study design (ie, distribution of subjects) component of subject variability and not the scale quality component of rater error variability. The dependency of ICC on the distribution of subjects makes it difficult to compare results across reliability studies. Hence, it is proposed that reliability studies should be designed using a uniform distribution of subjects because of the standardization it provides for representing objective disagreement. In the absence of uniform distribution, a sampling method is proposed to reduce the non‐uniformity. In addition, as expected, high levels of disagreement result in low ICC, and when the type of distribution is fixed, any increase in the number of subjects beyond a moderately large specification such as n = 80 does not have a major impact on ICC.
format	Online Article Text
id	pubmed-6174967
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	John Wiley and Sons Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-61749672018-10-15 Performance of intraclass correlation coefficient (ICC) as a reliability index under various distributions in scale reliability studies Mehta, Shraddha Bastero‐Caballero, Rowena F. Sun, Yijun Zhu, Ray Murphy, Diane K. Hardas, Bhushan Koch, Gary Stat Med Research Articles Many published scale validation studies determine inter‐rater reliability using the intra‐class correlation coefficient (ICC). However, the use of this statistic must consider its advantages, limitations, and applicability. This paper evaluates how interaction of subject distribution, sample size, and levels of rater disagreement affects ICC and provides an approach for obtaining relevant ICC estimates under suboptimal conditions. Simulation results suggest that for a fixed number of subjects, ICC from the convex distribution is smaller than ICC for the uniform distribution, which in turn is smaller than ICC for the concave distribution. The variance component estimates also show that the dissimilarity of ICC among distributions is attributed to the study design (ie, distribution of subjects) component of subject variability and not the scale quality component of rater error variability. The dependency of ICC on the distribution of subjects makes it difficult to compare results across reliability studies. Hence, it is proposed that reliability studies should be designed using a uniform distribution of subjects because of the standardization it provides for representing objective disagreement. In the absence of uniform distribution, a sampling method is proposed to reduce the non‐uniformity. In addition, as expected, high levels of disagreement result in low ICC, and when the type of distribution is fixed, any increase in the number of subjects beyond a moderately large specification such as n = 80 does not have a major impact on ICC. John Wiley and Sons Inc. 2018-04-29 2018-08-15 /pmc/articles/PMC6174967/ /pubmed/29707825 http://dx.doi.org/10.1002/sim.7679 Text en © 2018 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle	Research Articles Mehta, Shraddha Bastero‐Caballero, Rowena F. Sun, Yijun Zhu, Ray Murphy, Diane K. Hardas, Bhushan Koch, Gary Performance of intraclass correlation coefficient (ICC) as a reliability index under various distributions in scale reliability studies
title	Performance of intraclass correlation coefficient (ICC) as a reliability index under various distributions in scale reliability studies
title_full	Performance of intraclass correlation coefficient (ICC) as a reliability index under various distributions in scale reliability studies
title_fullStr	Performance of intraclass correlation coefficient (ICC) as a reliability index under various distributions in scale reliability studies
title_full_unstemmed	Performance of intraclass correlation coefficient (ICC) as a reliability index under various distributions in scale reliability studies
title_short	Performance of intraclass correlation coefficient (ICC) as a reliability index under various distributions in scale reliability studies
title_sort	performance of intraclass correlation coefficient (icc) as a reliability index under various distributions in scale reliability studies
topic	Research Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6174967/ https://www.ncbi.nlm.nih.gov/pubmed/29707825 http://dx.doi.org/10.1002/sim.7679
work_keys_str_mv	AT mehtashraddha performanceofintraclasscorrelationcoefficienticcasareliabilityindexundervariousdistributionsinscalereliabilitystudies AT basterocaballerorowenaf performanceofintraclasscorrelationcoefficienticcasareliabilityindexundervariousdistributionsinscalereliabilitystudies AT sunyijun performanceofintraclasscorrelationcoefficienticcasareliabilityindexundervariousdistributionsinscalereliabilitystudies AT zhuray performanceofintraclasscorrelationcoefficienticcasareliabilityindexundervariousdistributionsinscalereliabilitystudies AT murphydianek performanceofintraclasscorrelationcoefficienticcasareliabilityindexundervariousdistributionsinscalereliabilitystudies AT hardasbhushan performanceofintraclasscorrelationcoefficienticcasareliabilityindexundervariousdistributionsinscalereliabilitystudies AT kochgary performanceofintraclasscorrelationcoefficienticcasareliabilityindexundervariousdistributionsinscalereliabilitystudies

Performance of intraclass correlation coefficient (ICC) as a reliability index under various distributions in scale reliability studies

Ejemplares similares