Cargando…

Trial and error: A hierarchical modeling approach to test-retest reliability

The concept of test-retest reliability indexes the consistency of a measurement across time. High reliability is critical for any scientific study, but specifically for the study of individual differences. Evidence of poor reliability of commonly used behavioral and functional neuroimaging tasks is...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Gang, Pine, Daniel S., Brotman, Melissa A., Smith, Ashley R., Cox, Robert W., Haller, Simone P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10241320/
https://www.ncbi.nlm.nih.gov/pubmed/34688897
http://dx.doi.org/10.1016/j.neuroimage.2021.118647
_version_ 1785053957600575488
author Chen, Gang
Pine, Daniel S.
Brotman, Melissa A.
Smith, Ashley R.
Cox, Robert W.
Haller, Simone P.
author_facet Chen, Gang
Pine, Daniel S.
Brotman, Melissa A.
Smith, Ashley R.
Cox, Robert W.
Haller, Simone P.
author_sort Chen, Gang
collection PubMed
description The concept of test-retest reliability indexes the consistency of a measurement across time. High reliability is critical for any scientific study, but specifically for the study of individual differences. Evidence of poor reliability of commonly used behavioral and functional neuroimaging tasks is mounting. Reports on low reliability of task-based fMRI have called into question the adequacy of using even the most common, well-characterized cognitive tasks with robust population-level effects, to measure individual differences. Here, we lay out a hierarchical framework that estimates reliability as a correlation divorced from trial-level variability, and show that reliability tends to be underestimated under the conventional intraclass correlation framework through summary statistics based on condition-level modeling. In addition, we examine how reliability estimation between the two statistical frameworks diverges and assess how different factors (e.g., trial and subject sample sizes, relative magnitude of cross-trial variability) impact reliability estimates. As empirical data indicate that cross-trial variability is large in most tasks, this work highlights that a large number of trials (e.g., greater than 100) may be required to achieve precise reliability estimates. We reference the tools TRR and 3dLMEr for the community to apply trial-level models to behavior and neuroimaging data and discuss how to make these new measurements most useful for future studies.
format Online
Article
Text
id pubmed-10241320
institution National Center for Biotechnology Information
language English
publishDate 2021
record_format MEDLINE/PubMed
spelling pubmed-102413202023-06-05 Trial and error: A hierarchical modeling approach to test-retest reliability Chen, Gang Pine, Daniel S. Brotman, Melissa A. Smith, Ashley R. Cox, Robert W. Haller, Simone P. Neuroimage Article The concept of test-retest reliability indexes the consistency of a measurement across time. High reliability is critical for any scientific study, but specifically for the study of individual differences. Evidence of poor reliability of commonly used behavioral and functional neuroimaging tasks is mounting. Reports on low reliability of task-based fMRI have called into question the adequacy of using even the most common, well-characterized cognitive tasks with robust population-level effects, to measure individual differences. Here, we lay out a hierarchical framework that estimates reliability as a correlation divorced from trial-level variability, and show that reliability tends to be underestimated under the conventional intraclass correlation framework through summary statistics based on condition-level modeling. In addition, we examine how reliability estimation between the two statistical frameworks diverges and assess how different factors (e.g., trial and subject sample sizes, relative magnitude of cross-trial variability) impact reliability estimates. As empirical data indicate that cross-trial variability is large in most tasks, this work highlights that a large number of trials (e.g., greater than 100) may be required to achieve precise reliability estimates. We reference the tools TRR and 3dLMEr for the community to apply trial-level models to behavior and neuroimaging data and discuss how to make these new measurements most useful for future studies. 2021-12-15 2021-10-22 /pmc/articles/PMC10241320/ /pubmed/34688897 http://dx.doi.org/10.1016/j.neuroimage.2021.118647 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) )
spellingShingle Article
Chen, Gang
Pine, Daniel S.
Brotman, Melissa A.
Smith, Ashley R.
Cox, Robert W.
Haller, Simone P.
Trial and error: A hierarchical modeling approach to test-retest reliability
title Trial and error: A hierarchical modeling approach to test-retest reliability
title_full Trial and error: A hierarchical modeling approach to test-retest reliability
title_fullStr Trial and error: A hierarchical modeling approach to test-retest reliability
title_full_unstemmed Trial and error: A hierarchical modeling approach to test-retest reliability
title_short Trial and error: A hierarchical modeling approach to test-retest reliability
title_sort trial and error: a hierarchical modeling approach to test-retest reliability
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10241320/
https://www.ncbi.nlm.nih.gov/pubmed/34688897
http://dx.doi.org/10.1016/j.neuroimage.2021.118647
work_keys_str_mv AT chengang trialanderrorahierarchicalmodelingapproachtotestretestreliability
AT pinedaniels trialanderrorahierarchicalmodelingapproachtotestretestreliability
AT brotmanmelissaa trialanderrorahierarchicalmodelingapproachtotestretestreliability
AT smithashleyr trialanderrorahierarchicalmodelingapproachtotestretestreliability
AT coxrobertw trialanderrorahierarchicalmodelingapproachtotestretestreliability
AT hallersimonep trialanderrorahierarchicalmodelingapproachtotestretestreliability