Cargando…

Mind the gap: Performance metric evaluation in brain‐age prediction

Estimating age based on neuroimaging‐derived data has become a popular approach to developing markers for brain integrity and health. While a variety of machine‐learning algorithms can provide accurate predictions of age based on brain characteristics, there is significant variation in model accurac...

Descripción completa

Detalles Bibliográficos
Autores principales: de Lange, Ann‐Marie G., Anatürk, Melis, Rokicki, Jaroslav, Han, Laura K. M., Franke, Katja, Alnæs, Dag, Ebmeier, Klaus P., Draganski, Bogdan, Kaufmann, Tobias, Westlye, Lars T., Hahn, Tim, Cole, James H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9188975/
https://www.ncbi.nlm.nih.gov/pubmed/35312210
http://dx.doi.org/10.1002/hbm.25837
_version_ 1784725492794916864
author de Lange, Ann‐Marie G.
Anatürk, Melis
Rokicki, Jaroslav
Han, Laura K. M.
Franke, Katja
Alnæs, Dag
Ebmeier, Klaus P.
Draganski, Bogdan
Kaufmann, Tobias
Westlye, Lars T.
Hahn, Tim
Cole, James H.
author_facet de Lange, Ann‐Marie G.
Anatürk, Melis
Rokicki, Jaroslav
Han, Laura K. M.
Franke, Katja
Alnæs, Dag
Ebmeier, Klaus P.
Draganski, Bogdan
Kaufmann, Tobias
Westlye, Lars T.
Hahn, Tim
Cole, James H.
author_sort de Lange, Ann‐Marie G.
collection PubMed
description Estimating age based on neuroimaging‐derived data has become a popular approach to developing markers for brain integrity and health. While a variety of machine‐learning algorithms can provide accurate predictions of age based on brain characteristics, there is significant variation in model accuracy reported across studies. We predicted age in two population‐based datasets, and assessed the effects of age range, sample size and age‐bias correction on the model performance metrics Pearson's correlation coefficient (r), the coefficient of determination (R (2)), Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE). The results showed that these metrics vary considerably depending on cohort age range; r and R (2) values are lower when measured in samples with a narrower age range. RMSE and MAE are also lower in samples with a narrower age range due to smaller errors/brain age delta values when predictions are closer to the mean age of the group. Across subsets with different age ranges, performance metrics improve with increasing sample size. Performance metrics further vary depending on prediction variance as well as mean age difference between training and test sets, and age‐bias corrected metrics indicate high accuracy—also for models showing poor initial performance. In conclusion, performance metrics used for evaluating age prediction models depend on cohort and study‐specific data characteristics, and cannot be directly compared across different studies. Since age‐bias corrected metrics generally indicate high accuracy, even for poorly performing models, inspection of uncorrected model results provides important information about underlying model attributes such as prediction variance.
format Online
Article
Text
id pubmed-9188975
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley & Sons, Inc.
record_format MEDLINE/PubMed
spelling pubmed-91889752022-06-15 Mind the gap: Performance metric evaluation in brain‐age prediction de Lange, Ann‐Marie G. Anatürk, Melis Rokicki, Jaroslav Han, Laura K. M. Franke, Katja Alnæs, Dag Ebmeier, Klaus P. Draganski, Bogdan Kaufmann, Tobias Westlye, Lars T. Hahn, Tim Cole, James H. Hum Brain Mapp Research Articles Estimating age based on neuroimaging‐derived data has become a popular approach to developing markers for brain integrity and health. While a variety of machine‐learning algorithms can provide accurate predictions of age based on brain characteristics, there is significant variation in model accuracy reported across studies. We predicted age in two population‐based datasets, and assessed the effects of age range, sample size and age‐bias correction on the model performance metrics Pearson's correlation coefficient (r), the coefficient of determination (R (2)), Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE). The results showed that these metrics vary considerably depending on cohort age range; r and R (2) values are lower when measured in samples with a narrower age range. RMSE and MAE are also lower in samples with a narrower age range due to smaller errors/brain age delta values when predictions are closer to the mean age of the group. Across subsets with different age ranges, performance metrics improve with increasing sample size. Performance metrics further vary depending on prediction variance as well as mean age difference between training and test sets, and age‐bias corrected metrics indicate high accuracy—also for models showing poor initial performance. In conclusion, performance metrics used for evaluating age prediction models depend on cohort and study‐specific data characteristics, and cannot be directly compared across different studies. Since age‐bias corrected metrics generally indicate high accuracy, even for poorly performing models, inspection of uncorrected model results provides important information about underlying model attributes such as prediction variance. John Wiley & Sons, Inc. 2022-03-21 /pmc/articles/PMC9188975/ /pubmed/35312210 http://dx.doi.org/10.1002/hbm.25837 Text en © 2022 The Authors. Human Brain Mapping published by Wiley Periodicals LLC. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
de Lange, Ann‐Marie G.
Anatürk, Melis
Rokicki, Jaroslav
Han, Laura K. M.
Franke, Katja
Alnæs, Dag
Ebmeier, Klaus P.
Draganski, Bogdan
Kaufmann, Tobias
Westlye, Lars T.
Hahn, Tim
Cole, James H.
Mind the gap: Performance metric evaluation in brain‐age prediction
title Mind the gap: Performance metric evaluation in brain‐age prediction
title_full Mind the gap: Performance metric evaluation in brain‐age prediction
title_fullStr Mind the gap: Performance metric evaluation in brain‐age prediction
title_full_unstemmed Mind the gap: Performance metric evaluation in brain‐age prediction
title_short Mind the gap: Performance metric evaluation in brain‐age prediction
title_sort mind the gap: performance metric evaluation in brain‐age prediction
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9188975/
https://www.ncbi.nlm.nih.gov/pubmed/35312210
http://dx.doi.org/10.1002/hbm.25837
work_keys_str_mv AT delangeannmarieg mindthegapperformancemetricevaluationinbrainageprediction
AT anaturkmelis mindthegapperformancemetricevaluationinbrainageprediction
AT rokickijaroslav mindthegapperformancemetricevaluationinbrainageprediction
AT hanlaurakm mindthegapperformancemetricevaluationinbrainageprediction
AT frankekatja mindthegapperformancemetricevaluationinbrainageprediction
AT alnæsdag mindthegapperformancemetricevaluationinbrainageprediction
AT ebmeierklausp mindthegapperformancemetricevaluationinbrainageprediction
AT draganskibogdan mindthegapperformancemetricevaluationinbrainageprediction
AT kaufmanntobias mindthegapperformancemetricevaluationinbrainageprediction
AT westlyelarst mindthegapperformancemetricevaluationinbrainageprediction
AT hahntim mindthegapperformancemetricevaluationinbrainageprediction
AT colejamesh mindthegapperformancemetricevaluationinbrainageprediction