Cargando…

Relationship between prediction accuracy and feature importance reliability: An empirical and theoretical study

There is significant interest in using neuroimaging data to predict behavior. The predictive models are often interpreted by the computation of feature importance, which quantifies the predictive relevance of an imaging feature. Tian and Zalesky (2021) suggest that feature importance estimates exhib...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Jianzhong, Ooi, Leon Qi Rong, Tan, Trevor Wei Kiat, Zhang, Shaoshi, Li, Jingwei, Asplund, Christopher L., Eickhoff, Simon B, Bzdok, Danilo, Holmes, Avram J, Yeo, B.T. Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10241563/
https://www.ncbi.nlm.nih.gov/pubmed/37088322
http://dx.doi.org/10.1016/j.neuroimage.2023.120115
_version_ 1785054012356165632
author Chen, Jianzhong
Ooi, Leon Qi Rong
Tan, Trevor Wei Kiat
Zhang, Shaoshi
Li, Jingwei
Asplund, Christopher L.
Eickhoff, Simon B
Bzdok, Danilo
Holmes, Avram J
Yeo, B.T. Thomas
author_facet Chen, Jianzhong
Ooi, Leon Qi Rong
Tan, Trevor Wei Kiat
Zhang, Shaoshi
Li, Jingwei
Asplund, Christopher L.
Eickhoff, Simon B
Bzdok, Danilo
Holmes, Avram J
Yeo, B.T. Thomas
author_sort Chen, Jianzhong
collection PubMed
description There is significant interest in using neuroimaging data to predict behavior. The predictive models are often interpreted by the computation of feature importance, which quantifies the predictive relevance of an imaging feature. Tian and Zalesky (2021) suggest that feature importance estimates exhibit low split-half reliability, as well as a trade-off between prediction accuracy and feature importance reliability across parcellation resolutions. However, it is unclear whether the trade-off between prediction accuracy and feature importance reliability is universal. Here, we demonstrate that, with a sufficient sample size, feature importance (operationalized as Haufe-transformed weights) can achieve fair to excellent split-half reliability. With a sample size of 2600 participants, Haufe-transformed weights achieve average intra-class correlation coefficients of 0.75, 0.57 and 0.53 for cognitive, personality and mental health measures respectively. Haufe-transformed weights are much more reliable than original regression weights and univariate FC-behavior correlations. Original regression weights are not reliable even with 2600 participants. Intriguingly, feature importance reliability is strongly positively correlated with prediction accuracy across phenotypes. Within a particular behavioral domain, there is no clear relationship between prediction performance and feature importance reliability across regression models. Furthermore, we show mathematically that feature importance reliability is necessary, but not sufficient, for low feature importance error. In the case of linear models, lower feature importance error is mathematically related to lower prediction error. Therefore, higher feature importance reliability might yield lower feature importance error and higher prediction accuracy. Finally, we discuss how our theoretical results relate with the reliability of imaging features and behavioral measures. Overall, the current study provides empirical and theoretical insights into the relationship between prediction accuracy and feature importance reliability.
format Online
Article
Text
id pubmed-10241563
institution National Center for Biotechnology Information
language English
publishDate 2023
record_format MEDLINE/PubMed
spelling pubmed-102415632023-07-01 Relationship between prediction accuracy and feature importance reliability: An empirical and theoretical study Chen, Jianzhong Ooi, Leon Qi Rong Tan, Trevor Wei Kiat Zhang, Shaoshi Li, Jingwei Asplund, Christopher L. Eickhoff, Simon B Bzdok, Danilo Holmes, Avram J Yeo, B.T. Thomas Neuroimage Article There is significant interest in using neuroimaging data to predict behavior. The predictive models are often interpreted by the computation of feature importance, which quantifies the predictive relevance of an imaging feature. Tian and Zalesky (2021) suggest that feature importance estimates exhibit low split-half reliability, as well as a trade-off between prediction accuracy and feature importance reliability across parcellation resolutions. However, it is unclear whether the trade-off between prediction accuracy and feature importance reliability is universal. Here, we demonstrate that, with a sufficient sample size, feature importance (operationalized as Haufe-transformed weights) can achieve fair to excellent split-half reliability. With a sample size of 2600 participants, Haufe-transformed weights achieve average intra-class correlation coefficients of 0.75, 0.57 and 0.53 for cognitive, personality and mental health measures respectively. Haufe-transformed weights are much more reliable than original regression weights and univariate FC-behavior correlations. Original regression weights are not reliable even with 2600 participants. Intriguingly, feature importance reliability is strongly positively correlated with prediction accuracy across phenotypes. Within a particular behavioral domain, there is no clear relationship between prediction performance and feature importance reliability across regression models. Furthermore, we show mathematically that feature importance reliability is necessary, but not sufficient, for low feature importance error. In the case of linear models, lower feature importance error is mathematically related to lower prediction error. Therefore, higher feature importance reliability might yield lower feature importance error and higher prediction accuracy. Finally, we discuss how our theoretical results relate with the reliability of imaging features and behavioral measures. Overall, the current study provides empirical and theoretical insights into the relationship between prediction accuracy and feature importance reliability. 2023-07-01 2023-04-23 /pmc/articles/PMC10241563/ /pubmed/37088322 http://dx.doi.org/10.1016/j.neuroimage.2023.120115 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) )
spellingShingle Article
Chen, Jianzhong
Ooi, Leon Qi Rong
Tan, Trevor Wei Kiat
Zhang, Shaoshi
Li, Jingwei
Asplund, Christopher L.
Eickhoff, Simon B
Bzdok, Danilo
Holmes, Avram J
Yeo, B.T. Thomas
Relationship between prediction accuracy and feature importance reliability: An empirical and theoretical study
title Relationship between prediction accuracy and feature importance reliability: An empirical and theoretical study
title_full Relationship between prediction accuracy and feature importance reliability: An empirical and theoretical study
title_fullStr Relationship between prediction accuracy and feature importance reliability: An empirical and theoretical study
title_full_unstemmed Relationship between prediction accuracy and feature importance reliability: An empirical and theoretical study
title_short Relationship between prediction accuracy and feature importance reliability: An empirical and theoretical study
title_sort relationship between prediction accuracy and feature importance reliability: an empirical and theoretical study
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10241563/
https://www.ncbi.nlm.nih.gov/pubmed/37088322
http://dx.doi.org/10.1016/j.neuroimage.2023.120115
work_keys_str_mv AT chenjianzhong relationshipbetweenpredictionaccuracyandfeatureimportancereliabilityanempiricalandtheoreticalstudy
AT ooileonqirong relationshipbetweenpredictionaccuracyandfeatureimportancereliabilityanempiricalandtheoreticalstudy
AT tantrevorweikiat relationshipbetweenpredictionaccuracyandfeatureimportancereliabilityanempiricalandtheoreticalstudy
AT zhangshaoshi relationshipbetweenpredictionaccuracyandfeatureimportancereliabilityanempiricalandtheoreticalstudy
AT lijingwei relationshipbetweenpredictionaccuracyandfeatureimportancereliabilityanempiricalandtheoreticalstudy
AT asplundchristopherl relationshipbetweenpredictionaccuracyandfeatureimportancereliabilityanempiricalandtheoreticalstudy
AT eickhoffsimonb relationshipbetweenpredictionaccuracyandfeatureimportancereliabilityanempiricalandtheoreticalstudy
AT bzdokdanilo relationshipbetweenpredictionaccuracyandfeatureimportancereliabilityanempiricalandtheoreticalstudy
AT holmesavramj relationshipbetweenpredictionaccuracyandfeatureimportancereliabilityanempiricalandtheoreticalstudy
AT yeobtthomas relationshipbetweenpredictionaccuracyandfeatureimportancereliabilityanempiricalandtheoreticalstudy