Cargando…

The unbiased estimation of the fraction of variance explained by a model

The correlation coefficient squared, r(2), is commonly used to validate quantitative models on neural data, yet it is biased by trial-to-trial variability: as trial-to-trial variability increases, measured correlation to a model’s predictions decreases. As a result, models that perfectly explain neu...

Descripción completa

Detalles Bibliográficos
Autores principales: Pospisil, Dean A., Bair, Wyeth
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8367013/
https://www.ncbi.nlm.nih.gov/pubmed/34347786
http://dx.doi.org/10.1371/journal.pcbi.1009212
_version_ 1783738991663644672
author Pospisil, Dean A.
Bair, Wyeth
author_facet Pospisil, Dean A.
Bair, Wyeth
author_sort Pospisil, Dean A.
collection PubMed
description The correlation coefficient squared, r(2), is commonly used to validate quantitative models on neural data, yet it is biased by trial-to-trial variability: as trial-to-trial variability increases, measured correlation to a model’s predictions decreases. As a result, models that perfectly explain neural tuning can appear to perform poorly. Many solutions to this problem have been proposed, but no consensus has been reached on which is the least biased estimator. Some currently used methods substantially overestimate model fit, and the utility of even the best performing methods is limited by the lack of confidence intervals and asymptotic analysis. We provide a new estimator, [Image: see text] , that outperforms all prior estimators in our testing, and we provide confidence intervals and asymptotic guarantees. We apply our estimator to a variety of neural data to validate its utility. We find that neural noise is often so great that confidence intervals of the estimator cover the entire possible range of values ([0, 1]), preventing meaningful evaluation of the quality of a model’s predictions. This leads us to propose the use of the signal-to-noise ratio (SNR) as a quality metric for making quantitative comparisons across neural recordings. Analyzing a variety of neural data sets, we find that up to ∼ 40% of some state-of-the-art neural recordings do not pass even a liberal SNR criterion. Moving toward more reliable estimates of correlation, and quantitatively comparing quality across recording modalities and data sets, will be critical to accelerating progress in modeling biological phenomena.
format Online
Article
Text
id pubmed-8367013
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83670132021-08-17 The unbiased estimation of the fraction of variance explained by a model Pospisil, Dean A. Bair, Wyeth PLoS Comput Biol Research Article The correlation coefficient squared, r(2), is commonly used to validate quantitative models on neural data, yet it is biased by trial-to-trial variability: as trial-to-trial variability increases, measured correlation to a model’s predictions decreases. As a result, models that perfectly explain neural tuning can appear to perform poorly. Many solutions to this problem have been proposed, but no consensus has been reached on which is the least biased estimator. Some currently used methods substantially overestimate model fit, and the utility of even the best performing methods is limited by the lack of confidence intervals and asymptotic analysis. We provide a new estimator, [Image: see text] , that outperforms all prior estimators in our testing, and we provide confidence intervals and asymptotic guarantees. We apply our estimator to a variety of neural data to validate its utility. We find that neural noise is often so great that confidence intervals of the estimator cover the entire possible range of values ([0, 1]), preventing meaningful evaluation of the quality of a model’s predictions. This leads us to propose the use of the signal-to-noise ratio (SNR) as a quality metric for making quantitative comparisons across neural recordings. Analyzing a variety of neural data sets, we find that up to ∼ 40% of some state-of-the-art neural recordings do not pass even a liberal SNR criterion. Moving toward more reliable estimates of correlation, and quantitatively comparing quality across recording modalities and data sets, will be critical to accelerating progress in modeling biological phenomena. Public Library of Science 2021-08-04 /pmc/articles/PMC8367013/ /pubmed/34347786 http://dx.doi.org/10.1371/journal.pcbi.1009212 Text en © 2021 Pospisil, Bair https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Pospisil, Dean A.
Bair, Wyeth
The unbiased estimation of the fraction of variance explained by a model
title The unbiased estimation of the fraction of variance explained by a model
title_full The unbiased estimation of the fraction of variance explained by a model
title_fullStr The unbiased estimation of the fraction of variance explained by a model
title_full_unstemmed The unbiased estimation of the fraction of variance explained by a model
title_short The unbiased estimation of the fraction of variance explained by a model
title_sort unbiased estimation of the fraction of variance explained by a model
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8367013/
https://www.ncbi.nlm.nih.gov/pubmed/34347786
http://dx.doi.org/10.1371/journal.pcbi.1009212
work_keys_str_mv AT pospisildeana theunbiasedestimationofthefractionofvarianceexplainedbyamodel
AT bairwyeth theunbiasedestimationofthefractionofvarianceexplainedbyamodel
AT pospisildeana unbiasedestimationofthefractionofvarianceexplainedbyamodel
AT bairwyeth unbiasedestimationofthefractionofvarianceexplainedbyamodel