Cargando…

Navigating the Statistical Minefield of Model Selection and Clustering in Neuroscience

Model selection is often implicit: when performing an ANOVA, one assumes that the normal distribution is a good model of the data; fitting a tuning curve implies that an additive and a multiplicative scaler describes the behavior of the neuron; even calculating an average implicitly assumes that the...

Descripción completa

Detalles Bibliográficos
Autores principales: Király, Bálint, Hangya, Balázs
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Society for Neuroscience 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9282170/
https://www.ncbi.nlm.nih.gov/pubmed/35835556
http://dx.doi.org/10.1523/ENEURO.0066-22.2022
_version_ 1784747050348314624
author Király, Bálint
Hangya, Balázs
author_facet Király, Bálint
Hangya, Balázs
author_sort Király, Bálint
collection PubMed
description Model selection is often implicit: when performing an ANOVA, one assumes that the normal distribution is a good model of the data; fitting a tuning curve implies that an additive and a multiplicative scaler describes the behavior of the neuron; even calculating an average implicitly assumes that the data were sampled from a distribution that has a finite first statistical moment: the mean. Model selection may be explicit, when the aim is to test whether one model provides a better description of the data than a competing one. As a special case, clustering algorithms identify groups with similar properties within the data. They are widely used from spike sorting to cell type identification to gene expression analysis. We discuss model selection and clustering techniques from a statistician’s point of view, revealing the assumptions behind, and the logic that governs the various approaches. We also showcase important neuroscience applications and provide suggestions how neuroscientists could put model selection algorithms to best use as well as what mistakes should be avoided.
format Online
Article
Text
id pubmed-9282170
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Society for Neuroscience
record_format MEDLINE/PubMed
spelling pubmed-92821702022-08-01 Navigating the Statistical Minefield of Model Selection and Clustering in Neuroscience Király, Bálint Hangya, Balázs eNeuro Review Model selection is often implicit: when performing an ANOVA, one assumes that the normal distribution is a good model of the data; fitting a tuning curve implies that an additive and a multiplicative scaler describes the behavior of the neuron; even calculating an average implicitly assumes that the data were sampled from a distribution that has a finite first statistical moment: the mean. Model selection may be explicit, when the aim is to test whether one model provides a better description of the data than a competing one. As a special case, clustering algorithms identify groups with similar properties within the data. They are widely used from spike sorting to cell type identification to gene expression analysis. We discuss model selection and clustering techniques from a statistician’s point of view, revealing the assumptions behind, and the logic that governs the various approaches. We also showcase important neuroscience applications and provide suggestions how neuroscientists could put model selection algorithms to best use as well as what mistakes should be avoided. Society for Neuroscience 2022-07-07 /pmc/articles/PMC9282170/ /pubmed/35835556 http://dx.doi.org/10.1523/ENEURO.0066-22.2022 Text en Copyright © 2022 Király and Hangya https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.
spellingShingle Review
Király, Bálint
Hangya, Balázs
Navigating the Statistical Minefield of Model Selection and Clustering in Neuroscience
title Navigating the Statistical Minefield of Model Selection and Clustering in Neuroscience
title_full Navigating the Statistical Minefield of Model Selection and Clustering in Neuroscience
title_fullStr Navigating the Statistical Minefield of Model Selection and Clustering in Neuroscience
title_full_unstemmed Navigating the Statistical Minefield of Model Selection and Clustering in Neuroscience
title_short Navigating the Statistical Minefield of Model Selection and Clustering in Neuroscience
title_sort navigating the statistical minefield of model selection and clustering in neuroscience
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9282170/
https://www.ncbi.nlm.nih.gov/pubmed/35835556
http://dx.doi.org/10.1523/ENEURO.0066-22.2022
work_keys_str_mv AT kiralybalint navigatingthestatisticalminefieldofmodelselectionandclusteringinneuroscience
AT hangyabalazs navigatingthestatisticalminefieldofmodelselectionandclusteringinneuroscience