Cargando…

Cluster tendency assessment in neuronal spike data

Sorting spikes from extracellular recording into clusters associated with distinct single units (putative neurons) is a fundamental step in analyzing neuronal populations. Such spike sorting is intrinsically unsupervised, as the number of neurons are not known a priori. Therefor, any spike sorting i...

Descripción completa

Detalles Bibliográficos
Autores principales: Mahallati, Sara, Bezdek, James C., Popovic, Milos R., Valiante, Taufik A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6850537/
https://www.ncbi.nlm.nih.gov/pubmed/31714913
http://dx.doi.org/10.1371/journal.pone.0224547
_version_ 1783469446296240128
author Mahallati, Sara
Bezdek, James C.
Popovic, Milos R.
Valiante, Taufik A.
author_facet Mahallati, Sara
Bezdek, James C.
Popovic, Milos R.
Valiante, Taufik A.
author_sort Mahallati, Sara
collection PubMed
description Sorting spikes from extracellular recording into clusters associated with distinct single units (putative neurons) is a fundamental step in analyzing neuronal populations. Such spike sorting is intrinsically unsupervised, as the number of neurons are not known a priori. Therefor, any spike sorting is an unsupervised learning problem that requires either of the two approaches: specification of a fixed value k for the number of clusters to seek, or generation of candidate partitions for several possible values of c, followed by selection of a best candidate based on various post-clustering validation criteria. In this paper, we investigate the first approach and evaluate the utility of several methods for providing lower dimensional visualization of the cluster structure and on subsequent spike clustering. We also introduce a visualization technique called improved visual assessment of cluster tendency (iVAT) to estimate possible cluster structures in data without the need for dimensionality reduction. Experimental results are conducted on two datasets with ground truth labels. In data with a relatively small number of clusters, iVAT is beneficial in estimating the number of clusters to inform the initialization of clustering algorithms. With larger numbers of clusters, iVAT gives a useful estimate of the coarse cluster structure but sometimes fails to indicate the presumptive number of clusters. We show that noise associated with recording extracellular neuronal potentials can disrupt computational clustering schemes, highlighting the benefit of probabilistic clustering models. Our results show that t-Distributed Stochastic Neighbor Embedding (t-SNE) provides representations of the data that yield more accurate visualization of potential cluster structure to inform the clustering stage. Moreover, The clusters obtained using t-SNE features were more reliable than the clusters obtained using the other methods, which indicates that t-SNE can potentially be used for both visualization and to extract features to be used by any clustering algorithm.
format Online
Article
Text
id pubmed-6850537
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-68505372019-11-22 Cluster tendency assessment in neuronal spike data Mahallati, Sara Bezdek, James C. Popovic, Milos R. Valiante, Taufik A. PLoS One Research Article Sorting spikes from extracellular recording into clusters associated with distinct single units (putative neurons) is a fundamental step in analyzing neuronal populations. Such spike sorting is intrinsically unsupervised, as the number of neurons are not known a priori. Therefor, any spike sorting is an unsupervised learning problem that requires either of the two approaches: specification of a fixed value k for the number of clusters to seek, or generation of candidate partitions for several possible values of c, followed by selection of a best candidate based on various post-clustering validation criteria. In this paper, we investigate the first approach and evaluate the utility of several methods for providing lower dimensional visualization of the cluster structure and on subsequent spike clustering. We also introduce a visualization technique called improved visual assessment of cluster tendency (iVAT) to estimate possible cluster structures in data without the need for dimensionality reduction. Experimental results are conducted on two datasets with ground truth labels. In data with a relatively small number of clusters, iVAT is beneficial in estimating the number of clusters to inform the initialization of clustering algorithms. With larger numbers of clusters, iVAT gives a useful estimate of the coarse cluster structure but sometimes fails to indicate the presumptive number of clusters. We show that noise associated with recording extracellular neuronal potentials can disrupt computational clustering schemes, highlighting the benefit of probabilistic clustering models. Our results show that t-Distributed Stochastic Neighbor Embedding (t-SNE) provides representations of the data that yield more accurate visualization of potential cluster structure to inform the clustering stage. Moreover, The clusters obtained using t-SNE features were more reliable than the clusters obtained using the other methods, which indicates that t-SNE can potentially be used for both visualization and to extract features to be used by any clustering algorithm. Public Library of Science 2019-11-12 /pmc/articles/PMC6850537/ /pubmed/31714913 http://dx.doi.org/10.1371/journal.pone.0224547 Text en © 2019 Mahallati et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Mahallati, Sara
Bezdek, James C.
Popovic, Milos R.
Valiante, Taufik A.
Cluster tendency assessment in neuronal spike data
title Cluster tendency assessment in neuronal spike data
title_full Cluster tendency assessment in neuronal spike data
title_fullStr Cluster tendency assessment in neuronal spike data
title_full_unstemmed Cluster tendency assessment in neuronal spike data
title_short Cluster tendency assessment in neuronal spike data
title_sort cluster tendency assessment in neuronal spike data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6850537/
https://www.ncbi.nlm.nih.gov/pubmed/31714913
http://dx.doi.org/10.1371/journal.pone.0224547
work_keys_str_mv AT mahallatisara clustertendencyassessmentinneuronalspikedata
AT bezdekjamesc clustertendencyassessmentinneuronalspikedata
AT popovicmilosr clustertendencyassessmentinneuronalspikedata
AT valiantetaufika clustertendencyassessmentinneuronalspikedata