Cargando…

Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications

Regression clustering is a mixture of unsupervised and supervised statistical learning and data mining method which is found in a wide range of applications including artificial intelligence and neuroscience. It performs unsupervised learning when it clusters the data according to their respective u...

Descripción completa

Detalles Bibliográficos
Autores principales: Qian, Guoqi, Wu, Yuehua, Ferrari, Davide, Qiao, Puxue, Hollande, Frédéric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4861814/
https://www.ncbi.nlm.nih.gov/pubmed/27212939
http://dx.doi.org/10.1155/2016/4037380
_version_ 1782431259532722176
author Qian, Guoqi
Wu, Yuehua
Ferrari, Davide
Qiao, Puxue
Hollande, Frédéric
author_facet Qian, Guoqi
Wu, Yuehua
Ferrari, Davide
Qiao, Puxue
Hollande, Frédéric
author_sort Qian, Guoqi
collection PubMed
description Regression clustering is a mixture of unsupervised and supervised statistical learning and data mining method which is found in a wide range of applications including artificial intelligence and neuroscience. It performs unsupervised learning when it clusters the data according to their respective unobserved regression hyperplanes. The method also performs supervised learning when it fits regression hyperplanes to the corresponding data clusters. Applying regression clustering in practice requires means of determining the underlying number of clusters in the data, finding the cluster label of each data point, and estimating the regression coefficients of the model. In this paper, we review the estimation and selection issues in regression clustering with regard to the least squares and robust statistical methods. We also provide a model selection based technique to determine the number of regression clusters underlying the data. We further develop a computing procedure for regression clustering estimation and selection. Finally, simulation studies are presented for assessing the procedure, together with analyzing a real data set on RGB cell marking in neuroscience to illustrate and interpret the method.
format Online
Article
Text
id pubmed-4861814
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-48618142016-05-22 Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications Qian, Guoqi Wu, Yuehua Ferrari, Davide Qiao, Puxue Hollande, Frédéric Comput Intell Neurosci Research Article Regression clustering is a mixture of unsupervised and supervised statistical learning and data mining method which is found in a wide range of applications including artificial intelligence and neuroscience. It performs unsupervised learning when it clusters the data according to their respective unobserved regression hyperplanes. The method also performs supervised learning when it fits regression hyperplanes to the corresponding data clusters. Applying regression clustering in practice requires means of determining the underlying number of clusters in the data, finding the cluster label of each data point, and estimating the regression coefficients of the model. In this paper, we review the estimation and selection issues in regression clustering with regard to the least squares and robust statistical methods. We also provide a model selection based technique to determine the number of regression clusters underlying the data. We further develop a computing procedure for regression clustering estimation and selection. Finally, simulation studies are presented for assessing the procedure, together with analyzing a real data set on RGB cell marking in neuroscience to illustrate and interpret the method. Hindawi Publishing Corporation 2016 2016-04-26 /pmc/articles/PMC4861814/ /pubmed/27212939 http://dx.doi.org/10.1155/2016/4037380 Text en Copyright © 2016 Guoqi Qian et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Qian, Guoqi
Wu, Yuehua
Ferrari, Davide
Qiao, Puxue
Hollande, Frédéric
Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications
title Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications
title_full Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications
title_fullStr Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications
title_full_unstemmed Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications
title_short Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications
title_sort semisupervised clustering by iterative partition and regression with neuroscience applications
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4861814/
https://www.ncbi.nlm.nih.gov/pubmed/27212939
http://dx.doi.org/10.1155/2016/4037380
work_keys_str_mv AT qianguoqi semisupervisedclusteringbyiterativepartitionandregressionwithneuroscienceapplications
AT wuyuehua semisupervisedclusteringbyiterativepartitionandregressionwithneuroscienceapplications
AT ferraridavide semisupervisedclusteringbyiterativepartitionandregressionwithneuroscienceapplications
AT qiaopuxue semisupervisedclusteringbyiterativepartitionandregressionwithneuroscienceapplications
AT hollandefrederic semisupervisedclusteringbyiterativepartitionandregressionwithneuroscienceapplications