Cargando…
Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications
Regression clustering is a mixture of unsupervised and supervised statistical learning and data mining method which is found in a wide range of applications including artificial intelligence and neuroscience. It performs unsupervised learning when it clusters the data according to their respective u...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4861814/ https://www.ncbi.nlm.nih.gov/pubmed/27212939 http://dx.doi.org/10.1155/2016/4037380 |
_version_ | 1782431259532722176 |
---|---|
author | Qian, Guoqi Wu, Yuehua Ferrari, Davide Qiao, Puxue Hollande, Frédéric |
author_facet | Qian, Guoqi Wu, Yuehua Ferrari, Davide Qiao, Puxue Hollande, Frédéric |
author_sort | Qian, Guoqi |
collection | PubMed |
description | Regression clustering is a mixture of unsupervised and supervised statistical learning and data mining method which is found in a wide range of applications including artificial intelligence and neuroscience. It performs unsupervised learning when it clusters the data according to their respective unobserved regression hyperplanes. The method also performs supervised learning when it fits regression hyperplanes to the corresponding data clusters. Applying regression clustering in practice requires means of determining the underlying number of clusters in the data, finding the cluster label of each data point, and estimating the regression coefficients of the model. In this paper, we review the estimation and selection issues in regression clustering with regard to the least squares and robust statistical methods. We also provide a model selection based technique to determine the number of regression clusters underlying the data. We further develop a computing procedure for regression clustering estimation and selection. Finally, simulation studies are presented for assessing the procedure, together with analyzing a real data set on RGB cell marking in neuroscience to illustrate and interpret the method. |
format | Online Article Text |
id | pubmed-4861814 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-48618142016-05-22 Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications Qian, Guoqi Wu, Yuehua Ferrari, Davide Qiao, Puxue Hollande, Frédéric Comput Intell Neurosci Research Article Regression clustering is a mixture of unsupervised and supervised statistical learning and data mining method which is found in a wide range of applications including artificial intelligence and neuroscience. It performs unsupervised learning when it clusters the data according to their respective unobserved regression hyperplanes. The method also performs supervised learning when it fits regression hyperplanes to the corresponding data clusters. Applying regression clustering in practice requires means of determining the underlying number of clusters in the data, finding the cluster label of each data point, and estimating the regression coefficients of the model. In this paper, we review the estimation and selection issues in regression clustering with regard to the least squares and robust statistical methods. We also provide a model selection based technique to determine the number of regression clusters underlying the data. We further develop a computing procedure for regression clustering estimation and selection. Finally, simulation studies are presented for assessing the procedure, together with analyzing a real data set on RGB cell marking in neuroscience to illustrate and interpret the method. Hindawi Publishing Corporation 2016 2016-04-26 /pmc/articles/PMC4861814/ /pubmed/27212939 http://dx.doi.org/10.1155/2016/4037380 Text en Copyright © 2016 Guoqi Qian et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Qian, Guoqi Wu, Yuehua Ferrari, Davide Qiao, Puxue Hollande, Frédéric Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications |
title | Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications |
title_full | Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications |
title_fullStr | Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications |
title_full_unstemmed | Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications |
title_short | Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications |
title_sort | semisupervised clustering by iterative partition and regression with neuroscience applications |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4861814/ https://www.ncbi.nlm.nih.gov/pubmed/27212939 http://dx.doi.org/10.1155/2016/4037380 |
work_keys_str_mv | AT qianguoqi semisupervisedclusteringbyiterativepartitionandregressionwithneuroscienceapplications AT wuyuehua semisupervisedclusteringbyiterativepartitionandregressionwithneuroscienceapplications AT ferraridavide semisupervisedclusteringbyiterativepartitionandregressionwithneuroscienceapplications AT qiaopuxue semisupervisedclusteringbyiterativepartitionandregressionwithneuroscienceapplications AT hollandefrederic semisupervisedclusteringbyiterativepartitionandregressionwithneuroscienceapplications |