Cargando…

Functional Principal Component Analysis and Randomized Sparse Clustering Algorithm for Medical Image Analysis

Due to the advancement in sensor technology, the growing large medical image data have the ability to visualize the anatomical changes in biological tissues. As a consequence, the medical images have the potential to enhance the diagnosis of disease, the prediction of clinical outcomes and the chara...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Nan, Jiang, Junhai, Guo, Shicheng, Xiong, Momiao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4510534/
https://www.ncbi.nlm.nih.gov/pubmed/26196383
http://dx.doi.org/10.1371/journal.pone.0132945
_version_ 1782382186947674112
author Lin, Nan
Jiang, Junhai
Guo, Shicheng
Xiong, Momiao
author_facet Lin, Nan
Jiang, Junhai
Guo, Shicheng
Xiong, Momiao
author_sort Lin, Nan
collection PubMed
description Due to the advancement in sensor technology, the growing large medical image data have the ability to visualize the anatomical changes in biological tissues. As a consequence, the medical images have the potential to enhance the diagnosis of disease, the prediction of clinical outcomes and the characterization of disease progression. But in the meantime, the growing data dimensions pose great methodological and computational challenges for the representation and selection of features in image cluster analysis. To address these challenges, we first extend the functional principal component analysis (FPCA) from one dimension to two dimensions to fully capture the space variation of image the signals. The image signals contain a large number of redundant features which provide no additional information for clustering analysis. The widely used methods for removing the irrelevant features are sparse clustering algorithms using a lasso-type penalty to select the features. However, the accuracy of clustering using a lasso-type penalty depends on the selection of the penalty parameters and the threshold value. In practice, they are difficult to determine. Recently, randomized algorithms have received a great deal of attentions in big data analysis. This paper presents a randomized algorithm for accurate feature selection in image clustering analysis. The proposed method is applied to both the liver and kidney cancer histology image data from the TCGA database. The results demonstrate that the randomized feature selection method coupled with functional principal component analysis substantially outperforms the current sparse clustering algorithms in image cluster analysis.
format Online
Article
Text
id pubmed-4510534
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-45105342015-07-24 Functional Principal Component Analysis and Randomized Sparse Clustering Algorithm for Medical Image Analysis Lin, Nan Jiang, Junhai Guo, Shicheng Xiong, Momiao PLoS One Research Article Due to the advancement in sensor technology, the growing large medical image data have the ability to visualize the anatomical changes in biological tissues. As a consequence, the medical images have the potential to enhance the diagnosis of disease, the prediction of clinical outcomes and the characterization of disease progression. But in the meantime, the growing data dimensions pose great methodological and computational challenges for the representation and selection of features in image cluster analysis. To address these challenges, we first extend the functional principal component analysis (FPCA) from one dimension to two dimensions to fully capture the space variation of image the signals. The image signals contain a large number of redundant features which provide no additional information for clustering analysis. The widely used methods for removing the irrelevant features are sparse clustering algorithms using a lasso-type penalty to select the features. However, the accuracy of clustering using a lasso-type penalty depends on the selection of the penalty parameters and the threshold value. In practice, they are difficult to determine. Recently, randomized algorithms have received a great deal of attentions in big data analysis. This paper presents a randomized algorithm for accurate feature selection in image clustering analysis. The proposed method is applied to both the liver and kidney cancer histology image data from the TCGA database. The results demonstrate that the randomized feature selection method coupled with functional principal component analysis substantially outperforms the current sparse clustering algorithms in image cluster analysis. Public Library of Science 2015-07-21 /pmc/articles/PMC4510534/ /pubmed/26196383 http://dx.doi.org/10.1371/journal.pone.0132945 Text en © 2015 Lin et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Lin, Nan
Jiang, Junhai
Guo, Shicheng
Xiong, Momiao
Functional Principal Component Analysis and Randomized Sparse Clustering Algorithm for Medical Image Analysis
title Functional Principal Component Analysis and Randomized Sparse Clustering Algorithm for Medical Image Analysis
title_full Functional Principal Component Analysis and Randomized Sparse Clustering Algorithm for Medical Image Analysis
title_fullStr Functional Principal Component Analysis and Randomized Sparse Clustering Algorithm for Medical Image Analysis
title_full_unstemmed Functional Principal Component Analysis and Randomized Sparse Clustering Algorithm for Medical Image Analysis
title_short Functional Principal Component Analysis and Randomized Sparse Clustering Algorithm for Medical Image Analysis
title_sort functional principal component analysis and randomized sparse clustering algorithm for medical image analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4510534/
https://www.ncbi.nlm.nih.gov/pubmed/26196383
http://dx.doi.org/10.1371/journal.pone.0132945
work_keys_str_mv AT linnan functionalprincipalcomponentanalysisandrandomizedsparseclusteringalgorithmformedicalimageanalysis
AT jiangjunhai functionalprincipalcomponentanalysisandrandomizedsparseclusteringalgorithmformedicalimageanalysis
AT guoshicheng functionalprincipalcomponentanalysisandrandomizedsparseclusteringalgorithmformedicalimageanalysis
AT xiongmomiao functionalprincipalcomponentanalysisandrandomizedsparseclusteringalgorithmformedicalimageanalysis