Cargando…

The biological knowledge discovery by PCCF measure and PCA-F projection

In the process of biological knowledge discovery, PCA is commonly used to complement the clustering analysis, but PCA typically gives the poor visualizations for most gene expression data sets. Here, we propose a PCCF measure, and use PCA-F to display clusters of PCCF, where PCCF and PCA-F are model...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Xingang, Zhu, Guanqun, Han, Qiuhong, Lu, Zuhong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5388332/
https://www.ncbi.nlm.nih.gov/pubmed/28399180
http://dx.doi.org/10.1371/journal.pone.0175104
_version_ 1782521113411059712
author Jia, Xingang
Zhu, Guanqun
Han, Qiuhong
Lu, Zuhong
author_facet Jia, Xingang
Zhu, Guanqun
Han, Qiuhong
Lu, Zuhong
author_sort Jia, Xingang
collection PubMed
description In the process of biological knowledge discovery, PCA is commonly used to complement the clustering analysis, but PCA typically gives the poor visualizations for most gene expression data sets. Here, we propose a PCCF measure, and use PCA-F to display clusters of PCCF, where PCCF and PCA-F are modeled from the modified cumulative probabilities of genes. From the analysis of simulated and experimental data sets, we demonstrate that PCCF is more appropriate and reliable for analyzing gene expression data compared to other commonly used distances or similarity measures, and PCA-F is a good visualization technique for identifying clusters of PCCF, where we aim at such data sets that the expression values of genes are collected at different time points.
format Online
Article
Text
id pubmed-5388332
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-53883322017-05-03 The biological knowledge discovery by PCCF measure and PCA-F projection Jia, Xingang Zhu, Guanqun Han, Qiuhong Lu, Zuhong PLoS One Research Article In the process of biological knowledge discovery, PCA is commonly used to complement the clustering analysis, but PCA typically gives the poor visualizations for most gene expression data sets. Here, we propose a PCCF measure, and use PCA-F to display clusters of PCCF, where PCCF and PCA-F are modeled from the modified cumulative probabilities of genes. From the analysis of simulated and experimental data sets, we demonstrate that PCCF is more appropriate and reliable for analyzing gene expression data compared to other commonly used distances or similarity measures, and PCA-F is a good visualization technique for identifying clusters of PCCF, where we aim at such data sets that the expression values of genes are collected at different time points. Public Library of Science 2017-04-11 /pmc/articles/PMC5388332/ /pubmed/28399180 http://dx.doi.org/10.1371/journal.pone.0175104 Text en © 2017 Jia et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Jia, Xingang
Zhu, Guanqun
Han, Qiuhong
Lu, Zuhong
The biological knowledge discovery by PCCF measure and PCA-F projection
title The biological knowledge discovery by PCCF measure and PCA-F projection
title_full The biological knowledge discovery by PCCF measure and PCA-F projection
title_fullStr The biological knowledge discovery by PCCF measure and PCA-F projection
title_full_unstemmed The biological knowledge discovery by PCCF measure and PCA-F projection
title_short The biological knowledge discovery by PCCF measure and PCA-F projection
title_sort biological knowledge discovery by pccf measure and pca-f projection
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5388332/
https://www.ncbi.nlm.nih.gov/pubmed/28399180
http://dx.doi.org/10.1371/journal.pone.0175104
work_keys_str_mv AT jiaxingang thebiologicalknowledgediscoverybypccfmeasureandpcafprojection
AT zhuguanqun thebiologicalknowledgediscoverybypccfmeasureandpcafprojection
AT hanqiuhong thebiologicalknowledgediscoverybypccfmeasureandpcafprojection
AT luzuhong thebiologicalknowledgediscoverybypccfmeasureandpcafprojection
AT jiaxingang biologicalknowledgediscoverybypccfmeasureandpcafprojection
AT zhuguanqun biologicalknowledgediscoverybypccfmeasureandpcafprojection
AT hanqiuhong biologicalknowledgediscoverybypccfmeasureandpcafprojection
AT luzuhong biologicalknowledgediscoverybypccfmeasureandpcafprojection