Cargando…
Multiple‐cumulative probabilities used to cluster and visualize transcriptomes
Analysis of gene expression data by clustering and visualizing played a central role in obtaining biological knowledge. Here, we used Pearson's correlation coefficient of multiple‐cumulative probabilities (PCC‐MCP) of genes to define the similarity of gene expression behaviors. To answer the ch...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5715267/ https://www.ncbi.nlm.nih.gov/pubmed/29226087 http://dx.doi.org/10.1002/2211-5463.12327 |
_version_ | 1783283731717423104 |
---|---|
author | Jia, Xingang Liu, Yisu Han, Qiuhong Lu, Zuhong |
author_facet | Jia, Xingang Liu, Yisu Han, Qiuhong Lu, Zuhong |
author_sort | Jia, Xingang |
collection | PubMed |
description | Analysis of gene expression data by clustering and visualizing played a central role in obtaining biological knowledge. Here, we used Pearson's correlation coefficient of multiple‐cumulative probabilities (PCC‐MCP) of genes to define the similarity of gene expression behaviors. To answer the challenge of the high‐dimensional MCPs, we used icc‐cluster, a clustering algorithm that obtained solutions by iterating clustering centers, with PCC‐MCP to group genes. We then used t‐statistic stochastic neighbor embedding (t‐SNE) of KC‐data to generate optimal maps for clusters of MCP (t‐SNE‐MCP‐O maps). From the analysis of several transcriptome data sets, we demonstrated clear advantages for using icc‐cluster with PCC‐MCP over commonly used clustering methods. t‐SNE‐MCP‐O was also shown to give clearly projecting boundaries for clusters of PCC‐MCP, which made the relationships between clusters easy to visualize and understand. |
format | Online Article Text |
id | pubmed-5715267 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-57152672017-12-08 Multiple‐cumulative probabilities used to cluster and visualize transcriptomes Jia, Xingang Liu, Yisu Han, Qiuhong Lu, Zuhong FEBS Open Bio Methods Analysis of gene expression data by clustering and visualizing played a central role in obtaining biological knowledge. Here, we used Pearson's correlation coefficient of multiple‐cumulative probabilities (PCC‐MCP) of genes to define the similarity of gene expression behaviors. To answer the challenge of the high‐dimensional MCPs, we used icc‐cluster, a clustering algorithm that obtained solutions by iterating clustering centers, with PCC‐MCP to group genes. We then used t‐statistic stochastic neighbor embedding (t‐SNE) of KC‐data to generate optimal maps for clusters of MCP (t‐SNE‐MCP‐O maps). From the analysis of several transcriptome data sets, we demonstrated clear advantages for using icc‐cluster with PCC‐MCP over commonly used clustering methods. t‐SNE‐MCP‐O was also shown to give clearly projecting boundaries for clusters of PCC‐MCP, which made the relationships between clusters easy to visualize and understand. John Wiley and Sons Inc. 2017-11-13 /pmc/articles/PMC5715267/ /pubmed/29226087 http://dx.doi.org/10.1002/2211-5463.12327 Text en © 2017 The Authors. Published by FEBS Press and John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Jia, Xingang Liu, Yisu Han, Qiuhong Lu, Zuhong Multiple‐cumulative probabilities used to cluster and visualize transcriptomes |
title | Multiple‐cumulative probabilities used to cluster and visualize transcriptomes |
title_full | Multiple‐cumulative probabilities used to cluster and visualize transcriptomes |
title_fullStr | Multiple‐cumulative probabilities used to cluster and visualize transcriptomes |
title_full_unstemmed | Multiple‐cumulative probabilities used to cluster and visualize transcriptomes |
title_short | Multiple‐cumulative probabilities used to cluster and visualize transcriptomes |
title_sort | multiple‐cumulative probabilities used to cluster and visualize transcriptomes |
topic | Methods |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5715267/ https://www.ncbi.nlm.nih.gov/pubmed/29226087 http://dx.doi.org/10.1002/2211-5463.12327 |
work_keys_str_mv | AT jiaxingang multiplecumulativeprobabilitiesusedtoclusterandvisualizetranscriptomes AT liuyisu multiplecumulativeprobabilitiesusedtoclusterandvisualizetranscriptomes AT hanqiuhong multiplecumulativeprobabilitiesusedtoclusterandvisualizetranscriptomes AT luzuhong multiplecumulativeprobabilitiesusedtoclusterandvisualizetranscriptomes |