Cargando…

Multiple‐cumulative probabilities used to cluster and visualize transcriptomes

Analysis of gene expression data by clustering and visualizing played a central role in obtaining biological knowledge. Here, we used Pearson's correlation coefficient of multiple‐cumulative probabilities (PCC‐MCP) of genes to define the similarity of gene expression behaviors. To answer the ch...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Xingang, Liu, Yisu, Han, Qiuhong, Lu, Zuhong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5715267/
https://www.ncbi.nlm.nih.gov/pubmed/29226087
http://dx.doi.org/10.1002/2211-5463.12327
_version_ 1783283731717423104
author Jia, Xingang
Liu, Yisu
Han, Qiuhong
Lu, Zuhong
author_facet Jia, Xingang
Liu, Yisu
Han, Qiuhong
Lu, Zuhong
author_sort Jia, Xingang
collection PubMed
description Analysis of gene expression data by clustering and visualizing played a central role in obtaining biological knowledge. Here, we used Pearson's correlation coefficient of multiple‐cumulative probabilities (PCC‐MCP) of genes to define the similarity of gene expression behaviors. To answer the challenge of the high‐dimensional MCPs, we used icc‐cluster, a clustering algorithm that obtained solutions by iterating clustering centers, with PCC‐MCP to group genes. We then used t‐statistic stochastic neighbor embedding (t‐SNE) of KC‐data to generate optimal maps for clusters of MCP (t‐SNE‐MCP‐O maps). From the analysis of several transcriptome data sets, we demonstrated clear advantages for using icc‐cluster with PCC‐MCP over commonly used clustering methods. t‐SNE‐MCP‐O was also shown to give clearly projecting boundaries for clusters of PCC‐MCP, which made the relationships between clusters easy to visualize and understand.
format Online
Article
Text
id pubmed-5715267
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-57152672017-12-08 Multiple‐cumulative probabilities used to cluster and visualize transcriptomes Jia, Xingang Liu, Yisu Han, Qiuhong Lu, Zuhong FEBS Open Bio Methods Analysis of gene expression data by clustering and visualizing played a central role in obtaining biological knowledge. Here, we used Pearson's correlation coefficient of multiple‐cumulative probabilities (PCC‐MCP) of genes to define the similarity of gene expression behaviors. To answer the challenge of the high‐dimensional MCPs, we used icc‐cluster, a clustering algorithm that obtained solutions by iterating clustering centers, with PCC‐MCP to group genes. We then used t‐statistic stochastic neighbor embedding (t‐SNE) of KC‐data to generate optimal maps for clusters of MCP (t‐SNE‐MCP‐O maps). From the analysis of several transcriptome data sets, we demonstrated clear advantages for using icc‐cluster with PCC‐MCP over commonly used clustering methods. t‐SNE‐MCP‐O was also shown to give clearly projecting boundaries for clusters of PCC‐MCP, which made the relationships between clusters easy to visualize and understand. John Wiley and Sons Inc. 2017-11-13 /pmc/articles/PMC5715267/ /pubmed/29226087 http://dx.doi.org/10.1002/2211-5463.12327 Text en © 2017 The Authors. Published by FEBS Press and John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods
Jia, Xingang
Liu, Yisu
Han, Qiuhong
Lu, Zuhong
Multiple‐cumulative probabilities used to cluster and visualize transcriptomes
title Multiple‐cumulative probabilities used to cluster and visualize transcriptomes
title_full Multiple‐cumulative probabilities used to cluster and visualize transcriptomes
title_fullStr Multiple‐cumulative probabilities used to cluster and visualize transcriptomes
title_full_unstemmed Multiple‐cumulative probabilities used to cluster and visualize transcriptomes
title_short Multiple‐cumulative probabilities used to cluster and visualize transcriptomes
title_sort multiple‐cumulative probabilities used to cluster and visualize transcriptomes
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5715267/
https://www.ncbi.nlm.nih.gov/pubmed/29226087
http://dx.doi.org/10.1002/2211-5463.12327
work_keys_str_mv AT jiaxingang multiplecumulativeprobabilitiesusedtoclusterandvisualizetranscriptomes
AT liuyisu multiplecumulativeprobabilitiesusedtoclusterandvisualizetranscriptomes
AT hanqiuhong multiplecumulativeprobabilitiesusedtoclusterandvisualizetranscriptomes
AT luzuhong multiplecumulativeprobabilitiesusedtoclusterandvisualizetranscriptomes