Cargando…

Molecular phenotyping using networks, diffusion, and topology: soft tissue sarcoma

Many biological datasets are high-dimensional yet manifest an underlying order. In this paper, we describe an unsupervised data analysis methodology that operates in the setting of a multivariate dataset and a network which expresses influence between the variables of the given set. The technique in...

Descripción completa

Detalles Bibliográficos
Autores principales: Mathews, James C., Pouryahya, Maryam, Moosmüller, Caroline, Kevrekidis, Yannis G., Deasy, Joseph O., Tannenbaum, Allen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6764992/
https://www.ncbi.nlm.nih.gov/pubmed/31562358
http://dx.doi.org/10.1038/s41598-019-50300-2
_version_ 1783454480827678720
author Mathews, James C.
Pouryahya, Maryam
Moosmüller, Caroline
Kevrekidis, Yannis G.
Deasy, Joseph O.
Tannenbaum, Allen
author_facet Mathews, James C.
Pouryahya, Maryam
Moosmüller, Caroline
Kevrekidis, Yannis G.
Deasy, Joseph O.
Tannenbaum, Allen
author_sort Mathews, James C.
collection PubMed
description Many biological datasets are high-dimensional yet manifest an underlying order. In this paper, we describe an unsupervised data analysis methodology that operates in the setting of a multivariate dataset and a network which expresses influence between the variables of the given set. The technique involves network geometry employing the Wasserstein distance, global spectral analysis in the form of diffusion maps, and topological data analysis using the Mapper algorithm. The prototypical application is to gene expression profiles obtained from RNA-Seq experiments on a collection of tissue samples, considering only genes whose protein products participate in a known pathway or network of interest. Employing the technique, we discern several coherent states or signatures displayed by the gene expression profiles of the sarcomas in the Cancer Genome Atlas along the TP53 (p53) signaling network. The signatures substantially recover the leiomyosarcoma, dedifferentiated liposarcoma (DDLPS), and synovial sarcoma histological subtype diagnoses, and they also include a new signature defined by activation and inactivation of about a dozen genes, including activation of serine endopeptidase inhibitor SERPINE1 and inactivation of TP53-family tumor suppressor gene TP73.
format Online
Article
Text
id pubmed-6764992
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-67649922019-10-02 Molecular phenotyping using networks, diffusion, and topology: soft tissue sarcoma Mathews, James C. Pouryahya, Maryam Moosmüller, Caroline Kevrekidis, Yannis G. Deasy, Joseph O. Tannenbaum, Allen Sci Rep Article Many biological datasets are high-dimensional yet manifest an underlying order. In this paper, we describe an unsupervised data analysis methodology that operates in the setting of a multivariate dataset and a network which expresses influence between the variables of the given set. The technique involves network geometry employing the Wasserstein distance, global spectral analysis in the form of diffusion maps, and topological data analysis using the Mapper algorithm. The prototypical application is to gene expression profiles obtained from RNA-Seq experiments on a collection of tissue samples, considering only genes whose protein products participate in a known pathway or network of interest. Employing the technique, we discern several coherent states or signatures displayed by the gene expression profiles of the sarcomas in the Cancer Genome Atlas along the TP53 (p53) signaling network. The signatures substantially recover the leiomyosarcoma, dedifferentiated liposarcoma (DDLPS), and synovial sarcoma histological subtype diagnoses, and they also include a new signature defined by activation and inactivation of about a dozen genes, including activation of serine endopeptidase inhibitor SERPINE1 and inactivation of TP53-family tumor suppressor gene TP73. Nature Publishing Group UK 2019-09-27 /pmc/articles/PMC6764992/ /pubmed/31562358 http://dx.doi.org/10.1038/s41598-019-50300-2 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Mathews, James C.
Pouryahya, Maryam
Moosmüller, Caroline
Kevrekidis, Yannis G.
Deasy, Joseph O.
Tannenbaum, Allen
Molecular phenotyping using networks, diffusion, and topology: soft tissue sarcoma
title Molecular phenotyping using networks, diffusion, and topology: soft tissue sarcoma
title_full Molecular phenotyping using networks, diffusion, and topology: soft tissue sarcoma
title_fullStr Molecular phenotyping using networks, diffusion, and topology: soft tissue sarcoma
title_full_unstemmed Molecular phenotyping using networks, diffusion, and topology: soft tissue sarcoma
title_short Molecular phenotyping using networks, diffusion, and topology: soft tissue sarcoma
title_sort molecular phenotyping using networks, diffusion, and topology: soft tissue sarcoma
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6764992/
https://www.ncbi.nlm.nih.gov/pubmed/31562358
http://dx.doi.org/10.1038/s41598-019-50300-2
work_keys_str_mv AT mathewsjamesc molecularphenotypingusingnetworksdiffusionandtopologysofttissuesarcoma
AT pouryahyamaryam molecularphenotypingusingnetworksdiffusionandtopologysofttissuesarcoma
AT moosmullercaroline molecularphenotypingusingnetworksdiffusionandtopologysofttissuesarcoma
AT kevrekidisyannisg molecularphenotypingusingnetworksdiffusionandtopologysofttissuesarcoma
AT deasyjosepho molecularphenotypingusingnetworksdiffusionandtopologysofttissuesarcoma
AT tannenbaumallen molecularphenotypingusingnetworksdiffusionandtopologysofttissuesarcoma