Cargando…

Molecular heterogeneity at the network level: high-dimensional testing, clustering and a TCGA case study

MOTIVATION: Molecular pathways and networks play a key role in basic and disease biology. An emerging notion is that networks encoding patterns of molecular interplay may themselves differ between contexts, such as cell type, tissue or disease (sub)type. However, while statistical testing of differe...

Descripción completa

Detalles Bibliográficos
Autores principales: Städler, Nicolas, Dondelinger, Frank, Hill, Steven M, Akbani, Rehan, Lu, Yiling, Mills, Gordon B, Mukherjee, Sach
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5590725/
https://www.ncbi.nlm.nih.gov/pubmed/28535188
http://dx.doi.org/10.1093/bioinformatics/btx322
_version_ 1783262576422944768
author Städler, Nicolas
Dondelinger, Frank
Hill, Steven M
Akbani, Rehan
Lu, Yiling
Mills, Gordon B
Mukherjee, Sach
author_facet Städler, Nicolas
Dondelinger, Frank
Hill, Steven M
Akbani, Rehan
Lu, Yiling
Mills, Gordon B
Mukherjee, Sach
author_sort Städler, Nicolas
collection PubMed
description MOTIVATION: Molecular pathways and networks play a key role in basic and disease biology. An emerging notion is that networks encoding patterns of molecular interplay may themselves differ between contexts, such as cell type, tissue or disease (sub)type. However, while statistical testing of differences in mean expression levels has been extensively studied, testing of network differences remains challenging. Furthermore, since network differences could provide important and biologically interpretable information to identify molecular subgroups, there is a need to consider the unsupervised task of learning subgroups and networks that define them. This is a nontrivial clustering problem, with neither subgroups nor subgroup-specific networks known at the outset. RESULTS: We leverage recent ideas from high-dimensional statistics for testing and clustering in the network biology setting. The methods we describe can be applied directly to most continuous molecular measurements and networks do not need to be specified beforehand. We illustrate the ideas and methods in a case study using protein data from The Cancer Genome Atlas (TCGA). This provides evidence that patterns of interplay between signalling proteins differ significantly between cancer types. Furthermore, we show how the proposed approaches can be used to learn subtypes and the molecular networks that define them. AVAILABILITY AND IMPLEMENTATION: As the Bioconductor package nethet. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5590725
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-55907252017-09-15 Molecular heterogeneity at the network level: high-dimensional testing, clustering and a TCGA case study Städler, Nicolas Dondelinger, Frank Hill, Steven M Akbani, Rehan Lu, Yiling Mills, Gordon B Mukherjee, Sach Bioinformatics Original Papers MOTIVATION: Molecular pathways and networks play a key role in basic and disease biology. An emerging notion is that networks encoding patterns of molecular interplay may themselves differ between contexts, such as cell type, tissue or disease (sub)type. However, while statistical testing of differences in mean expression levels has been extensively studied, testing of network differences remains challenging. Furthermore, since network differences could provide important and biologically interpretable information to identify molecular subgroups, there is a need to consider the unsupervised task of learning subgroups and networks that define them. This is a nontrivial clustering problem, with neither subgroups nor subgroup-specific networks known at the outset. RESULTS: We leverage recent ideas from high-dimensional statistics for testing and clustering in the network biology setting. The methods we describe can be applied directly to most continuous molecular measurements and networks do not need to be specified beforehand. We illustrate the ideas and methods in a case study using protein data from The Cancer Genome Atlas (TCGA). This provides evidence that patterns of interplay between signalling proteins differ significantly between cancer types. Furthermore, we show how the proposed approaches can be used to learn subtypes and the molecular networks that define them. AVAILABILITY AND IMPLEMENTATION: As the Bioconductor package nethet. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-09-15 2017-05-23 /pmc/articles/PMC5590725/ /pubmed/28535188 http://dx.doi.org/10.1093/bioinformatics/btx322 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Städler, Nicolas
Dondelinger, Frank
Hill, Steven M
Akbani, Rehan
Lu, Yiling
Mills, Gordon B
Mukherjee, Sach
Molecular heterogeneity at the network level: high-dimensional testing, clustering and a TCGA case study
title Molecular heterogeneity at the network level: high-dimensional testing, clustering and a TCGA case study
title_full Molecular heterogeneity at the network level: high-dimensional testing, clustering and a TCGA case study
title_fullStr Molecular heterogeneity at the network level: high-dimensional testing, clustering and a TCGA case study
title_full_unstemmed Molecular heterogeneity at the network level: high-dimensional testing, clustering and a TCGA case study
title_short Molecular heterogeneity at the network level: high-dimensional testing, clustering and a TCGA case study
title_sort molecular heterogeneity at the network level: high-dimensional testing, clustering and a tcga case study
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5590725/
https://www.ncbi.nlm.nih.gov/pubmed/28535188
http://dx.doi.org/10.1093/bioinformatics/btx322
work_keys_str_mv AT stadlernicolas molecularheterogeneityatthenetworklevelhighdimensionaltestingclusteringandatcgacasestudy
AT dondelingerfrank molecularheterogeneityatthenetworklevelhighdimensionaltestingclusteringandatcgacasestudy
AT hillstevenm molecularheterogeneityatthenetworklevelhighdimensionaltestingclusteringandatcgacasestudy
AT akbanirehan molecularheterogeneityatthenetworklevelhighdimensionaltestingclusteringandatcgacasestudy
AT luyiling molecularheterogeneityatthenetworklevelhighdimensionaltestingclusteringandatcgacasestudy
AT millsgordonb molecularheterogeneityatthenetworklevelhighdimensionaltestingclusteringandatcgacasestudy
AT mukherjeesach molecularheterogeneityatthenetworklevelhighdimensionaltestingclusteringandatcgacasestudy