Cargando…

SPSNet: subpopulation-sensitive network-based analysis of heterogeneous gene expression data

BACKGROUND: Transcriptomic datasets often contain undeclared heterogeneity arising from biological variation such as diversity of disease subtypes, treatment subgroups, time-series gene expression, nested experimental conditions, as well as technical variation due to batch effects, platform differen...

Descripción completa

Detalles Bibliográficos
Autores principales: Belorkar, Abha, Vadigepalli, Rajanikanth, Wong, Limsoon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5861489/
https://www.ncbi.nlm.nih.gov/pubmed/29560831
http://dx.doi.org/10.1186/s12918-018-0538-1
_version_ 1783308106857447424
author Belorkar, Abha
Vadigepalli, Rajanikanth
Wong, Limsoon
author_facet Belorkar, Abha
Vadigepalli, Rajanikanth
Wong, Limsoon
author_sort Belorkar, Abha
collection PubMed
description BACKGROUND: Transcriptomic datasets often contain undeclared heterogeneity arising from biological variation such as diversity of disease subtypes, treatment subgroups, time-series gene expression, nested experimental conditions, as well as technical variation due to batch effects, platform differences in integrated meta-analyses, etc. However, current analysis approaches are primarily designed to handle comparisons between experimental conditions represented by homogeneous samples, thus precluding the discovery of underlying subphenotypes. Unsupervised methods for subtype identification are typically based on individual gene level analysis, which often result in irreproducible gene signatures for potential subtypes. Emerging methods to study heterogeneity have been largely developed in the context of single-cell datasets containing hundreds to thousands of samples, limiting their use to select contexts. RESULTS: We present a novel analysis method, SPSNet, which identifies subtype-specific gene expression signatures based on the activity of subnetworks in biological pathways. SPSNet identifies the gene subnetworks capturing the diversity of underlying biological mechanisms, indicating potential sample subphenotypes. In the presence of extrinsic or non-biological heterogeneity (e.g. batch effects), SPSNet identifies subnetworks that are particularly affected by such variation, thus helping eliminate factors irrelevant to the biology of the phenotypes under study. CONCLUSION: Using multiple publicly available datasets, we illustrate that SPSNet is able to consistently uncover patterns within gene expression data that correspond to meaningful heterogeneity of various origins. We also demonstrate the performance of SPSNet as a sensitive and reliable tool for understanding the structure and nature of such heterogeneity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12918-018-0538-1) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5861489
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-58614892018-03-26 SPSNet: subpopulation-sensitive network-based analysis of heterogeneous gene expression data Belorkar, Abha Vadigepalli, Rajanikanth Wong, Limsoon BMC Syst Biol Research BACKGROUND: Transcriptomic datasets often contain undeclared heterogeneity arising from biological variation such as diversity of disease subtypes, treatment subgroups, time-series gene expression, nested experimental conditions, as well as technical variation due to batch effects, platform differences in integrated meta-analyses, etc. However, current analysis approaches are primarily designed to handle comparisons between experimental conditions represented by homogeneous samples, thus precluding the discovery of underlying subphenotypes. Unsupervised methods for subtype identification are typically based on individual gene level analysis, which often result in irreproducible gene signatures for potential subtypes. Emerging methods to study heterogeneity have been largely developed in the context of single-cell datasets containing hundreds to thousands of samples, limiting their use to select contexts. RESULTS: We present a novel analysis method, SPSNet, which identifies subtype-specific gene expression signatures based on the activity of subnetworks in biological pathways. SPSNet identifies the gene subnetworks capturing the diversity of underlying biological mechanisms, indicating potential sample subphenotypes. In the presence of extrinsic or non-biological heterogeneity (e.g. batch effects), SPSNet identifies subnetworks that are particularly affected by such variation, thus helping eliminate factors irrelevant to the biology of the phenotypes under study. CONCLUSION: Using multiple publicly available datasets, we illustrate that SPSNet is able to consistently uncover patterns within gene expression data that correspond to meaningful heterogeneity of various origins. We also demonstrate the performance of SPSNet as a sensitive and reliable tool for understanding the structure and nature of such heterogeneity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12918-018-0538-1) contains supplementary material, which is available to authorized users. BioMed Central 2018-03-19 /pmc/articles/PMC5861489/ /pubmed/29560831 http://dx.doi.org/10.1186/s12918-018-0538-1 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Belorkar, Abha
Vadigepalli, Rajanikanth
Wong, Limsoon
SPSNet: subpopulation-sensitive network-based analysis of heterogeneous gene expression data
title SPSNet: subpopulation-sensitive network-based analysis of heterogeneous gene expression data
title_full SPSNet: subpopulation-sensitive network-based analysis of heterogeneous gene expression data
title_fullStr SPSNet: subpopulation-sensitive network-based analysis of heterogeneous gene expression data
title_full_unstemmed SPSNet: subpopulation-sensitive network-based analysis of heterogeneous gene expression data
title_short SPSNet: subpopulation-sensitive network-based analysis of heterogeneous gene expression data
title_sort spsnet: subpopulation-sensitive network-based analysis of heterogeneous gene expression data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5861489/
https://www.ncbi.nlm.nih.gov/pubmed/29560831
http://dx.doi.org/10.1186/s12918-018-0538-1
work_keys_str_mv AT belorkarabha spsnetsubpopulationsensitivenetworkbasedanalysisofheterogeneousgeneexpressiondata
AT vadigepallirajanikanth spsnetsubpopulationsensitivenetworkbasedanalysisofheterogeneousgeneexpressiondata
AT wonglimsoon spsnetsubpopulationsensitivenetworkbasedanalysisofheterogeneousgeneexpressiondata