Cargando…

Identification of differentially expressed subnetworks based on multivariate ANOVA

BACKGROUND: Since high-throughput protein-protein interaction (PPI) data has recently become available for humans, there has been a growing interest in combining PPI data with other genome-wide data. In particular, the identification of phenotype-related PPI subnetworks using gene expression data ha...

Descripción completa

Detalles Bibliográficos
Autores principales: Hwang, Taeyoung, Park, Taesung
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2696448/
https://www.ncbi.nlm.nih.gov/pubmed/19405941
http://dx.doi.org/10.1186/1471-2105-10-128
_version_ 1782168267318624256
author Hwang, Taeyoung
Park, Taesung
author_facet Hwang, Taeyoung
Park, Taesung
author_sort Hwang, Taeyoung
collection PubMed
description BACKGROUND: Since high-throughput protein-protein interaction (PPI) data has recently become available for humans, there has been a growing interest in combining PPI data with other genome-wide data. In particular, the identification of phenotype-related PPI subnetworks using gene expression data has been of great concern. Successful integration for the identification of significant subnetworks requires the use of a search algorithm with a proper scoring method. Here we propose a multivariate analysis of variance (MANOVA)-based scoring method with a greedy search for identifying differentially expressed PPI subnetworks. RESULTS: Given the MANOVA-based scoring method, we performed a greedy search to identify the subnetworks with the maximum scores in the PPI network. Our approach was successfully applied to human microarray datasets. Each identified subnetwork was annotated with the Gene Ontology (GO) term, resulting in the phenotype-related functional pathway or complex. We also compared these results with those of other scoring methods such as t statistic- and mutual information-based scoring methods. The MANOVA-based method produced subnetworks with a larger number of proteins than the other methods. Furthermore, the subnetworks identified by the MANOVA-based method tended to consist of highly correlated proteins. CONCLUSION: This article proposes a MANOVA-based scoring method to combine PPI data with expression data using a greedy search. This method is recommended for the highly sensitive detection of large subnetworks.
format Text
id pubmed-2696448
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26964482009-06-16 Identification of differentially expressed subnetworks based on multivariate ANOVA Hwang, Taeyoung Park, Taesung BMC Bioinformatics Methodology Article BACKGROUND: Since high-throughput protein-protein interaction (PPI) data has recently become available for humans, there has been a growing interest in combining PPI data with other genome-wide data. In particular, the identification of phenotype-related PPI subnetworks using gene expression data has been of great concern. Successful integration for the identification of significant subnetworks requires the use of a search algorithm with a proper scoring method. Here we propose a multivariate analysis of variance (MANOVA)-based scoring method with a greedy search for identifying differentially expressed PPI subnetworks. RESULTS: Given the MANOVA-based scoring method, we performed a greedy search to identify the subnetworks with the maximum scores in the PPI network. Our approach was successfully applied to human microarray datasets. Each identified subnetwork was annotated with the Gene Ontology (GO) term, resulting in the phenotype-related functional pathway or complex. We also compared these results with those of other scoring methods such as t statistic- and mutual information-based scoring methods. The MANOVA-based method produced subnetworks with a larger number of proteins than the other methods. Furthermore, the subnetworks identified by the MANOVA-based method tended to consist of highly correlated proteins. CONCLUSION: This article proposes a MANOVA-based scoring method to combine PPI data with expression data using a greedy search. This method is recommended for the highly sensitive detection of large subnetworks. BioMed Central 2009-04-30 /pmc/articles/PMC2696448/ /pubmed/19405941 http://dx.doi.org/10.1186/1471-2105-10-128 Text en Copyright © 2009 Hwang and Park; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Hwang, Taeyoung
Park, Taesung
Identification of differentially expressed subnetworks based on multivariate ANOVA
title Identification of differentially expressed subnetworks based on multivariate ANOVA
title_full Identification of differentially expressed subnetworks based on multivariate ANOVA
title_fullStr Identification of differentially expressed subnetworks based on multivariate ANOVA
title_full_unstemmed Identification of differentially expressed subnetworks based on multivariate ANOVA
title_short Identification of differentially expressed subnetworks based on multivariate ANOVA
title_sort identification of differentially expressed subnetworks based on multivariate anova
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2696448/
https://www.ncbi.nlm.nih.gov/pubmed/19405941
http://dx.doi.org/10.1186/1471-2105-10-128
work_keys_str_mv AT hwangtaeyoung identificationofdifferentiallyexpressedsubnetworksbasedonmultivariateanova
AT parktaesung identificationofdifferentiallyexpressedsubnetworksbasedonmultivariateanova