
Identification of diagnostic subnetwork markers for cancer in human protein-protein interaction network

BACKGROUND: Finding reliable gene markers for accurate disease classification is very challenging due to a number of reasons, including the small sample size of typical clinical data, high noise in gene expression measurements, and the heterogeneity across patients. In fact, gene markers identified...


Bibliographic Details
Main authors: Su, Junjie; Yoon, Byung-Jun; Dougherty, Edward R
Format: Text
Language: English
Published: BioMed Central 2010
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3026382/
https://www.ncbi.nlm.nih.gov/pubmed/20946619
http://dx.doi.org/10.1186/1471-2105-11-S6-S8
_version_ 1782197040578560000
author Su, Junjie
Yoon, Byung-Jun
Dougherty, Edward R
collection PubMed
description BACKGROUND: Finding reliable gene markers for accurate disease classification is very challenging due to a number of reasons, including the small sample size of typical clinical data, high noise in gene expression measurements, and the heterogeneity across patients. In fact, gene markers identified in independent studies often do not coincide with each other, suggesting that many of the predicted markers may have no biological significance and may be simply artifacts of the analyzed dataset. To find more reliable and reproducible diagnostic markers, several studies proposed to analyze the gene expression data at the level of groups of functionally related genes, such as pathways. Studies have shown that pathway markers tend to be more robust and yield more accurate classification results. One practical problem of the pathway-based approach is the limited coverage of genes by currently known pathways. As a result, potentially important genes that play critical roles in cancer development may be excluded. To overcome this problem, we propose a novel method for identifying reliable subnetwork markers in a human protein-protein interaction (PPI) network.
RESULTS: In this method, we overlay the gene expression data with the PPI network and look for the most discriminative linear paths that consist of discriminative genes that are highly correlated to each other. The overlapping linear paths are then optimally combined into subnetworks that can potentially serve as effective diagnostic markers. We tested our method on two independent large-scale breast cancer datasets and compared the effectiveness and reproducibility of the identified subnetwork markers with gene-based and pathway-based markers. We also compared the proposed method with an existing subnetwork-based method.
CONCLUSIONS: The proposed method can efficiently find reliable subnetwork markers that outperform the gene-based and pathway-based markers in terms of discriminative power, reproducibility and classification performance. Subnetwork markers found by our method are highly enriched in common GO terms, and they can more accurately classify breast cancer metastasis compared to markers found by a previous method.
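The path-then-merge idea in the RESULTS paragraph can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify the scoring function or the search procedure, so the Welch t-statistic, the greedy extension, the correlation threshold `min_corr`, the toy network `ppi`, and the toy expression values in `expr` are all assumptions chosen only to make the example runnable.

```python
from math import sqrt
from statistics import mean, variance

def t_score(values, labels):
    """Absolute Welch t-statistic between class 0 and class 1 (assumed score)."""
    a = [v for v, y in zip(values, labels) if y == 0]
    b = [v for v, y in zip(values, labels) if y == 1]
    se = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return abs(mean(a) - mean(b)) / se if se > 0 else 0.0

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    num = sum((p - mx) * (q - my) for p, q in zip(x, y))
    den = sqrt(sum((p - mx) ** 2 for p in x) * sum((q - my) ** 2 for q in y))
    return num / den if den > 0 else 0.0

def path_activity(path, expr):
    """Per-sample mean expression over the genes on a path."""
    n = len(expr[path[0]])
    return [mean(expr[g][i] for g in path) for i in range(n)]

def grow_path(seed, ppi, expr, labels, max_len=4, min_corr=0.5):
    """Greedily extend a linear path from `seed` along PPI edges.

    A neighbor is eligible only if it correlates with the current path end,
    and it is accepted only if it raises the path's discriminative score.
    """
    path = [seed]
    best = t_score(path_activity(path, expr), labels)
    while len(path) < max_len:
        candidates = []
        for nb in ppi.get(path[-1], ()):
            if nb in path or pearson(expr[nb], expr[path[-1]]) < min_corr:
                continue
            score = t_score(path_activity(path + [nb], expr), labels)
            candidates.append((score, nb))
        if not candidates:
            break
        score, nb = max(candidates)
        if score <= best:
            break
        path.append(nb)
        best = score
    return path, best

def merge_overlapping(paths):
    """Union paths that share at least one gene into subnetworks."""
    subnetworks = []
    for p in paths:
        merged = set(p)
        rest = []
        for s in subnetworks:
            if s & merged:
                merged |= s
            else:
                rest.append(s)
        subnetworks = rest + [merged]
    return subnetworks

# Hypothetical toy data: 5 genes, 6 samples (3 per class).
ppi = {"A": ["B"], "B": ["A", "C"], "C": ["B"], "D": ["E"], "E": ["D"]}
labels = [0, 0, 0, 1, 1, 1]
expr = {
    "A": [1.0, 1.2, 0.9, 2.0, 2.3, 2.1],
    "B": [0.8, 1.1, 1.0, 2.2, 1.9, 2.4],
    "C": [1.5, 0.4, 1.2, 0.9, 1.4, 0.6],
    "D": [0.5, 0.7, 0.6, 1.6, 1.5, 1.8],
    "E": [0.6, 0.5, 0.8, 1.4, 1.7, 1.6],
}

paths = [grow_path(g, ppi, expr, labels)[0] for g in ("A", "D")]
subnetworks = merge_overlapping(paths)
```

In this toy setting, gene C is never added to a path because it is not positively correlated with its neighbor B, which mirrors the requirement that path members be both discriminative and mutually correlated. A faithful reproduction would need the scoring and combination rules from the full article.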
format Text
id pubmed-3026382
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3026382 2011-01-26 Identification of diagnostic subnetwork markers for cancer in human protein-protein interaction network Su, Junjie Yoon, Byung-Jun Dougherty, Edward R BMC Bioinformatics Proceedings BioMed Central 2010-10-07 /pmc/articles/PMC3026382/ /pubmed/20946619 http://dx.doi.org/10.1186/1471-2105-11-S6-S8 Text en Copyright ©2010 Yoon et al; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Identification of diagnostic subnetwork markers for cancer in human protein-protein interaction network
topic Proceedings