Cargando…
Mining breast cancer genes with a network based noise-tolerant approach
BACKGROUND: Mining novel breast cancer genes is an important task in breast cancer research. Many approaches prioritize candidate genes based on their similarity to known cancer genes, usually by integrating multiple data sources. However, different types of data often contain varying degrees of noi...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3702465/ https://www.ncbi.nlm.nih.gov/pubmed/23799982 http://dx.doi.org/10.1186/1752-0509-7-49 |
_version_ | 1782275812887625728 |
---|---|
author | Nie, Yaling Yu, Jingkai |
author_facet | Nie, Yaling Yu, Jingkai |
author_sort | Nie, Yaling |
collection | PubMed |
description | BACKGROUND: Mining novel breast cancer genes is an important task in breast cancer research. Many approaches prioritize candidate genes based on their similarity to known cancer genes, usually by integrating multiple data sources. However, different types of data often contain varying degrees of noise. For effective data integration, it’s important to design methods that work robustly with respect to noise. RESULTS: Gene Ontology (GO) annotations were often utilized in cancer gene mining works. However, the vast majority of GO annotations were computationally derived, thus not completely accurate. A set of genes annotated with breast cancer enriched GO terms was adopted here as a set of source data with realistic noise. A novel noise tolerant approach was proposed to rank candidate breast cancer genes using noisy source data within the framework of a comprehensive human Protein-Protein Interaction (PPI) network. Performance of the proposed method was quantitatively evaluated by comparing it with the more established random walk approach. Results showed that the proposed method exhibited better performance in ranking known breast cancer genes and higher robustness against data noise than the random walk approach. When noise started to increase, the proposed method was able to maintained relatively stable performance, while the random walk approach showed drastic performance decline; when noise increased to a large extent, the proposed method was still able to achieve better performance than random walk did. CONCLUSIONS: A novel noise tolerant method was proposed to mine breast cancer genes. Compared to the well established random walk approach, it showed better performance in correctly ranking cancer genes and worked robustly with respect to noise within source data. To the best of our knowledge, it’s the first such effort to quantitatively analyze noise tolerance between different breast cancer gene mining methods. The sorted gene list can be valuable for breast cancer research. The proposed quantitative noise analysis method may also prove useful for other data integration efforts. It is hoped that the current work can lead to more discussions about influence of data noise on different computational methods for mining disease genes. |
format | Online Article Text |
id | pubmed-3702465 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-37024652013-07-10 Mining breast cancer genes with a network based noise-tolerant approach Nie, Yaling Yu, Jingkai BMC Syst Biol Research Article BACKGROUND: Mining novel breast cancer genes is an important task in breast cancer research. Many approaches prioritize candidate genes based on their similarity to known cancer genes, usually by integrating multiple data sources. However, different types of data often contain varying degrees of noise. For effective data integration, it’s important to design methods that work robustly with respect to noise. RESULTS: Gene Ontology (GO) annotations were often utilized in cancer gene mining works. However, the vast majority of GO annotations were computationally derived, thus not completely accurate. A set of genes annotated with breast cancer enriched GO terms was adopted here as a set of source data with realistic noise. A novel noise tolerant approach was proposed to rank candidate breast cancer genes using noisy source data within the framework of a comprehensive human Protein-Protein Interaction (PPI) network. Performance of the proposed method was quantitatively evaluated by comparing it with the more established random walk approach. Results showed that the proposed method exhibited better performance in ranking known breast cancer genes and higher robustness against data noise than the random walk approach. When noise started to increase, the proposed method was able to maintained relatively stable performance, while the random walk approach showed drastic performance decline; when noise increased to a large extent, the proposed method was still able to achieve better performance than random walk did. CONCLUSIONS: A novel noise tolerant method was proposed to mine breast cancer genes. Compared to the well established random walk approach, it showed better performance in correctly ranking cancer genes and worked robustly with respect to noise within source data. To the best of our knowledge, it’s the first such effort to quantitatively analyze noise tolerance between different breast cancer gene mining methods. The sorted gene list can be valuable for breast cancer research. The proposed quantitative noise analysis method may also prove useful for other data integration efforts. It is hoped that the current work can lead to more discussions about influence of data noise on different computational methods for mining disease genes. BioMed Central 2013-06-25 /pmc/articles/PMC3702465/ /pubmed/23799982 http://dx.doi.org/10.1186/1752-0509-7-49 Text en Copyright © 2013 Nie and Yu; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Nie, Yaling Yu, Jingkai Mining breast cancer genes with a network based noise-tolerant approach |
title | Mining breast cancer genes with a network based noise-tolerant approach |
title_full | Mining breast cancer genes with a network based noise-tolerant approach |
title_fullStr | Mining breast cancer genes with a network based noise-tolerant approach |
title_full_unstemmed | Mining breast cancer genes with a network based noise-tolerant approach |
title_short | Mining breast cancer genes with a network based noise-tolerant approach |
title_sort | mining breast cancer genes with a network based noise-tolerant approach |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3702465/ https://www.ncbi.nlm.nih.gov/pubmed/23799982 http://dx.doi.org/10.1186/1752-0509-7-49 |
work_keys_str_mv | AT nieyaling miningbreastcancergeneswithanetworkbasednoisetolerantapproach AT yujingkai miningbreastcancergeneswithanetworkbasednoisetolerantapproach |