Cargando…

Predicting gene function using similarity learning

BACKGROUND: Computational methods that make use of heterogeneous biological datasets to predict gene function provide a cost-effective and rapid way for annotating genomes. A common framework shared by many such methods is to construct a combined functional association network from multiple networks...

Descripción completa

Detalles Bibliográficos
Autores principales: Phuong, Tu Minh, Nhung, Ngo Phuong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3849496/
https://www.ncbi.nlm.nih.gov/pubmed/24266903
http://dx.doi.org/10.1186/1471-2164-14-S4-S4
_version_ 1782293936318971904
author Phuong, Tu Minh
Nhung, Ngo Phuong
author_facet Phuong, Tu Minh
Nhung, Ngo Phuong
author_sort Phuong, Tu Minh
collection PubMed
description BACKGROUND: Computational methods that make use of heterogeneous biological datasets to predict gene function provide a cost-effective and rapid way for annotating genomes. A common framework shared by many such methods is to construct a combined functional association network from multiple networks representing different sources of data, and use this combined network as input to network-based or kernel-based learning algorithms. In these methods, a key factor contributing to the prediction accuracy is the network quality, which is the ability of the network to reflect the functional relatedness of gene pairs. To improve the network quality, a large effort has been spent on developing methods for network integration. These methods, however, produce networks, which then remain unchanged, and nearly no effort has been made to optimize the networks after their construction. RESULTS: Here, we propose an alternative method to improve the network quality. The proposed method takes as input a combined network produced by an existing network integration algorithm, and reconstructs this network to better represent the co-functionality relationships between gene pairs. At the core of the method is a learning algorithm that can learn a measure of functional similarity between genes, which we then use to reconstruct the input network. In experiments with yeast and human, the proposed method produced improved networks and achieved more accurate results than two other leading gene function prediction approaches. CONCLUSIONS: The results show that it is possible to improve the accuracy of network-based gene function prediction methods by optimizing combined networks with appropriate similarity measures learned from data. The proposed learning procedure can handle noisy training data and scales well to large genomes.
format Online
Article
Text
id pubmed-3849496
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38494962013-12-06 Predicting gene function using similarity learning Phuong, Tu Minh Nhung, Ngo Phuong BMC Genomics Research BACKGROUND: Computational methods that make use of heterogeneous biological datasets to predict gene function provide a cost-effective and rapid way for annotating genomes. A common framework shared by many such methods is to construct a combined functional association network from multiple networks representing different sources of data, and use this combined network as input to network-based or kernel-based learning algorithms. In these methods, a key factor contributing to the prediction accuracy is the network quality, which is the ability of the network to reflect the functional relatedness of gene pairs. To improve the network quality, a large effort has been spent on developing methods for network integration. These methods, however, produce networks, which then remain unchanged, and nearly no effort has been made to optimize the networks after their construction. RESULTS: Here, we propose an alternative method to improve the network quality. The proposed method takes as input a combined network produced by an existing network integration algorithm, and reconstructs this network to better represent the co-functionality relationships between gene pairs. At the core of the method is a learning algorithm that can learn a measure of functional similarity between genes, which we then use to reconstruct the input network. In experiments with yeast and human, the proposed method produced improved networks and achieved more accurate results than two other leading gene function prediction approaches. CONCLUSIONS: The results show that it is possible to improve the accuracy of network-based gene function prediction methods by optimizing combined networks with appropriate similarity measures learned from data. The proposed learning procedure can handle noisy training data and scales well to large genomes. BioMed Central 2013-10-01 /pmc/articles/PMC3849496/ /pubmed/24266903 http://dx.doi.org/10.1186/1471-2164-14-S4-S4 Text en Copyright © 2013 Phuong and Nhung; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Phuong, Tu Minh
Nhung, Ngo Phuong
Predicting gene function using similarity learning
title Predicting gene function using similarity learning
title_full Predicting gene function using similarity learning
title_fullStr Predicting gene function using similarity learning
title_full_unstemmed Predicting gene function using similarity learning
title_short Predicting gene function using similarity learning
title_sort predicting gene function using similarity learning
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3849496/
https://www.ncbi.nlm.nih.gov/pubmed/24266903
http://dx.doi.org/10.1186/1471-2164-14-S4-S4
work_keys_str_mv AT phuongtuminh predictinggenefunctionusingsimilaritylearning
AT nhungngophuong predictinggenefunctionusingsimilaritylearning