Cargando…

Integrating Information in Biological Ontologies and Molecular Networks to Infer Novel Terms

Currently most terms and term-term relationships in Gene Ontology (GO) are defined manually, which creates cost, consistency and completeness issues. Recent studies have demonstrated the feasibility of inferring GO automatically from biological networks, which represents an important complementary a...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Le, Yip, Kevin Y.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5157009/
https://www.ncbi.nlm.nih.gov/pubmed/27976738
http://dx.doi.org/10.1038/srep39237
_version_ 1782481371900411904
author Li, Le
Yip, Kevin Y.
author_facet Li, Le
Yip, Kevin Y.
author_sort Li, Le
collection PubMed
description Currently most terms and term-term relationships in Gene Ontology (GO) are defined manually, which creates cost, consistency and completeness issues. Recent studies have demonstrated the feasibility of inferring GO automatically from biological networks, which represents an important complementary approach to GO construction. These methods (NeXO and CliXO) are unsupervised, which means 1) they cannot use the information contained in existing GO, 2) the way they integrate biological networks may not optimize the accuracy, and 3) they are not customized to infer the three different sub-ontologies of GO. Here we present a semi-supervised method called Unicorn that extends these previous methods to tackle the three problems. Unicorn uses a sub-tree of an existing GO sub-ontology as training part to learn parameters in integrating multiple networks. Cross-validation results show that Unicorn reliably inferred the left-out parts of each specific GO sub-ontology. In addition, by training Unicorn with an old version of GO together with biological networks, it successfully re-discovered some terms and term-term relationships present only in a new version of GO. Unicorn also successfully inferred some novel terms that were not contained in GO but have biological meanings well-supported by the literature.Availability: Source code of Unicorn is available at http://yiplab.cse.cuhk.edu.hk/unicorn/.
format Online
Article
Text
id pubmed-5157009
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-51570092016-12-20 Integrating Information in Biological Ontologies and Molecular Networks to Infer Novel Terms Li, Le Yip, Kevin Y. Sci Rep Article Currently most terms and term-term relationships in Gene Ontology (GO) are defined manually, which creates cost, consistency and completeness issues. Recent studies have demonstrated the feasibility of inferring GO automatically from biological networks, which represents an important complementary approach to GO construction. These methods (NeXO and CliXO) are unsupervised, which means 1) they cannot use the information contained in existing GO, 2) the way they integrate biological networks may not optimize the accuracy, and 3) they are not customized to infer the three different sub-ontologies of GO. Here we present a semi-supervised method called Unicorn that extends these previous methods to tackle the three problems. Unicorn uses a sub-tree of an existing GO sub-ontology as training part to learn parameters in integrating multiple networks. Cross-validation results show that Unicorn reliably inferred the left-out parts of each specific GO sub-ontology. In addition, by training Unicorn with an old version of GO together with biological networks, it successfully re-discovered some terms and term-term relationships present only in a new version of GO. Unicorn also successfully inferred some novel terms that were not contained in GO but have biological meanings well-supported by the literature.Availability: Source code of Unicorn is available at http://yiplab.cse.cuhk.edu.hk/unicorn/. Nature Publishing Group 2016-12-15 /pmc/articles/PMC5157009/ /pubmed/27976738 http://dx.doi.org/10.1038/srep39237 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Li, Le
Yip, Kevin Y.
Integrating Information in Biological Ontologies and Molecular Networks to Infer Novel Terms
title Integrating Information in Biological Ontologies and Molecular Networks to Infer Novel Terms
title_full Integrating Information in Biological Ontologies and Molecular Networks to Infer Novel Terms
title_fullStr Integrating Information in Biological Ontologies and Molecular Networks to Infer Novel Terms
title_full_unstemmed Integrating Information in Biological Ontologies and Molecular Networks to Infer Novel Terms
title_short Integrating Information in Biological Ontologies and Molecular Networks to Infer Novel Terms
title_sort integrating information in biological ontologies and molecular networks to infer novel terms
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5157009/
https://www.ncbi.nlm.nih.gov/pubmed/27976738
http://dx.doi.org/10.1038/srep39237
work_keys_str_mv AT lile integratinginformationinbiologicalontologiesandmolecularnetworkstoinfernovelterms
AT yipkeviny integratinginformationinbiologicalontologiesandmolecularnetworkstoinfernovelterms