Missing gene identification using functional coherence scores

Reconstructing metabolic and signaling pathways is an effective way of interpreting a genome sequence. A challenge in pathway reconstruction is that genes in a pathway often cannot be easily found, reflecting the currently imperfect information about the target organism. In this work, we developed a new me...

Bibliographic Details
Main authors: Chitale, Meghana; Khan, Ishita K.; Kihara, Daisuke
Format: Online Article Text
Language: English
Published: Nature Publishing Group 2016
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4995438/
https://www.ncbi.nlm.nih.gov/pubmed/27552989
http://dx.doi.org/10.1038/srep31725
author Chitale, Meghana
Khan, Ishita K.
Kihara, Daisuke
collection PubMed
description Reconstructing metabolic and signaling pathways is an effective way of interpreting a genome sequence. A challenge in pathway reconstruction is that genes in a pathway often cannot be easily found, reflecting the currently imperfect information about the target organism. In this work, we developed a new method for finding missing genes that integrates multiple features, including gene expression, phylogenetic profile, and function association scores. In particular, to account for the function association between candidate genes and the proteins neighboring the target missing gene in the network, we used the Co-occurrence Association Score (CAS) and the PubMed Association Score (PAS), which are designed to capture the functional coherence of proteins. We showed that adding CAS and PAS substantially improves the accuracy of identifying missing genes in the yeast enzyme-enzyme network compared with using only the conventional features (gene expression and phylogenetic profile). Finally, we also demonstrated that the accuracy improves when indirect neighbors of the target enzyme position in the network are considered using a proper network-topology-based weighting scheme.
format Online
Article
Text
id pubmed-4995438
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-4995438 2016-08-30 Sci Rep Article Nature Publishing Group 2016-08-24 /pmc/articles/PMC4995438/ /pubmed/27552989 http://dx.doi.org/10.1038/srep31725 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third-party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
title Missing gene identification using functional coherence scores
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4995438/
https://www.ncbi.nlm.nih.gov/pubmed/27552989
http://dx.doi.org/10.1038/srep31725