Missing gene identification using functional coherence scores
Reconstructing metabolic and signaling pathways is an effective way of interpreting a genome sequence. A challenge in a pathway reconstruction is that often genes in a pathway cannot be easily found, reflecting current imperfect information of the target organism. In this work, we developed a new me...
Main authors: | Chitale, Meghana; Khan, Ishita K.; Kihara, Daisuke |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Nature Publishing Group 2016 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4995438/ https://www.ncbi.nlm.nih.gov/pubmed/27552989 http://dx.doi.org/10.1038/srep31725 |
_version_ | 1782449471225856000 |
---|---|
author | Chitale, Meghana Khan, Ishita K. Kihara, Daisuke |
author_facet | Chitale, Meghana Khan, Ishita K. Kihara, Daisuke |
author_sort | Chitale, Meghana |
collection | PubMed |
description | Reconstructing metabolic and signaling pathways is an effective way of interpreting a genome sequence. A challenge in pathway reconstruction is that genes in a pathway often cannot be easily found, reflecting currently imperfect information about the target organism. In this work, we developed a new method for finding missing genes, which integrates multiple features, including gene expression, phylogenetic profile, and function association scores. In particular, to consider function association between candidate genes and the proteins neighboring the target missing gene in the network, we used the Co-occurrence Association Score (CAS) and the PubMed Association Score (PAS), which are designed to capture the functional coherence of proteins. We showed that adding CAS and PAS substantially improves the accuracy of identifying missing genes in the yeast enzyme-enzyme network compared to cases where only the conventional features, gene expression and phylogenetic profile, were used. Finally, we also demonstrated that the accuracy improves when indirect neighbors of the target enzyme position in the network are considered using a proper network-topology-based weighting scheme. |
format | Online Article Text |
id | pubmed-4995438 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-49954382016-08-30 Missing gene identification using functional coherence scores Chitale, Meghana Khan, Ishita K. Kihara, Daisuke Sci Rep Article Reconstructing metabolic and signaling pathways is an effective way of interpreting a genome sequence. A challenge in a pathway reconstruction is that often genes in a pathway cannot be easily found, reflecting current imperfect information of the target organism. In this work, we developed a new method for finding missing genes, which integrates multiple features, including gene expression, phylogenetic profile, and function association scores. Particularly, for considering function association between candidate genes and neighboring proteins to the target missing gene in the network, we used Co-occurrence Association Score (CAS) and PubMed Association Score (PAS), which are designed for capturing functional coherence of proteins. We showed that adding CAS and PAS substantially improve the accuracy of identifying missing genes in the yeast enzyme-enzyme network compared to the cases when only the conventional features, gene expression, phylogenetic profile, were used. Finally, it was also demonstrated that the accuracy improves by considering indirect neighbors to the target enzyme position in the network using a proper network-topology-based weighting scheme. Nature Publishing Group 2016-08-24 /pmc/articles/PMC4995438/ /pubmed/27552989 http://dx.doi.org/10.1038/srep31725 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. 
To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Chitale, Meghana Khan, Ishita K. Kihara, Daisuke Missing gene identification using functional coherence scores |
title | Missing gene identification using functional coherence scores |
title_full | Missing gene identification using functional coherence scores |
title_fullStr | Missing gene identification using functional coherence scores |
title_full_unstemmed | Missing gene identification using functional coherence scores |
title_short | Missing gene identification using functional coherence scores |
title_sort | missing gene identification using functional coherence scores |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4995438/ https://www.ncbi.nlm.nih.gov/pubmed/27552989 http://dx.doi.org/10.1038/srep31725 |
work_keys_str_mv | AT chitalemeghana missinggeneidentificationusingfunctionalcoherencescores AT khanishitak missinggeneidentificationusingfunctionalcoherencescores AT kiharadaisuke missinggeneidentificationusingfunctionalcoherencescores |