Cargando…
Identifying metabolic enzymes with multiple types of association evidence
BACKGROUND: Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions which can not be attributed to any gene in that organism. Existing computational strategies for identifying such missing genes rely primarily on sequence homology to known enzyme-encoding ge...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1450304/ https://www.ncbi.nlm.nih.gov/pubmed/16571130 http://dx.doi.org/10.1186/1471-2105-7-177 |
_version_ | 1782127393273544704 |
---|---|
author | Kharchenko, Peter Chen, Lifeng Freund, Yoav Vitkup, Dennis Church, George M |
author_facet | Kharchenko, Peter Chen, Lifeng Freund, Yoav Vitkup, Dennis Church, George M |
author_sort | Kharchenko, Peter |
collection | PubMed |
description | BACKGROUND: Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions which can not be attributed to any gene in that organism. Existing computational strategies for identifying such missing genes rely primarily on sequence homology to known enzyme-encoding genes. RESULTS: We present a novel method for identifying genes encoding for a specific metabolic function based on a local structure of metabolic network and multiple types of functional association evidence, including clustering of genes on the chromosome, similarity of phylogenetic profiles, gene expression, protein fusion events and others. Using E. coli and S. cerevisiae metabolic networks, we illustrate predictive ability of each individual type of association evidence and show that significantly better predictions can be obtained based on the combination of all data. In this way our method is able to predict 60% of enzyme-encoding genes of E. coli metabolism within the top 10 (out of 3551) candidates for their enzymatic function, and as a top candidate within 43% of the cases. CONCLUSION: We illustrate that a combination of genome context and other functional association evidence is effective in predicting genes encoding metabolic enzymes. Our approach does not rely on direct sequence homology to known enzyme-encoding genes, and can be used in conjunction with traditional homology-based metabolic reconstruction methods. The method can also be used to target orphan metabolic activities. |
format | Text |
id | pubmed-1450304 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-14503042006-05-01 Identifying metabolic enzymes with multiple types of association evidence Kharchenko, Peter Chen, Lifeng Freund, Yoav Vitkup, Dennis Church, George M BMC Bioinformatics Methodology Article BACKGROUND: Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions which can not be attributed to any gene in that organism. Existing computational strategies for identifying such missing genes rely primarily on sequence homology to known enzyme-encoding genes. RESULTS: We present a novel method for identifying genes encoding for a specific metabolic function based on a local structure of metabolic network and multiple types of functional association evidence, including clustering of genes on the chromosome, similarity of phylogenetic profiles, gene expression, protein fusion events and others. Using E. coli and S. cerevisiae metabolic networks, we illustrate predictive ability of each individual type of association evidence and show that significantly better predictions can be obtained based on the combination of all data. In this way our method is able to predict 60% of enzyme-encoding genes of E. coli metabolism within the top 10 (out of 3551) candidates for their enzymatic function, and as a top candidate within 43% of the cases. CONCLUSION: We illustrate that a combination of genome context and other functional association evidence is effective in predicting genes encoding metabolic enzymes. Our approach does not rely on direct sequence homology to known enzyme-encoding genes, and can be used in conjunction with traditional homology-based metabolic reconstruction methods. The method can also be used to target orphan metabolic activities. BioMed Central 2006-03-29 /pmc/articles/PMC1450304/ /pubmed/16571130 http://dx.doi.org/10.1186/1471-2105-7-177 Text en Copyright © 2006 Kharchenko et al; licensee BioMed Central Ltd. |
spellingShingle | Methodology Article Kharchenko, Peter Chen, Lifeng Freund, Yoav Vitkup, Dennis Church, George M Identifying metabolic enzymes with multiple types of association evidence |
title | Identifying metabolic enzymes with multiple types of association evidence |
title_full | Identifying metabolic enzymes with multiple types of association evidence |
title_fullStr | Identifying metabolic enzymes with multiple types of association evidence |
title_full_unstemmed | Identifying metabolic enzymes with multiple types of association evidence |
title_short | Identifying metabolic enzymes with multiple types of association evidence |
title_sort | identifying metabolic enzymes with multiple types of association evidence |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1450304/ https://www.ncbi.nlm.nih.gov/pubmed/16571130 http://dx.doi.org/10.1186/1471-2105-7-177 |
work_keys_str_mv | AT kharchenkopeter identifyingmetabolicenzymeswithmultipletypesofassociationevidence AT chenlifeng identifyingmetabolicenzymeswithmultipletypesofassociationevidence AT freundyoav identifyingmetabolicenzymeswithmultipletypesofassociationevidence AT vitkupdennis identifyingmetabolicenzymeswithmultipletypesofassociationevidence AT churchgeorgem identifyingmetabolicenzymeswithmultipletypesofassociationevidence |