Cargando…

Identifying metabolic enzymes with multiple types of association evidence

BACKGROUND: Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions which can not be attributed to any gene in that organism. Existing computational strategies for identifying such missing genes rely primarily on sequence homology to known enzyme-encoding ge...

Descripción completa

Detalles Bibliográficos
Autores principales: Kharchenko, Peter, Chen, Lifeng, Freund, Yoav, Vitkup, Dennis, Church, George M
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1450304/
https://www.ncbi.nlm.nih.gov/pubmed/16571130
http://dx.doi.org/10.1186/1471-2105-7-177
_version_ 1782127393273544704
author Kharchenko, Peter
Chen, Lifeng
Freund, Yoav
Vitkup, Dennis
Church, George M
author_facet Kharchenko, Peter
Chen, Lifeng
Freund, Yoav
Vitkup, Dennis
Church, George M
author_sort Kharchenko, Peter
collection PubMed
description BACKGROUND: Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions which can not be attributed to any gene in that organism. Existing computational strategies for identifying such missing genes rely primarily on sequence homology to known enzyme-encoding genes. RESULTS: We present a novel method for identifying genes encoding for a specific metabolic function based on a local structure of metabolic network and multiple types of functional association evidence, including clustering of genes on the chromosome, similarity of phylogenetic profiles, gene expression, protein fusion events and others. Using E. coli and S. cerevisiae metabolic networks, we illustrate predictive ability of each individual type of association evidence and show that significantly better predictions can be obtained based on the combination of all data. In this way our method is able to predict 60% of enzyme-encoding genes of E. coli metabolism within the top 10 (out of 3551) candidates for their enzymatic function, and as a top candidate within 43% of the cases. CONCLUSION: We illustrate that a combination of genome context and other functional association evidence is effective in predicting genes encoding metabolic enzymes. Our approach does not rely on direct sequence homology to known enzyme-encoding genes, and can be used in conjunction with traditional homology-based metabolic reconstruction methods. The method can also be used to target orphan metabolic activities.
format Text
id pubmed-1450304
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-14503042006-05-01 Identifying metabolic enzymes with multiple types of association evidence Kharchenko, Peter Chen, Lifeng Freund, Yoav Vitkup, Dennis Church, George M BMC Bioinformatics Methodology Article BACKGROUND: Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions which can not be attributed to any gene in that organism. Existing computational strategies for identifying such missing genes rely primarily on sequence homology to known enzyme-encoding genes. RESULTS: We present a novel method for identifying genes encoding for a specific metabolic function based on a local structure of metabolic network and multiple types of functional association evidence, including clustering of genes on the chromosome, similarity of phylogenetic profiles, gene expression, protein fusion events and others. Using E. coli and S. cerevisiae metabolic networks, we illustrate predictive ability of each individual type of association evidence and show that significantly better predictions can be obtained based on the combination of all data. In this way our method is able to predict 60% of enzyme-encoding genes of E. coli metabolism within the top 10 (out of 3551) candidates for their enzymatic function, and as a top candidate within 43% of the cases. CONCLUSION: We illustrate that a combination of genome context and other functional association evidence is effective in predicting genes encoding metabolic enzymes. Our approach does not rely on direct sequence homology to known enzyme-encoding genes, and can be used in conjunction with traditional homology-based metabolic reconstruction methods. The method can also be used to target orphan metabolic activities. BioMed Central 2006-03-29 /pmc/articles/PMC1450304/ /pubmed/16571130 http://dx.doi.org/10.1186/1471-2105-7-177 Text en Copyright © 2006 Kharchenko et al; licensee BioMed Central Ltd.
spellingShingle Methodology Article
Kharchenko, Peter
Chen, Lifeng
Freund, Yoav
Vitkup, Dennis
Church, George M
Identifying metabolic enzymes with multiple types of association evidence
title Identifying metabolic enzymes with multiple types of association evidence
title_full Identifying metabolic enzymes with multiple types of association evidence
title_fullStr Identifying metabolic enzymes with multiple types of association evidence
title_full_unstemmed Identifying metabolic enzymes with multiple types of association evidence
title_short Identifying metabolic enzymes with multiple types of association evidence
title_sort identifying metabolic enzymes with multiple types of association evidence
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1450304/
https://www.ncbi.nlm.nih.gov/pubmed/16571130
http://dx.doi.org/10.1186/1471-2105-7-177
work_keys_str_mv AT kharchenkopeter identifyingmetabolicenzymeswithmultipletypesofassociationevidence
AT chenlifeng identifyingmetabolicenzymeswithmultipletypesofassociationevidence
AT freundyoav identifyingmetabolicenzymeswithmultipletypesofassociationevidence
AT vitkupdennis identifyingmetabolicenzymeswithmultipletypesofassociationevidence
AT churchgeorgem identifyingmetabolicenzymeswithmultipletypesofassociationevidence