Cargando…

Prediction and identification of sequences coding for orphan enzymes using genomic and metagenomic neighbours

Despite the current wealth of sequencing data, one-third of all biochemically characterized metabolic enzymes lack a corresponding gene or protein sequence, and as such can be considered orphan enzymes. They represent a major gap between our molecular and biochemical knowledge, and consequently are...

Descripción completa

Detalles Bibliográficos
Autores principales: Yamada, Takuji, Waller, Alison S, Raes, Jeroen, Zelezniak, Aleksej, Perchat, Nadia, Perret, Alain, Salanoubat, Marcel, Patil, Kiran R, Weissenbach, Jean, Bork, Peer
Formato: Online Artículo Texto
Lenguaje:English
Publicado: European Molecular Biology Organization 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3377989/
https://www.ncbi.nlm.nih.gov/pubmed/22569339
http://dx.doi.org/10.1038/msb.2012.13
_version_ 1782236013300547584
author Yamada, Takuji
Waller, Alison S
Raes, Jeroen
Zelezniak, Aleksej
Perchat, Nadia
Perret, Alain
Salanoubat, Marcel
Patil, Kiran R
Weissenbach, Jean
Bork, Peer
author_facet Yamada, Takuji
Waller, Alison S
Raes, Jeroen
Zelezniak, Aleksej
Perchat, Nadia
Perret, Alain
Salanoubat, Marcel
Patil, Kiran R
Weissenbach, Jean
Bork, Peer
author_sort Yamada, Takuji
collection PubMed
description Despite the current wealth of sequencing data, one-third of all biochemically characterized metabolic enzymes lack a corresponding gene or protein sequence, and as such can be considered orphan enzymes. They represent a major gap between our molecular and biochemical knowledge, and consequently are not amenable to modern systemic analyses. As 555 of these orphan enzymes have metabolic pathway neighbours, we developed a global framework that utilizes the pathway and (meta)genomic neighbour information to assign candidate sequences to orphan enzymes. For 131 orphan enzymes (37% of those for which (meta)genomic neighbours are available), we associate sequences to them using scoring parameters with an estimated accuracy of 70%, implying functional annotation of 16 345 gene sequences in numerous (meta)genomes. As a case in point, two of these candidate sequences were experimentally validated to encode the predicted activity. In addition, we augmented the currently available genome-scale metabolic models with these new sequence–function associations and were able to expand the models by on average 8%, with a considerable change in the flux connectivity patterns and improved essentiality prediction.
format Online
Article
Text
id pubmed-3377989
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher European Molecular Biology Organization
record_format MEDLINE/PubMed
spelling pubmed-33779892012-06-20 Prediction and identification of sequences coding for orphan enzymes using genomic and metagenomic neighbours Yamada, Takuji Waller, Alison S Raes, Jeroen Zelezniak, Aleksej Perchat, Nadia Perret, Alain Salanoubat, Marcel Patil, Kiran R Weissenbach, Jean Bork, Peer Mol Syst Biol Article Despite the current wealth of sequencing data, one-third of all biochemically characterized metabolic enzymes lack a corresponding gene or protein sequence, and as such can be considered orphan enzymes. They represent a major gap between our molecular and biochemical knowledge, and consequently are not amenable to modern systemic analyses. As 555 of these orphan enzymes have metabolic pathway neighbours, we developed a global framework that utilizes the pathway and (meta)genomic neighbour information to assign candidate sequences to orphan enzymes. For 131 orphan enzymes (37% of those for which (meta)genomic neighbours are available), we associate sequences to them using scoring parameters with an estimated accuracy of 70%, implying functional annotation of 16 345 gene sequences in numerous (meta)genomes. As a case in point, two of these candidate sequences were experimentally validated to encode the predicted activity. In addition, we augmented the currently available genome-scale metabolic models with these new sequence–function associations and were able to expand the models by on average 8%, with a considerable change in the flux connectivity patterns and improved essentiality prediction. European Molecular Biology Organization 2012-05-08 /pmc/articles/PMC3377989/ /pubmed/22569339 http://dx.doi.org/10.1038/msb.2012.13 Text en Copyright © 2012, EMBO and Macmillan Publishers Limited https://creativecommons.org/licenses/by-nc-sa/3.0/This is an open-access article distributed under the terms of the Creative Commons Attribution Noncommercial Share Alike 3.0 Unported License, which allows readers to alter, transform, or build upon the article and then distribute the resulting work under the same or similar license to this one. The work must be attributed back to the original author and commercial use is not permitted without specific permission.
spellingShingle Article
Yamada, Takuji
Waller, Alison S
Raes, Jeroen
Zelezniak, Aleksej
Perchat, Nadia
Perret, Alain
Salanoubat, Marcel
Patil, Kiran R
Weissenbach, Jean
Bork, Peer
Prediction and identification of sequences coding for orphan enzymes using genomic and metagenomic neighbours
title Prediction and identification of sequences coding for orphan enzymes using genomic and metagenomic neighbours
title_full Prediction and identification of sequences coding for orphan enzymes using genomic and metagenomic neighbours
title_fullStr Prediction and identification of sequences coding for orphan enzymes using genomic and metagenomic neighbours
title_full_unstemmed Prediction and identification of sequences coding for orphan enzymes using genomic and metagenomic neighbours
title_short Prediction and identification of sequences coding for orphan enzymes using genomic and metagenomic neighbours
title_sort prediction and identification of sequences coding for orphan enzymes using genomic and metagenomic neighbours
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3377989/
https://www.ncbi.nlm.nih.gov/pubmed/22569339
http://dx.doi.org/10.1038/msb.2012.13
work_keys_str_mv AT yamadatakuji predictionandidentificationofsequencescodingfororphanenzymesusinggenomicandmetagenomicneighbours
AT walleralisons predictionandidentificationofsequencescodingfororphanenzymesusinggenomicandmetagenomicneighbours
AT raesjeroen predictionandidentificationofsequencescodingfororphanenzymesusinggenomicandmetagenomicneighbours
AT zelezniakaleksej predictionandidentificationofsequencescodingfororphanenzymesusinggenomicandmetagenomicneighbours
AT perchatnadia predictionandidentificationofsequencescodingfororphanenzymesusinggenomicandmetagenomicneighbours
AT perretalain predictionandidentificationofsequencescodingfororphanenzymesusinggenomicandmetagenomicneighbours
AT salanoubatmarcel predictionandidentificationofsequencescodingfororphanenzymesusinggenomicandmetagenomicneighbours
AT patilkiranr predictionandidentificationofsequencescodingfororphanenzymesusinggenomicandmetagenomicneighbours
AT weissenbachjean predictionandidentificationofsequencescodingfororphanenzymesusinggenomicandmetagenomicneighbours
AT borkpeer predictionandidentificationofsequencescodingfororphanenzymesusinggenomicandmetagenomicneighbours