Cargando…

A strategy for large-scale comparison of evolutionary- and reaction-based classifications of enzyme function

Determining the molecular function of enzymes discovered by genome sequencing represents a primary foundation for understanding many aspects of biology. Historically, classification of enzyme reactions has used the enzyme nomenclature system developed to describe the overall reactions performed by b...

Descripción completa

Detalles Bibliográficos
Autores principales: Holliday, Gemma L, Brown, Shoshana D, Mischel, David, Polacco, Benjamin J, Babbitt, Patricia C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7246345/
https://www.ncbi.nlm.nih.gov/pubmed/32449511
http://dx.doi.org/10.1093/database/baaa034
_version_ 1783537924679139328
author Holliday, Gemma L
Brown, Shoshana D
Mischel, David
Polacco, Benjamin J
Babbitt, Patricia C
author_facet Holliday, Gemma L
Brown, Shoshana D
Mischel, David
Polacco, Benjamin J
Babbitt, Patricia C
author_sort Holliday, Gemma L
collection PubMed
description Determining the molecular function of enzymes discovered by genome sequencing represents a primary foundation for understanding many aspects of biology. Historically, classification of enzyme reactions has used the enzyme nomenclature system developed to describe the overall reactions performed by biochemically characterized enzymes, irrespective of their associated sequences. In contrast, functional classification and assignment for the millions of protein sequences of unknown function now available is largely done in two computational steps, first by similarity-based assignment of newly obtained sequences to homologous groups, followed by transferring to them the known functions of similar biochemically characterized homologs. Due to the fundamental differences in their etiologies and practice, `how’ these chemistry- and evolution-centric functional classification systems relate to each other has been difficult to explore on a large scale. To investigate this issue in a new way, we integrated two published ontologies that had previously described each of these classification systems independently. The resulting infrastructure was then used to compare the functional assignments obtained from each classification system for the well-studied and functionally diverse enolase superfamily. Mapping these function assignments to protein structure and reaction similarity networks shows a profound and complex disconnect between the homology- and chemistry-based classification systems. This conclusion mirrors previous observations suggesting that except for closely related sequences, facile annotation transfer from small numbers of characterized enzymes to the huge number uncharacterized homologs to which they are related is problematic. Our extension of these comparisons to large enzyme superfamilies in a computationally intelligent manner provides a foundation for new directions in protein function prediction for the huge proportion of sequences of unknown function represented in major databases. Interactive sequence, reaction, substrate and product similarity networks computed for this work for the enolase and two other superfamilies are freely available for download from the Structure Function Linkage Database Archive (http://sfld.rbvi.ucsf.edu).
format Online
Article
Text
id pubmed-7246345
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-72463452020-05-28 A strategy for large-scale comparison of evolutionary- and reaction-based classifications of enzyme function Holliday, Gemma L Brown, Shoshana D Mischel, David Polacco, Benjamin J Babbitt, Patricia C Database (Oxford) Original Article Determining the molecular function of enzymes discovered by genome sequencing represents a primary foundation for understanding many aspects of biology. Historically, classification of enzyme reactions has used the enzyme nomenclature system developed to describe the overall reactions performed by biochemically characterized enzymes, irrespective of their associated sequences. In contrast, functional classification and assignment for the millions of protein sequences of unknown function now available is largely done in two computational steps, first by similarity-based assignment of newly obtained sequences to homologous groups, followed by transferring to them the known functions of similar biochemically characterized homologs. Due to the fundamental differences in their etiologies and practice, `how’ these chemistry- and evolution-centric functional classification systems relate to each other has been difficult to explore on a large scale. To investigate this issue in a new way, we integrated two published ontologies that had previously described each of these classification systems independently. The resulting infrastructure was then used to compare the functional assignments obtained from each classification system for the well-studied and functionally diverse enolase superfamily. Mapping these function assignments to protein structure and reaction similarity networks shows a profound and complex disconnect between the homology- and chemistry-based classification systems. This conclusion mirrors previous observations suggesting that except for closely related sequences, facile annotation transfer from small numbers of characterized enzymes to the huge number uncharacterized homologs to which they are related is problematic. Our extension of these comparisons to large enzyme superfamilies in a computationally intelligent manner provides a foundation for new directions in protein function prediction for the huge proportion of sequences of unknown function represented in major databases. Interactive sequence, reaction, substrate and product similarity networks computed for this work for the enolase and two other superfamilies are freely available for download from the Structure Function Linkage Database Archive (http://sfld.rbvi.ucsf.edu). Oxford University Press 2020-05-25 /pmc/articles/PMC7246345/ /pubmed/32449511 http://dx.doi.org/10.1093/database/baaa034 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Holliday, Gemma L
Brown, Shoshana D
Mischel, David
Polacco, Benjamin J
Babbitt, Patricia C
A strategy for large-scale comparison of evolutionary- and reaction-based classifications of enzyme function
title A strategy for large-scale comparison of evolutionary- and reaction-based classifications of enzyme function
title_full A strategy for large-scale comparison of evolutionary- and reaction-based classifications of enzyme function
title_fullStr A strategy for large-scale comparison of evolutionary- and reaction-based classifications of enzyme function
title_full_unstemmed A strategy for large-scale comparison of evolutionary- and reaction-based classifications of enzyme function
title_short A strategy for large-scale comparison of evolutionary- and reaction-based classifications of enzyme function
title_sort strategy for large-scale comparison of evolutionary- and reaction-based classifications of enzyme function
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7246345/
https://www.ncbi.nlm.nih.gov/pubmed/32449511
http://dx.doi.org/10.1093/database/baaa034
work_keys_str_mv AT hollidaygemmal astrategyforlargescalecomparisonofevolutionaryandreactionbasedclassificationsofenzymefunction
AT brownshoshanad astrategyforlargescalecomparisonofevolutionaryandreactionbasedclassificationsofenzymefunction
AT mischeldavid astrategyforlargescalecomparisonofevolutionaryandreactionbasedclassificationsofenzymefunction
AT polaccobenjaminj astrategyforlargescalecomparisonofevolutionaryandreactionbasedclassificationsofenzymefunction
AT babbittpatriciac astrategyforlargescalecomparisonofevolutionaryandreactionbasedclassificationsofenzymefunction
AT hollidaygemmal strategyforlargescalecomparisonofevolutionaryandreactionbasedclassificationsofenzymefunction
AT brownshoshanad strategyforlargescalecomparisonofevolutionaryandreactionbasedclassificationsofenzymefunction
AT mischeldavid strategyforlargescalecomparisonofevolutionaryandreactionbasedclassificationsofenzymefunction
AT polaccobenjaminj strategyforlargescalecomparisonofevolutionaryandreactionbasedclassificationsofenzymefunction
AT babbittpatriciac strategyforlargescalecomparisonofevolutionaryandreactionbasedclassificationsofenzymefunction