Cargando…

Integrating protein structures and precomputed genealogies in the Magnum database: Examples with cellular retinoid binding proteins

BACKGROUND: When accurate models for the divergent evolution of protein sequences are integrated with complementary biological information, such as folded protein structures, analyses of the combined data often lead to new hypotheses about molecular physiology. This represents an excellent example o...

Descripción completa

Detalles Bibliográficos
Autores principales: Bradley, Michael E, Benner, Steven A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1475641/
https://www.ncbi.nlm.nih.gov/pubmed/16504077
http://dx.doi.org/10.1186/1471-2105-7-89
_version_ 1782128133107875840
author Bradley, Michael E
Benner, Steven A
author_facet Bradley, Michael E
Benner, Steven A
author_sort Bradley, Michael E
collection PubMed
description BACKGROUND: When accurate models for the divergent evolution of protein sequences are integrated with complementary biological information, such as folded protein structures, analyses of the combined data often lead to new hypotheses about molecular physiology. This represents an excellent example of how bioinformatics can be used to guide experimental research. However, progress in this direction has been slowed by the lack of a publicly available resource suitable for general use. RESULTS: The precomputed Magnum database offers a solution to this problem for ca. 1,800 full-length protein families with at least one crystal structure. The Magnum deliverables include 1) multiple sequence alignments, 2) mapping of alignment sites to crystal structure sites, 3) phylogenetic trees, 4) inferred ancestral sequences at internal tree nodes, and 5) amino acid replacements along tree branches. Comprehensive evaluations revealed that the automated procedures used to construct Magnum produced accurate models of how proteins divergently evolve, or genealogies, and correctly integrated these with the structural data. To demonstrate Magnum's capabilities, we asked for amino acid replacements requiring three nucleotide substitutions, located at internal protein structure sites, and occurring on short phylogenetic tree branches. In the cellular retinoid binding protein family a site that potentially modulates ligand binding affinity was discovered. Recruitment of cellular retinol binding protein to function as a lens crystallin in the diurnal gecko afforded another opportunity to showcase the predictive value of a browsable database containing branch replacement patterns integrated with protein structures. CONCLUSION: We integrated two areas of protein science, evolution and structure, on a large scale and created a precomputed database, known as Magnum, which is the first freely available resource of its kind. Magnum provides evolutionary and structural bioinformatics resources that are useful for identifying experimentally testable hypotheses about the molecular basis of protein behaviors and functions, as illustrated with the examples from the cellular retinoid binding proteins.
format Text
id pubmed-1475641
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-14756412006-06-12 Integrating protein structures and precomputed genealogies in the Magnum database: Examples with cellular retinoid binding proteins Bradley, Michael E Benner, Steven A BMC Bioinformatics Research Article BACKGROUND: When accurate models for the divergent evolution of protein sequences are integrated with complementary biological information, such as folded protein structures, analyses of the combined data often lead to new hypotheses about molecular physiology. This represents an excellent example of how bioinformatics can be used to guide experimental research. However, progress in this direction has been slowed by the lack of a publicly available resource suitable for general use. RESULTS: The precomputed Magnum database offers a solution to this problem for ca. 1,800 full-length protein families with at least one crystal structure. The Magnum deliverables include 1) multiple sequence alignments, 2) mapping of alignment sites to crystal structure sites, 3) phylogenetic trees, 4) inferred ancestral sequences at internal tree nodes, and 5) amino acid replacements along tree branches. Comprehensive evaluations revealed that the automated procedures used to construct Magnum produced accurate models of how proteins divergently evolve, or genealogies, and correctly integrated these with the structural data. To demonstrate Magnum's capabilities, we asked for amino acid replacements requiring three nucleotide substitutions, located at internal protein structure sites, and occurring on short phylogenetic tree branches. In the cellular retinoid binding protein family a site that potentially modulates ligand binding affinity was discovered. Recruitment of cellular retinol binding protein to function as a lens crystallin in the diurnal gecko afforded another opportunity to showcase the predictive value of a browsable database containing branch replacement patterns integrated with protein structures. CONCLUSION: We integrated two areas of protein science, evolution and structure, on a large scale and created a precomputed database, known as Magnum, which is the first freely available resource of its kind. Magnum provides evolutionary and structural bioinformatics resources that are useful for identifying experimentally testable hypotheses about the molecular basis of protein behaviors and functions, as illustrated with the examples from the cellular retinoid binding proteins. BioMed Central 2006-02-23 /pmc/articles/PMC1475641/ /pubmed/16504077 http://dx.doi.org/10.1186/1471-2105-7-89 Text en Copyright © 2006 Bradley and Benner; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Bradley, Michael E
Benner, Steven A
Integrating protein structures and precomputed genealogies in the Magnum database: Examples with cellular retinoid binding proteins
title Integrating protein structures and precomputed genealogies in the Magnum database: Examples with cellular retinoid binding proteins
title_full Integrating protein structures and precomputed genealogies in the Magnum database: Examples with cellular retinoid binding proteins
title_fullStr Integrating protein structures and precomputed genealogies in the Magnum database: Examples with cellular retinoid binding proteins
title_full_unstemmed Integrating protein structures and precomputed genealogies in the Magnum database: Examples with cellular retinoid binding proteins
title_short Integrating protein structures and precomputed genealogies in the Magnum database: Examples with cellular retinoid binding proteins
title_sort integrating protein structures and precomputed genealogies in the magnum database: examples with cellular retinoid binding proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1475641/
https://www.ncbi.nlm.nih.gov/pubmed/16504077
http://dx.doi.org/10.1186/1471-2105-7-89
work_keys_str_mv AT bradleymichaele integratingproteinstructuresandprecomputedgenealogiesinthemagnumdatabaseexampleswithcellularretinoidbindingproteins
AT bennerstevena integratingproteinstructuresandprecomputedgenealogiesinthemagnumdatabaseexampleswithcellularretinoidbindingproteins