Cargando…

Reconstruction of ancestral protein sequences and its applications

BACKGROUND: Modern-day proteins were selected during long evolutionary history as descendants of ancient life forms. In silico reconstruction of such ancestral protein sequences facilitates our understanding of evolutionary processes, protein classification and biological function. Additionally, rec...

Descripción completa

Detalles Bibliográficos
Autores principales: Cai, Wei, Pei, Jimin, Grishin, Nick V
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC522809/
https://www.ncbi.nlm.nih.gov/pubmed/15377393
http://dx.doi.org/10.1186/1471-2148-4-33
_version_ 1782121851803140096
author Cai, Wei
Pei, Jimin
Grishin, Nick V
author_facet Cai, Wei
Pei, Jimin
Grishin, Nick V
author_sort Cai, Wei
collection PubMed
description BACKGROUND: Modern-day proteins were selected during long evolutionary history as descendants of ancient life forms. In silico reconstruction of such ancestral protein sequences facilitates our understanding of evolutionary processes, protein classification and biological function. Additionally, reconstructed ancestral protein sequences could serve to fill in sequence space thus aiding remote homology inference. RESULTS: We developed ANCESCON, a package for distance-based phylogenetic inference and reconstruction of ancestral protein sequences that takes into account the observed variation of evolutionary rates between positions that more precisely describes the evolution of protein families. To improve the accuracy of evolutionary distance estimation and ancestral sequence reconstruction, two approaches are proposed to estimate position-specific evolutionary rates. Comparisons show that at large evolutionary distances our method gives more accurate ancestral sequence reconstruction than PAML, PHYLIP and PAUP*. We apply the reconstructed ancestral sequences to homology inference and functional site prediction. We show that the usage of hypothetical ancestors together with the present day sequences improves profile-based sequence similarity searches; and that ancestral sequence reconstruction methods can be used to predict positions with functional specificity. CONCLUSIONS: As a computational tool to reconstruct ancestral protein sequences from a given multiple sequence alignment, ANCESCON shows high accuracy in tests and helps detection of remote homologs and prediction of functional sites. ANCESCON is freely available for non-commercial use. Pre-compiled versions for several platforms can be downloaded from .
format Text
id pubmed-522809
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5228092004-10-17 Reconstruction of ancestral protein sequences and its applications Cai, Wei Pei, Jimin Grishin, Nick V BMC Evol Biol Methodology Article BACKGROUND: Modern-day proteins were selected during long evolutionary history as descendants of ancient life forms. In silico reconstruction of such ancestral protein sequences facilitates our understanding of evolutionary processes, protein classification and biological function. Additionally, reconstructed ancestral protein sequences could serve to fill in sequence space thus aiding remote homology inference. RESULTS: We developed ANCESCON, a package for distance-based phylogenetic inference and reconstruction of ancestral protein sequences that takes into account the observed variation of evolutionary rates between positions that more precisely describes the evolution of protein families. To improve the accuracy of evolutionary distance estimation and ancestral sequence reconstruction, two approaches are proposed to estimate position-specific evolutionary rates. Comparisons show that at large evolutionary distances our method gives more accurate ancestral sequence reconstruction than PAML, PHYLIP and PAUP*. We apply the reconstructed ancestral sequences to homology inference and functional site prediction. We show that the usage of hypothetical ancestors together with the present day sequences improves profile-based sequence similarity searches; and that ancestral sequence reconstruction methods can be used to predict positions with functional specificity. CONCLUSIONS: As a computational tool to reconstruct ancestral protein sequences from a given multiple sequence alignment, ANCESCON shows high accuracy in tests and helps detection of remote homologs and prediction of functional sites. ANCESCON is freely available for non-commercial use. Pre-compiled versions for several platforms can be downloaded from . BioMed Central 2004-09-17 /pmc/articles/PMC522809/ /pubmed/15377393 http://dx.doi.org/10.1186/1471-2148-4-33 Text en Copyright © 2004 Cai et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open-access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Cai, Wei
Pei, Jimin
Grishin, Nick V
Reconstruction of ancestral protein sequences and its applications
title Reconstruction of ancestral protein sequences and its applications
title_full Reconstruction of ancestral protein sequences and its applications
title_fullStr Reconstruction of ancestral protein sequences and its applications
title_full_unstemmed Reconstruction of ancestral protein sequences and its applications
title_short Reconstruction of ancestral protein sequences and its applications
title_sort reconstruction of ancestral protein sequences and its applications
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC522809/
https://www.ncbi.nlm.nih.gov/pubmed/15377393
http://dx.doi.org/10.1186/1471-2148-4-33
work_keys_str_mv AT caiwei reconstructionofancestralproteinsequencesanditsapplications
AT peijimin reconstructionofancestralproteinsequencesanditsapplications
AT grishinnickv reconstructionofancestralproteinsequencesanditsapplications