Cargando…

The Adaptive Evolution Database (TAED)

BACKGROUND: The Master Catalog is a collection of evolutionary families, including multiple sequence alignments, phylogenetic trees and reconstructed ancestral sequences, for all protein-sequence modules encoded by genes in GenBank. It can therefore support large-scale genomic surveys, of which we p...

Descripción completa

Detalles Bibliográficos
Autores principales: Liberles, David A, Schreiber, David R, Govindarajan, Sridhar, Chamberlin, Stephen G, Benner, Steven A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2001
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC55325/
https://www.ncbi.nlm.nih.gov/pubmed/11532212
_version_ 1782120030745395200
author Liberles, David A
Schreiber, David R
Govindarajan, Sridhar
Chamberlin, Stephen G
Benner, Steven A
author_facet Liberles, David A
Schreiber, David R
Govindarajan, Sridhar
Chamberlin, Stephen G
Benner, Steven A
author_sort Liberles, David A
collection PubMed
description BACKGROUND: The Master Catalog is a collection of evolutionary families, including multiple sequence alignments, phylogenetic trees and reconstructed ancestral sequences, for all protein-sequence modules encoded by genes in GenBank. It can therefore support large-scale genomic surveys, of which we present here The Adaptive Evolution Database (TAED). In TAED, potential examples of positive adaptation are identified by high values for the normalized ratio of nonsynonymous to synonymous nucleotide substitution rates (K(A)/K(S) values) on branches of an evolutionary tree between nodes representing reconstructed ancestral sequences. RESULTS: Evolutionary trees and reconstructed ancestral sequences were extracted from the Master Catalog for every subtree containing proteins from the Chordata only or the Embryophyta only. Branches with high K(A)/K(S) values were identified. These represent candidate episodes in the history of the protein family when the protein may have undergone positive selection, where the mutant form conferred more fitness than the ancestral form. Such episodes are frequently associated with change in function. An unexpectedly large number of families (between 10% and 20% of those families examined) were found to have at least one branch with high K(A)/K(S) values above arbitrarily chosen cut-offs (1 and 0.6). Most of these survived a robustness test and were collected into TAED. CONCLUSIONS: TAED is a raw resource for bioinformaticists interested in data mining and for experimental evolutionists seeking candidate examples of adaptive evolution for further experimental study. It can be expanded to include other evolutionary information (for example changes in gene regulation or splicing) placed in a phylogenetic perspective.
format Text
id pubmed-55325
institution National Center for Biotechnology Information
language English
publishDate 2001
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-553252001-09-10 The Adaptive Evolution Database (TAED) Liberles, David A Schreiber, David R Govindarajan, Sridhar Chamberlin, Stephen G Benner, Steven A Genome Biol Research BACKGROUND: The Master Catalog is a collection of evolutionary families, including multiple sequence alignments, phylogenetic trees and reconstructed ancestral sequences, for all protein-sequence modules encoded by genes in GenBank. It can therefore support large-scale genomic surveys, of which we present here The Adaptive Evolution Database (TAED). In TAED, potential examples of positive adaptation are identified by high values for the normalized ratio of nonsynonymous to synonymous nucleotide substitution rates (K(A)/K(S) values) on branches of an evolutionary tree between nodes representing reconstructed ancestral sequences. RESULTS: Evolutionary trees and reconstructed ancestral sequences were extracted from the Master Catalog for every subtree containing proteins from the Chordata only or the Embryophyta only. Branches with high K(A)/K(S) values were identified. These represent candidate episodes in the history of the protein family when the protein may have undergone positive selection, where the mutant form conferred more fitness than the ancestral form. Such episodes are frequently associated with change in function. An unexpectedly large number of families (between 10% and 20% of those families examined) were found to have at least one branch with high K(A)/K(S) values above arbitrarily chosen cut-offs (1 and 0.6). Most of these survived a robustness test and were collected into TAED. CONCLUSIONS: TAED is a raw resource for bioinformaticists interested in data mining and for experimental evolutionists seeking candidate examples of adaptive evolution for further experimental study. It can be expanded to include other evolutionary information (for example changes in gene regulation or splicing) placed in a phylogenetic perspective. BioMed Central 2001 2001-07-24 /pmc/articles/PMC55325/ /pubmed/11532212 Text en Copyright © 2001 Liberles et al., licensee BioMed Central Ltd
spellingShingle Research
Liberles, David A
Schreiber, David R
Govindarajan, Sridhar
Chamberlin, Stephen G
Benner, Steven A
The Adaptive Evolution Database (TAED)
title The Adaptive Evolution Database (TAED)
title_full The Adaptive Evolution Database (TAED)
title_fullStr The Adaptive Evolution Database (TAED)
title_full_unstemmed The Adaptive Evolution Database (TAED)
title_short The Adaptive Evolution Database (TAED)
title_sort adaptive evolution database (taed)
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC55325/
https://www.ncbi.nlm.nih.gov/pubmed/11532212
work_keys_str_mv AT liberlesdavida theadaptiveevolutiondatabasetaed
AT schreiberdavidr theadaptiveevolutiondatabasetaed
AT govindarajansridhar theadaptiveevolutiondatabasetaed
AT chamberlinstepheng theadaptiveevolutiondatabasetaed
AT bennerstevena theadaptiveevolutiondatabasetaed
AT liberlesdavida adaptiveevolutiondatabasetaed
AT schreiberdavidr adaptiveevolutiondatabasetaed
AT govindarajansridhar adaptiveevolutiondatabasetaed
AT chamberlinstepheng adaptiveevolutiondatabasetaed
AT bennerstevena adaptiveevolutiondatabasetaed