
The Adaptive Evolution Database (TAED)


Bibliographic Details
Main authors: Liberles, David A, Schreiber, David R, Govindarajan, Sridhar, Chamberlin, Stephen G, Benner, Steven A
Format: Text
Language: English
Published: BioMed Central 2001
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC55325/
https://www.ncbi.nlm.nih.gov/pubmed/11532212
Description
Summary:
BACKGROUND: The Master Catalog is a collection of evolutionary families, including multiple sequence alignments, phylogenetic trees and reconstructed ancestral sequences, for all protein-sequence modules encoded by genes in GenBank. It can therefore support large-scale genomic surveys, one of which, The Adaptive Evolution Database (TAED), we present here. In TAED, potential examples of positive adaptation are identified by high values for the normalized ratio of nonsynonymous to synonymous nucleotide substitution rates (K(A)/K(S) values) on branches of an evolutionary tree between nodes representing reconstructed ancestral sequences.

RESULTS: Evolutionary trees and reconstructed ancestral sequences were extracted from the Master Catalog for every subtree containing proteins from the Chordata only or the Embryophyta only. Branches with high K(A)/K(S) values were identified. These represent candidate episodes in the history of the protein family when the protein may have undergone positive selection, where the mutant form conferred more fitness than the ancestral form. Such episodes are frequently associated with change in function. An unexpectedly large number of families (between 10% and 20% of those examined) were found to have at least one branch with a K(A)/K(S) value above arbitrarily chosen cut-offs (1 and 0.6). Most of these survived a robustness test and were collected into TAED.

CONCLUSIONS: TAED is a raw resource for bioinformaticists interested in data mining and for experimental evolutionists seeking candidate examples of adaptive evolution for further experimental study. It can be expanded to include other evolutionary information (for example, changes in gene regulation or splicing) placed in a phylogenetic perspective.
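
The branch screen described in the summary can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the `Branch` record, its field names and the toy values are hypothetical, and it assumes that per-branch K(A) and K(S) estimates, already normalized per site, are available from some upstream step. It only shows how branches might be flagged against the two cut-offs named in the abstract (1 and 0.6) and how the fraction of families with at least one flagged branch could be computed. The robustness test mentioned in the abstract is not modelled.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Set


@dataclass(frozen=True)
class Branch:
    """One tree branch between two (possibly reconstructed) nodes.

    All field names are hypothetical; ka and ks are assumed to be
    per-site nonsynonymous and synonymous substitution rates.
    """
    family: str
    parent: str
    child: str
    ka: float
    ks: float


def ka_ks(branch: Branch) -> Optional[float]:
    """Return K(A)/K(S), or None when K(S) is zero and the ratio is undefined."""
    if branch.ks <= 0:
        return None
    return branch.ka / branch.ks


def flag_branches(branches: Iterable[Branch], cutoff: float) -> List[Branch]:
    """Keep branches whose K(A)/K(S) ratio is strictly above the cut-off."""
    flagged = []
    for b in branches:
        ratio = ka_ks(b)
        if ratio is not None and ratio > cutoff:
            flagged.append(b)
    return flagged


def families_with_hits(branches: Iterable[Branch], cutoff: float) -> Set[str]:
    """Families that have at least one branch above the cut-off."""
    return {b.family for b in flag_branches(branches, cutoff)}


# Toy data, invented purely for illustration.
branches = [
    Branch("famA", "n1", "n2", ka=0.12, ks=0.10),
    Branch("famA", "n2", "n3", ka=0.02, ks=0.20),
    Branch("famB", "n1", "n2", ka=0.07, ks=0.10),
    Branch("famC", "n1", "n2", ka=0.05, ks=0.00),  # ratio undefined, skipped
]

all_families = {b.family for b in branches}
for cutoff in (1.0, 0.6):
    hits = families_with_hits(branches, cutoff)
    fraction = len(hits) / len(all_families)
    print(f"cut-off {cutoff}: {sorted(hits)} ({fraction:.0%} of families)")
```

Branches with K(S) equal to zero are skipped here rather than treated as infinitely high ratios; that is a design choice of this sketch, and a real analysis would need an explicit policy for such branches.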