Cargando…
Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases
BACKGROUND: Clan AA of aspartic peptidases relates the family of pepsin monomers evolutionarily with all dimeric peptidases encoded by eukaryotic LTR retroelements. Recent findings describing various pools of single-domain nonviral host peptidases, in prokaryotes and eukaryotes, indicate that the di...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2642776/ https://www.ncbi.nlm.nih.gov/pubmed/19173708 http://dx.doi.org/10.1186/1745-6150-4-3 |
_version_ | 1782164649726181376 |
---|---|
author | Llorens, Carlos Futami, Ricardo Renaud, Gabriel Moya, Andrés |
author_facet | Llorens, Carlos Futami, Ricardo Renaud, Gabriel Moya, Andrés |
author_sort | Llorens, Carlos |
collection | PubMed |
description | BACKGROUND: Clan AA of aspartic peptidases relates the family of pepsin monomers evolutionarily with all dimeric peptidases encoded by eukaryotic LTR retroelements. Recent findings describing various pools of single-domain nonviral host peptidases, in prokaryotes and eukaryotes, indicate that the diversity of clan AA is larger than previously thought. The ensuing approach to investigate this enzyme group is by studying its phylogeny. However, clan AA is a difficult case to study due to the low similarity and different rates of evolution. This work is an ongoing attempt to investigate the different clan AA families to understand the cause of their diversity. RESULTS: In this paper, we describe in-progress database and bioinformatic flowchart designed to characterize the clan AA protein domain based on all possible protein families through ancestral reconstructions, sequence logos, and hidden markov models (HMMs). The flowchart includes the characterization of a major consensus sequence based on 6 amino acid patterns with correspondence with Andreeva's model, the structural template describing the clan AA peptidase fold. The set of tools is work in progress we have organized in a database within the GyDB project, referred to as Clan AA Reference Database . CONCLUSION: The pre-existing classification combined with the evolutionary history of LTR retroelements permits a consistent taxonomical collection of sequence logos and HMMs. This set is useful for gene annotation but also a reference to evaluate the diversity of, and the relationships among, the different families. Comparisons among HMMs suggest a common ancestor for all dimeric clan AA peptidases that is halfway between single-domain nonviral peptidases and those coded by Ty3/Gypsy LTR retroelements. Sequence logos reveal how all clan AA families follow similar protein domain architecture related to the peptidase fold. In particular, each family nucleates a particular consensus motif in the sequence position related to the flap. The different motifs constitute a network where an alanine-asparagine-like variable motif predominates, instead of the canonical flap of the HIV-1 peptidase and closer relatives. REVIEWERS: This article was reviewed by Daniel H. Haft, Vladimir Kapitonov (nominated by Jerry Jurka), and Ben M. Dunn (nominated by Claus Wilke). |
format | Text |
id | pubmed-2642776 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26427762009-02-14 Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases Llorens, Carlos Futami, Ricardo Renaud, Gabriel Moya, Andrés Biol Direct Research BACKGROUND: Clan AA of aspartic peptidases relates the family of pepsin monomers evolutionarily with all dimeric peptidases encoded by eukaryotic LTR retroelements. Recent findings describing various pools of single-domain nonviral host peptidases, in prokaryotes and eukaryotes, indicate that the diversity of clan AA is larger than previously thought. The ensuing approach to investigate this enzyme group is by studying its phylogeny. However, clan AA is a difficult case to study due to the low similarity and different rates of evolution. This work is an ongoing attempt to investigate the different clan AA families to understand the cause of their diversity. RESULTS: In this paper, we describe in-progress database and bioinformatic flowchart designed to characterize the clan AA protein domain based on all possible protein families through ancestral reconstructions, sequence logos, and hidden markov models (HMMs). The flowchart includes the characterization of a major consensus sequence based on 6 amino acid patterns with correspondence with Andreeva's model, the structural template describing the clan AA peptidase fold. The set of tools is work in progress we have organized in a database within the GyDB project, referred to as Clan AA Reference Database . CONCLUSION: The pre-existing classification combined with the evolutionary history of LTR retroelements permits a consistent taxonomical collection of sequence logos and HMMs. This set is useful for gene annotation but also a reference to evaluate the diversity of, and the relationships among, the different families. Comparisons among HMMs suggest a common ancestor for all dimeric clan AA peptidases that is halfway between single-domain nonviral peptidases and those coded by Ty3/Gypsy LTR retroelements. Sequence logos reveal how all clan AA families follow similar protein domain architecture related to the peptidase fold. In particular, each family nucleates a particular consensus motif in the sequence position related to the flap. The different motifs constitute a network where an alanine-asparagine-like variable motif predominates, instead of the canonical flap of the HIV-1 peptidase and closer relatives. REVIEWERS: This article was reviewed by Daniel H. Haft, Vladimir Kapitonov (nominated by Jerry Jurka), and Ben M. Dunn (nominated by Claus Wilke). BioMed Central 2009-01-27 /pmc/articles/PMC2642776/ /pubmed/19173708 http://dx.doi.org/10.1186/1745-6150-4-3 Text en Copyright © 2009 Llorens et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Llorens, Carlos Futami, Ricardo Renaud, Gabriel Moya, Andrés Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases |
title | Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases |
title_full | Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases |
title_fullStr | Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases |
title_full_unstemmed | Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases |
title_short | Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases |
title_sort | bioinformatic flowchart and database to investigate the origins and diversity of clan aa peptidases |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2642776/ https://www.ncbi.nlm.nih.gov/pubmed/19173708 http://dx.doi.org/10.1186/1745-6150-4-3 |
work_keys_str_mv | AT llorenscarlos bioinformaticflowchartanddatabasetoinvestigatetheoriginsanddiversityofclanaapeptidases AT futamiricardo bioinformaticflowchartanddatabasetoinvestigatetheoriginsanddiversityofclanaapeptidases AT renaudgabriel bioinformaticflowchartanddatabasetoinvestigatetheoriginsanddiversityofclanaapeptidases AT moyaandres bioinformaticflowchartanddatabasetoinvestigatetheoriginsanddiversityofclanaapeptidases |