Cargando…

Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases

BACKGROUND: Clan AA of aspartic peptidases relates the family of pepsin monomers evolutionarily with all dimeric peptidases encoded by eukaryotic LTR retroelements. Recent findings describing various pools of single-domain nonviral host peptidases, in prokaryotes and eukaryotes, indicate that the di...

Descripción completa

Detalles Bibliográficos
Autores principales: Llorens, Carlos, Futami, Ricardo, Renaud, Gabriel, Moya, Andrés
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2642776/
https://www.ncbi.nlm.nih.gov/pubmed/19173708
http://dx.doi.org/10.1186/1745-6150-4-3
_version_ 1782164649726181376
author Llorens, Carlos
Futami, Ricardo
Renaud, Gabriel
Moya, Andrés
author_facet Llorens, Carlos
Futami, Ricardo
Renaud, Gabriel
Moya, Andrés
author_sort Llorens, Carlos
collection PubMed
description BACKGROUND: Clan AA of aspartic peptidases relates the family of pepsin monomers evolutionarily with all dimeric peptidases encoded by eukaryotic LTR retroelements. Recent findings describing various pools of single-domain nonviral host peptidases, in prokaryotes and eukaryotes, indicate that the diversity of clan AA is larger than previously thought. The ensuing approach to investigate this enzyme group is by studying its phylogeny. However, clan AA is a difficult case to study due to the low similarity and different rates of evolution. This work is an ongoing attempt to investigate the different clan AA families to understand the cause of their diversity. RESULTS: In this paper, we describe in-progress database and bioinformatic flowchart designed to characterize the clan AA protein domain based on all possible protein families through ancestral reconstructions, sequence logos, and hidden markov models (HMMs). The flowchart includes the characterization of a major consensus sequence based on 6 amino acid patterns with correspondence with Andreeva's model, the structural template describing the clan AA peptidase fold. The set of tools is work in progress we have organized in a database within the GyDB project, referred to as Clan AA Reference Database . CONCLUSION: The pre-existing classification combined with the evolutionary history of LTR retroelements permits a consistent taxonomical collection of sequence logos and HMMs. This set is useful for gene annotation but also a reference to evaluate the diversity of, and the relationships among, the different families. Comparisons among HMMs suggest a common ancestor for all dimeric clan AA peptidases that is halfway between single-domain nonviral peptidases and those coded by Ty3/Gypsy LTR retroelements. Sequence logos reveal how all clan AA families follow similar protein domain architecture related to the peptidase fold. In particular, each family nucleates a particular consensus motif in the sequence position related to the flap. The different motifs constitute a network where an alanine-asparagine-like variable motif predominates, instead of the canonical flap of the HIV-1 peptidase and closer relatives. REVIEWERS: This article was reviewed by Daniel H. Haft, Vladimir Kapitonov (nominated by Jerry Jurka), and Ben M. Dunn (nominated by Claus Wilke).
format Text
id pubmed-2642776
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26427762009-02-14 Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases Llorens, Carlos Futami, Ricardo Renaud, Gabriel Moya, Andrés Biol Direct Research BACKGROUND: Clan AA of aspartic peptidases relates the family of pepsin monomers evolutionarily with all dimeric peptidases encoded by eukaryotic LTR retroelements. Recent findings describing various pools of single-domain nonviral host peptidases, in prokaryotes and eukaryotes, indicate that the diversity of clan AA is larger than previously thought. The ensuing approach to investigate this enzyme group is by studying its phylogeny. However, clan AA is a difficult case to study due to the low similarity and different rates of evolution. This work is an ongoing attempt to investigate the different clan AA families to understand the cause of their diversity. RESULTS: In this paper, we describe in-progress database and bioinformatic flowchart designed to characterize the clan AA protein domain based on all possible protein families through ancestral reconstructions, sequence logos, and hidden markov models (HMMs). The flowchart includes the characterization of a major consensus sequence based on 6 amino acid patterns with correspondence with Andreeva's model, the structural template describing the clan AA peptidase fold. The set of tools is work in progress we have organized in a database within the GyDB project, referred to as Clan AA Reference Database . CONCLUSION: The pre-existing classification combined with the evolutionary history of LTR retroelements permits a consistent taxonomical collection of sequence logos and HMMs. This set is useful for gene annotation but also a reference to evaluate the diversity of, and the relationships among, the different families. Comparisons among HMMs suggest a common ancestor for all dimeric clan AA peptidases that is halfway between single-domain nonviral peptidases and those coded by Ty3/Gypsy LTR retroelements. Sequence logos reveal how all clan AA families follow similar protein domain architecture related to the peptidase fold. In particular, each family nucleates a particular consensus motif in the sequence position related to the flap. The different motifs constitute a network where an alanine-asparagine-like variable motif predominates, instead of the canonical flap of the HIV-1 peptidase and closer relatives. REVIEWERS: This article was reviewed by Daniel H. Haft, Vladimir Kapitonov (nominated by Jerry Jurka), and Ben M. Dunn (nominated by Claus Wilke). BioMed Central 2009-01-27 /pmc/articles/PMC2642776/ /pubmed/19173708 http://dx.doi.org/10.1186/1745-6150-4-3 Text en Copyright © 2009 Llorens et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Llorens, Carlos
Futami, Ricardo
Renaud, Gabriel
Moya, Andrés
Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases
title Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases
title_full Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases
title_fullStr Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases
title_full_unstemmed Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases
title_short Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases
title_sort bioinformatic flowchart and database to investigate the origins and diversity of clan aa peptidases
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2642776/
https://www.ncbi.nlm.nih.gov/pubmed/19173708
http://dx.doi.org/10.1186/1745-6150-4-3
work_keys_str_mv AT llorenscarlos bioinformaticflowchartanddatabasetoinvestigatetheoriginsanddiversityofclanaapeptidases
AT futamiricardo bioinformaticflowchartanddatabasetoinvestigatetheoriginsanddiversityofclanaapeptidases
AT renaudgabriel bioinformaticflowchartanddatabasetoinvestigatetheoriginsanddiversityofclanaapeptidases
AT moyaandres bioinformaticflowchartanddatabasetoinvestigatetheoriginsanddiversityofclanaapeptidases