Cargando…

Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae

BACKGROUND: The most abundant family of insect cuticular proteins, the CPR family, is recognized by the R&R Consensus, a domain of about 64 amino acids that binds to chitin and is present throughout arthropods. Several species have now been shown to have more than 100 CPR genes, inviting specula...

Descripción completa

Detalles Bibliográficos
Autores principales: Cornman, R Scott, Togawa, Toru, Dunn, W Augustine, He, Ningjia, Emmons, Aaron C, Willis, Judith H
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2259329/
https://www.ncbi.nlm.nih.gov/pubmed/18205929
http://dx.doi.org/10.1186/1471-2164-9-22
_version_ 1782151374740389888
author Cornman, R Scott
Togawa, Toru
Dunn, W Augustine
He, Ningjia
Emmons, Aaron C
Willis, Judith H
author_facet Cornman, R Scott
Togawa, Toru
Dunn, W Augustine
He, Ningjia
Emmons, Aaron C
Willis, Judith H
author_sort Cornman, R Scott
collection PubMed
description BACKGROUND: The most abundant family of insect cuticular proteins, the CPR family, is recognized by the R&R Consensus, a domain of about 64 amino acids that binds to chitin and is present throughout arthropods. Several species have now been shown to have more than 100 CPR genes, inviting speculation as to the functional importance of this large number and diversity. RESULTS: We have identified 156 genes in Anopheles gambiae that code for putative cuticular proteins in this CPR family, over 1% of the total number of predicted genes in this species. Annotation was verified using several criteria including identification of TATA boxes, INRs, and DPEs plus support from proteomic and gene expression analyses. Two previously recognized CPR classes, RR-1 and RR-2, form separate, well-supported clades with the exception of a small set of genes with long branches whose relationships are poorly resolved. Several of these outliers have clear orthologs in other species. Although both clades are under purifying selection, the RR-1 variant of the R&R Consensus is evolving at twice the rate of the RR-2 variant and is structurally more labile. In contrast, the regions flanking the R&R Consensus have diversified in amino-acid composition to a much greater extent in RR-2 genes compared with RR-1 genes. Many genes are found in compact tandem arrays that may include similar or dissimilar genes but always include just one of the two classes. Tandem arrays of RR-2 genes frequently contain subsets of genes coding for highly similar proteins (sequence clusters). Properties of the proteins indicated that each cluster may serve a distinct function in the cuticle. CONCLUSION: The complete annotation of this large gene family provides insight on the mechanisms of gene family evolution and clues about the need for so many CPR genes. These data also should assist annotation of other Anopheles genes.
format Text
id pubmed-2259329
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22593292008-03-04 Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae Cornman, R Scott Togawa, Toru Dunn, W Augustine He, Ningjia Emmons, Aaron C Willis, Judith H BMC Genomics Research Article BACKGROUND: The most abundant family of insect cuticular proteins, the CPR family, is recognized by the R&R Consensus, a domain of about 64 amino acids that binds to chitin and is present throughout arthropods. Several species have now been shown to have more than 100 CPR genes, inviting speculation as to the functional importance of this large number and diversity. RESULTS: We have identified 156 genes in Anopheles gambiae that code for putative cuticular proteins in this CPR family, over 1% of the total number of predicted genes in this species. Annotation was verified using several criteria including identification of TATA boxes, INRs, and DPEs plus support from proteomic and gene expression analyses. Two previously recognized CPR classes, RR-1 and RR-2, form separate, well-supported clades with the exception of a small set of genes with long branches whose relationships are poorly resolved. Several of these outliers have clear orthologs in other species. Although both clades are under purifying selection, the RR-1 variant of the R&R Consensus is evolving at twice the rate of the RR-2 variant and is structurally more labile. In contrast, the regions flanking the R&R Consensus have diversified in amino-acid composition to a much greater extent in RR-2 genes compared with RR-1 genes. Many genes are found in compact tandem arrays that may include similar or dissimilar genes but always include just one of the two classes. Tandem arrays of RR-2 genes frequently contain subsets of genes coding for highly similar proteins (sequence clusters). Properties of the proteins indicated that each cluster may serve a distinct function in the cuticle. CONCLUSION: The complete annotation of this large gene family provides insight on the mechanisms of gene family evolution and clues about the need for so many CPR genes. These data also should assist annotation of other Anopheles genes. BioMed Central 2008-01-18 /pmc/articles/PMC2259329/ /pubmed/18205929 http://dx.doi.org/10.1186/1471-2164-9-22 Text en Copyright © 2008 Cornman et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Cornman, R Scott
Togawa, Toru
Dunn, W Augustine
He, Ningjia
Emmons, Aaron C
Willis, Judith H
Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae
title Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae
title_full Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae
title_fullStr Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae
title_full_unstemmed Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae
title_short Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae
title_sort annotation and analysis of a large cuticular protein family with the r&r consensus in anopheles gambiae
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2259329/
https://www.ncbi.nlm.nih.gov/pubmed/18205929
http://dx.doi.org/10.1186/1471-2164-9-22
work_keys_str_mv AT cornmanrscott annotationandanalysisofalargecuticularproteinfamilywiththerrconsensusinanophelesgambiae
AT togawatoru annotationandanalysisofalargecuticularproteinfamilywiththerrconsensusinanophelesgambiae
AT dunnwaugustine annotationandanalysisofalargecuticularproteinfamilywiththerrconsensusinanophelesgambiae
AT heningjia annotationandanalysisofalargecuticularproteinfamilywiththerrconsensusinanophelesgambiae
AT emmonsaaronc annotationandanalysisofalargecuticularproteinfamilywiththerrconsensusinanophelesgambiae
AT willisjudithh annotationandanalysisofalargecuticularproteinfamilywiththerrconsensusinanophelesgambiae