Cargando…
Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae
BACKGROUND: The most abundant family of insect cuticular proteins, the CPR family, is recognized by the R&R Consensus, a domain of about 64 amino acids that binds to chitin and is present throughout arthropods. Several species have now been shown to have more than 100 CPR genes, inviting specula...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2259329/ https://www.ncbi.nlm.nih.gov/pubmed/18205929 http://dx.doi.org/10.1186/1471-2164-9-22 |
_version_ | 1782151374740389888 |
---|---|
author | Cornman, R Scott Togawa, Toru Dunn, W Augustine He, Ningjia Emmons, Aaron C Willis, Judith H |
author_facet | Cornman, R Scott Togawa, Toru Dunn, W Augustine He, Ningjia Emmons, Aaron C Willis, Judith H |
author_sort | Cornman, R Scott |
collection | PubMed |
description | BACKGROUND: The most abundant family of insect cuticular proteins, the CPR family, is recognized by the R&R Consensus, a domain of about 64 amino acids that binds to chitin and is present throughout arthropods. Several species have now been shown to have more than 100 CPR genes, inviting speculation as to the functional importance of this large number and diversity. RESULTS: We have identified 156 genes in Anopheles gambiae that code for putative cuticular proteins in this CPR family, over 1% of the total number of predicted genes in this species. Annotation was verified using several criteria including identification of TATA boxes, INRs, and DPEs plus support from proteomic and gene expression analyses. Two previously recognized CPR classes, RR-1 and RR-2, form separate, well-supported clades with the exception of a small set of genes with long branches whose relationships are poorly resolved. Several of these outliers have clear orthologs in other species. Although both clades are under purifying selection, the RR-1 variant of the R&R Consensus is evolving at twice the rate of the RR-2 variant and is structurally more labile. In contrast, the regions flanking the R&R Consensus have diversified in amino-acid composition to a much greater extent in RR-2 genes compared with RR-1 genes. Many genes are found in compact tandem arrays that may include similar or dissimilar genes but always include just one of the two classes. Tandem arrays of RR-2 genes frequently contain subsets of genes coding for highly similar proteins (sequence clusters). Properties of the proteins indicated that each cluster may serve a distinct function in the cuticle. CONCLUSION: The complete annotation of this large gene family provides insight on the mechanisms of gene family evolution and clues about the need for so many CPR genes. These data also should assist annotation of other Anopheles genes. |
format | Text |
id | pubmed-2259329 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-22593292008-03-04 Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae Cornman, R Scott Togawa, Toru Dunn, W Augustine He, Ningjia Emmons, Aaron C Willis, Judith H BMC Genomics Research Article BACKGROUND: The most abundant family of insect cuticular proteins, the CPR family, is recognized by the R&R Consensus, a domain of about 64 amino acids that binds to chitin and is present throughout arthropods. Several species have now been shown to have more than 100 CPR genes, inviting speculation as to the functional importance of this large number and diversity. RESULTS: We have identified 156 genes in Anopheles gambiae that code for putative cuticular proteins in this CPR family, over 1% of the total number of predicted genes in this species. Annotation was verified using several criteria including identification of TATA boxes, INRs, and DPEs plus support from proteomic and gene expression analyses. Two previously recognized CPR classes, RR-1 and RR-2, form separate, well-supported clades with the exception of a small set of genes with long branches whose relationships are poorly resolved. Several of these outliers have clear orthologs in other species. Although both clades are under purifying selection, the RR-1 variant of the R&R Consensus is evolving at twice the rate of the RR-2 variant and is structurally more labile. In contrast, the regions flanking the R&R Consensus have diversified in amino-acid composition to a much greater extent in RR-2 genes compared with RR-1 genes. Many genes are found in compact tandem arrays that may include similar or dissimilar genes but always include just one of the two classes. Tandem arrays of RR-2 genes frequently contain subsets of genes coding for highly similar proteins (sequence clusters). Properties of the proteins indicated that each cluster may serve a distinct function in the cuticle. CONCLUSION: The complete annotation of this large gene family provides insight on the mechanisms of gene family evolution and clues about the need for so many CPR genes. These data also should assist annotation of other Anopheles genes. BioMed Central 2008-01-18 /pmc/articles/PMC2259329/ /pubmed/18205929 http://dx.doi.org/10.1186/1471-2164-9-22 Text en Copyright © 2008 Cornman et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Cornman, R Scott Togawa, Toru Dunn, W Augustine He, Ningjia Emmons, Aaron C Willis, Judith H Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae |
title | Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae |
title_full | Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae |
title_fullStr | Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae |
title_full_unstemmed | Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae |
title_short | Annotation and analysis of a large cuticular protein family with the R&R Consensus in Anopheles gambiae |
title_sort | annotation and analysis of a large cuticular protein family with the r&r consensus in anopheles gambiae |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2259329/ https://www.ncbi.nlm.nih.gov/pubmed/18205929 http://dx.doi.org/10.1186/1471-2164-9-22 |
work_keys_str_mv | AT cornmanrscott annotationandanalysisofalargecuticularproteinfamilywiththerrconsensusinanophelesgambiae AT togawatoru annotationandanalysisofalargecuticularproteinfamilywiththerrconsensusinanophelesgambiae AT dunnwaugustine annotationandanalysisofalargecuticularproteinfamilywiththerrconsensusinanophelesgambiae AT heningjia annotationandanalysisofalargecuticularproteinfamilywiththerrconsensusinanophelesgambiae AT emmonsaaronc annotationandanalysisofalargecuticularproteinfamilywiththerrconsensusinanophelesgambiae AT willisjudithh annotationandanalysisofalargecuticularproteinfamilywiththerrconsensusinanophelesgambiae |