Cargando…

Manual annotation and analysis of the defensin gene cluster in the C57BL/6J mouse reference genome

BACKGROUND: Host defense peptides are a critical component of the innate immune system. Human alpha- and beta-defensin genes are subject to copy number variation (CNV) and historically the organization of mouse alpha-defensin genes has been poorly defined. Here we present the first full manual genom...

Descripción completa

Detalles Bibliográficos
Autores principales: Amid, Clara, Rehaume, Linda M, Brown, Kelly L, Gilbert, James GR, Dougan, Gordon, Hancock, Robert EW, Harrow, Jennifer L
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2807441/
https://www.ncbi.nlm.nih.gov/pubmed/20003482
http://dx.doi.org/10.1186/1471-2164-10-606
_version_ 1782176403962200064
author Amid, Clara
Rehaume, Linda M
Brown, Kelly L
Gilbert, James GR
Dougan, Gordon
Hancock, Robert EW
Harrow, Jennifer L
author_facet Amid, Clara
Rehaume, Linda M
Brown, Kelly L
Gilbert, James GR
Dougan, Gordon
Hancock, Robert EW
Harrow, Jennifer L
author_sort Amid, Clara
collection PubMed
description BACKGROUND: Host defense peptides are a critical component of the innate immune system. Human alpha- and beta-defensin genes are subject to copy number variation (CNV) and historically the organization of mouse alpha-defensin genes has been poorly defined. Here we present the first full manual genomic annotation of the mouse defensin region on Chromosome 8 of the reference strain C57BL/6J, and the analysis of the orthologous regions of the human and rat genomes. Problems were identified with the reference assemblies of all three genomes. Defensins have been studied for over two decades and their naming has become a critical issue due to incorrect identification of defensin genes derived from different mouse strains and the duplicated nature of this region. RESULTS: The defensin gene cluster region on mouse Chromosome 8 A2 contains 98 gene loci: 53 are likely active defensin genes and 22 defensin pseudogenes. Several TATA box motifs were found for human and mouse defensin genes that likely impact gene expression. Three novel defensin genes belonging to the Cryptdin Related Sequences (CRS) family were identified. All additional mouse defensin loci on Chromosomes 1, 2 and 14 were annotated and unusual splice variants identified. Comparison of the mouse alpha-defensins in the three main mouse reference gene sets Ensembl, Mouse Genome Informatics (MGI), and NCBI RefSeq reveals significant inconsistencies in annotation and nomenclature. We are collaborating with the Mouse Genome Nomenclature Committee (MGNC) to establish a standardized naming scheme for alpha-defensins. CONCLUSIONS: Prior to this analysis, there was no reliable reference gene set available for the mouse strain C57BL/6J defensin genes, demonstrating that manual intervention is still critical for the annotation of complex gene families and heavily duplicated regions. Accurate gene annotation is facilitated by the annotation of pseudogenes and regulatory elements. Manually curated gene models will be incorporated into the Ensembl and Consensus Coding Sequence (CCDS) reference sets. Elucidation of the genomic structure of this complex gene cluster on the mouse reference sequence, and adoption of a clear and unambiguous naming scheme, will provide a valuable tool to support studies on the evolution, regulatory mechanisms and biological functions of defensins in vivo.
format Text
id pubmed-2807441
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28074412010-01-16 Manual annotation and analysis of the defensin gene cluster in the C57BL/6J mouse reference genome Amid, Clara Rehaume, Linda M Brown, Kelly L Gilbert, James GR Dougan, Gordon Hancock, Robert EW Harrow, Jennifer L BMC Genomics Research article BACKGROUND: Host defense peptides are a critical component of the innate immune system. Human alpha- and beta-defensin genes are subject to copy number variation (CNV) and historically the organization of mouse alpha-defensin genes has been poorly defined. Here we present the first full manual genomic annotation of the mouse defensin region on Chromosome 8 of the reference strain C57BL/6J, and the analysis of the orthologous regions of the human and rat genomes. Problems were identified with the reference assemblies of all three genomes. Defensins have been studied for over two decades and their naming has become a critical issue due to incorrect identification of defensin genes derived from different mouse strains and the duplicated nature of this region. RESULTS: The defensin gene cluster region on mouse Chromosome 8 A2 contains 98 gene loci: 53 are likely active defensin genes and 22 defensin pseudogenes. Several TATA box motifs were found for human and mouse defensin genes that likely impact gene expression. Three novel defensin genes belonging to the Cryptdin Related Sequences (CRS) family were identified. All additional mouse defensin loci on Chromosomes 1, 2 and 14 were annotated and unusual splice variants identified. Comparison of the mouse alpha-defensins in the three main mouse reference gene sets Ensembl, Mouse Genome Informatics (MGI), and NCBI RefSeq reveals significant inconsistencies in annotation and nomenclature. We are collaborating with the Mouse Genome Nomenclature Committee (MGNC) to establish a standardized naming scheme for alpha-defensins. CONCLUSIONS: Prior to this analysis, there was no reliable reference gene set available for the mouse strain C57BL/6J defensin genes, demonstrating that manual intervention is still critical for the annotation of complex gene families and heavily duplicated regions. Accurate gene annotation is facilitated by the annotation of pseudogenes and regulatory elements. Manually curated gene models will be incorporated into the Ensembl and Consensus Coding Sequence (CCDS) reference sets. Elucidation of the genomic structure of this complex gene cluster on the mouse reference sequence, and adoption of a clear and unambiguous naming scheme, will provide a valuable tool to support studies on the evolution, regulatory mechanisms and biological functions of defensins in vivo. BioMed Central 2009-12-15 /pmc/articles/PMC2807441/ /pubmed/20003482 http://dx.doi.org/10.1186/1471-2164-10-606 Text en Copyright ©2009 Amid et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Amid, Clara
Rehaume, Linda M
Brown, Kelly L
Gilbert, James GR
Dougan, Gordon
Hancock, Robert EW
Harrow, Jennifer L
Manual annotation and analysis of the defensin gene cluster in the C57BL/6J mouse reference genome
title Manual annotation and analysis of the defensin gene cluster in the C57BL/6J mouse reference genome
title_full Manual annotation and analysis of the defensin gene cluster in the C57BL/6J mouse reference genome
title_fullStr Manual annotation and analysis of the defensin gene cluster in the C57BL/6J mouse reference genome
title_full_unstemmed Manual annotation and analysis of the defensin gene cluster in the C57BL/6J mouse reference genome
title_short Manual annotation and analysis of the defensin gene cluster in the C57BL/6J mouse reference genome
title_sort manual annotation and analysis of the defensin gene cluster in the c57bl/6j mouse reference genome
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2807441/
https://www.ncbi.nlm.nih.gov/pubmed/20003482
http://dx.doi.org/10.1186/1471-2164-10-606
work_keys_str_mv AT amidclara manualannotationandanalysisofthedefensingeneclusterinthec57bl6jmousereferencegenome
AT rehaumelindam manualannotationandanalysisofthedefensingeneclusterinthec57bl6jmousereferencegenome
AT brownkellyl manualannotationandanalysisofthedefensingeneclusterinthec57bl6jmousereferencegenome
AT gilbertjamesgr manualannotationandanalysisofthedefensingeneclusterinthec57bl6jmousereferencegenome
AT dougangordon manualannotationandanalysisofthedefensingeneclusterinthec57bl6jmousereferencegenome
AT hancockrobertew manualannotationandanalysisofthedefensingeneclusterinthec57bl6jmousereferencegenome
AT harrowjenniferl manualannotationandanalysisofthedefensingeneclusterinthec57bl6jmousereferencegenome