Cargando…

Comprehensive comparative homeobox gene annotation in human and mouse

Homeobox genes are a group of genes coding for transcription factors with a DNA-binding helix-turn-helix structure called a homeodomain and which play a crucial role in pattern formation during embryogenesis. Many homeobox genes are located in clusters and some of these, most notably the HOX genes,...

Descripción completa

Detalles Bibliográficos
Autores principales: Wilming, Laurens G., Boychenko, Veronika, Harrow, Jennifer L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4584094/
https://www.ncbi.nlm.nih.gov/pubmed/26412852
http://dx.doi.org/10.1093/database/bav091
_version_ 1782391939862102016
author Wilming, Laurens G.
Boychenko, Veronika
Harrow, Jennifer L.
author_facet Wilming, Laurens G.
Boychenko, Veronika
Harrow, Jennifer L.
author_sort Wilming, Laurens G.
collection PubMed
description Homeobox genes are a group of genes coding for transcription factors with a DNA-binding helix-turn-helix structure called a homeodomain and which play a crucial role in pattern formation during embryogenesis. Many homeobox genes are located in clusters and some of these, most notably the HOX genes, are known to have antisense or opposite strand long non-coding RNA (lncRNA) genes that play a regulatory role. Because automated annotation of both gene clusters and non-coding genes is fraught with difficulty (over-prediction, under-prediction, inaccurate transcript structures), we set out to manually annotate all homeobox genes in the mouse and human genomes. This includes all supported splice variants, pseudogenes and both antisense and flanking lncRNAs. One of the areas where manual annotation has a significant advantage is the annotation of duplicated gene clusters. After comprehensive annotation of all homeobox genes and their antisense genes in human and in mouse, we found some discrepancies with the current gene set in RefSeq regarding exact gene structures and coding versus pseudogene locus biotype. We also identified previously un-annotated pseudogenes in the DUX, Rhox and Obox gene clusters, which helped us re-evaluate and update the gene nomenclature in these regions. We found that human homeobox genes are enriched in antisense lncRNA loci, some of which are known to play a role in gene or gene cluster regulation, compared to their mouse orthologues. Of the annotated set of 241 human protein-coding homeobox genes, 98 have an antisense locus (41%) while of the 277 orthologous mouse genes, only 62 protein coding gene have an antisense locus (22%), based on publicly available transcriptional evidence.
format Online
Article
Text
id pubmed-4584094
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-45840942015-09-28 Comprehensive comparative homeobox gene annotation in human and mouse Wilming, Laurens G. Boychenko, Veronika Harrow, Jennifer L. Database (Oxford) Original Article Homeobox genes are a group of genes coding for transcription factors with a DNA-binding helix-turn-helix structure called a homeodomain and which play a crucial role in pattern formation during embryogenesis. Many homeobox genes are located in clusters and some of these, most notably the HOX genes, are known to have antisense or opposite strand long non-coding RNA (lncRNA) genes that play a regulatory role. Because automated annotation of both gene clusters and non-coding genes is fraught with difficulty (over-prediction, under-prediction, inaccurate transcript structures), we set out to manually annotate all homeobox genes in the mouse and human genomes. This includes all supported splice variants, pseudogenes and both antisense and flanking lncRNAs. One of the areas where manual annotation has a significant advantage is the annotation of duplicated gene clusters. After comprehensive annotation of all homeobox genes and their antisense genes in human and in mouse, we found some discrepancies with the current gene set in RefSeq regarding exact gene structures and coding versus pseudogene locus biotype. We also identified previously un-annotated pseudogenes in the DUX, Rhox and Obox gene clusters, which helped us re-evaluate and update the gene nomenclature in these regions. We found that human homeobox genes are enriched in antisense lncRNA loci, some of which are known to play a role in gene or gene cluster regulation, compared to their mouse orthologues. Of the annotated set of 241 human protein-coding homeobox genes, 98 have an antisense locus (41%) while of the 277 orthologous mouse genes, only 62 protein coding gene have an antisense locus (22%), based on publicly available transcriptional evidence. Oxford University Press 2015-09-25 /pmc/articles/PMC4584094/ /pubmed/26412852 http://dx.doi.org/10.1093/database/bav091 Text en © The Author(s) 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Wilming, Laurens G.
Boychenko, Veronika
Harrow, Jennifer L.
Comprehensive comparative homeobox gene annotation in human and mouse
title Comprehensive comparative homeobox gene annotation in human and mouse
title_full Comprehensive comparative homeobox gene annotation in human and mouse
title_fullStr Comprehensive comparative homeobox gene annotation in human and mouse
title_full_unstemmed Comprehensive comparative homeobox gene annotation in human and mouse
title_short Comprehensive comparative homeobox gene annotation in human and mouse
title_sort comprehensive comparative homeobox gene annotation in human and mouse
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4584094/
https://www.ncbi.nlm.nih.gov/pubmed/26412852
http://dx.doi.org/10.1093/database/bav091
work_keys_str_mv AT wilminglaurensg comprehensivecomparativehomeoboxgeneannotationinhumanandmouse
AT boychenkoveronika comprehensivecomparativehomeoboxgeneannotationinhumanandmouse
AT harrowjenniferl comprehensivecomparativehomeoboxgeneannotationinhumanandmouse