Cargando…
Comprehensive comparative homeobox gene annotation in human and mouse
Homeobox genes are a group of genes coding for transcription factors with a DNA-binding helix-turn-helix structure called a homeodomain and which play a crucial role in pattern formation during embryogenesis. Many homeobox genes are located in clusters and some of these, most notably the HOX genes,...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4584094/ https://www.ncbi.nlm.nih.gov/pubmed/26412852 http://dx.doi.org/10.1093/database/bav091 |
_version_ | 1782391939862102016 |
---|---|
author | Wilming, Laurens G. Boychenko, Veronika Harrow, Jennifer L. |
author_facet | Wilming, Laurens G. Boychenko, Veronika Harrow, Jennifer L. |
author_sort | Wilming, Laurens G. |
collection | PubMed |
description | Homeobox genes are a group of genes coding for transcription factors with a DNA-binding helix-turn-helix structure called a homeodomain and which play a crucial role in pattern formation during embryogenesis. Many homeobox genes are located in clusters and some of these, most notably the HOX genes, are known to have antisense or opposite strand long non-coding RNA (lncRNA) genes that play a regulatory role. Because automated annotation of both gene clusters and non-coding genes is fraught with difficulty (over-prediction, under-prediction, inaccurate transcript structures), we set out to manually annotate all homeobox genes in the mouse and human genomes. This includes all supported splice variants, pseudogenes and both antisense and flanking lncRNAs. One of the areas where manual annotation has a significant advantage is the annotation of duplicated gene clusters. After comprehensive annotation of all homeobox genes and their antisense genes in human and in mouse, we found some discrepancies with the current gene set in RefSeq regarding exact gene structures and coding versus pseudogene locus biotype. We also identified previously un-annotated pseudogenes in the DUX, Rhox and Obox gene clusters, which helped us re-evaluate and update the gene nomenclature in these regions. We found that human homeobox genes are enriched in antisense lncRNA loci, some of which are known to play a role in gene or gene cluster regulation, compared to their mouse orthologues. Of the annotated set of 241 human protein-coding homeobox genes, 98 have an antisense locus (41%) while of the 277 orthologous mouse genes, only 62 protein coding gene have an antisense locus (22%), based on publicly available transcriptional evidence. |
format | Online Article Text |
id | pubmed-4584094 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-45840942015-09-28 Comprehensive comparative homeobox gene annotation in human and mouse Wilming, Laurens G. Boychenko, Veronika Harrow, Jennifer L. Database (Oxford) Original Article Homeobox genes are a group of genes coding for transcription factors with a DNA-binding helix-turn-helix structure called a homeodomain and which play a crucial role in pattern formation during embryogenesis. Many homeobox genes are located in clusters and some of these, most notably the HOX genes, are known to have antisense or opposite strand long non-coding RNA (lncRNA) genes that play a regulatory role. Because automated annotation of both gene clusters and non-coding genes is fraught with difficulty (over-prediction, under-prediction, inaccurate transcript structures), we set out to manually annotate all homeobox genes in the mouse and human genomes. This includes all supported splice variants, pseudogenes and both antisense and flanking lncRNAs. One of the areas where manual annotation has a significant advantage is the annotation of duplicated gene clusters. After comprehensive annotation of all homeobox genes and their antisense genes in human and in mouse, we found some discrepancies with the current gene set in RefSeq regarding exact gene structures and coding versus pseudogene locus biotype. We also identified previously un-annotated pseudogenes in the DUX, Rhox and Obox gene clusters, which helped us re-evaluate and update the gene nomenclature in these regions. We found that human homeobox genes are enriched in antisense lncRNA loci, some of which are known to play a role in gene or gene cluster regulation, compared to their mouse orthologues. Of the annotated set of 241 human protein-coding homeobox genes, 98 have an antisense locus (41%) while of the 277 orthologous mouse genes, only 62 protein coding gene have an antisense locus (22%), based on publicly available transcriptional evidence. Oxford University Press 2015-09-25 /pmc/articles/PMC4584094/ /pubmed/26412852 http://dx.doi.org/10.1093/database/bav091 Text en © The Author(s) 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Wilming, Laurens G. Boychenko, Veronika Harrow, Jennifer L. Comprehensive comparative homeobox gene annotation in human and mouse |
title | Comprehensive comparative homeobox gene annotation in human and mouse |
title_full | Comprehensive comparative homeobox gene annotation in human and mouse |
title_fullStr | Comprehensive comparative homeobox gene annotation in human and mouse |
title_full_unstemmed | Comprehensive comparative homeobox gene annotation in human and mouse |
title_short | Comprehensive comparative homeobox gene annotation in human and mouse |
title_sort | comprehensive comparative homeobox gene annotation in human and mouse |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4584094/ https://www.ncbi.nlm.nih.gov/pubmed/26412852 http://dx.doi.org/10.1093/database/bav091 |
work_keys_str_mv | AT wilminglaurensg comprehensivecomparativehomeoboxgeneannotationinhumanandmouse AT boychenkoveronika comprehensivecomparativehomeoboxgeneannotationinhumanandmouse AT harrowjenniferl comprehensivecomparativehomeoboxgeneannotationinhumanandmouse |