
An integrated gene annotation and transcriptional profiling approach towards the full gene content of the Drosophila genome

BACKGROUND: While the genome sequences for a variety of organisms are now available, the precise number of the genes encoded is still a matter of debate. For the human genome several stringent annotation approaches have resulted in the same number of potential genes, but a careful comparison reveale...

Bibliographic Details
Main Authors: Hild, M, Beckmann, B, Haas, SA, Koch, B, Solovyev, V, Busold, C, Fellenberg, K, Boutros, M, Vingron, M, Sauer, F, Hoheisel, JD, Paro, R
Format: Text
Language: English
Published: BioMed Central 2004
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC395735/
https://www.ncbi.nlm.nih.gov/pubmed/14709175
http://dx.doi.org/10.1186/gb-2003-5-1-r3
author Hild, M
Beckmann, B
Haas, SA
Koch, B
Solovyev, V
Busold, C
Fellenberg, K
Boutros, M
Vingron, M
Sauer, F
Hoheisel, JD
Paro, R
collection PubMed
description BACKGROUND: While the genome sequences for a variety of organisms are now available, the precise number of the genes encoded is still a matter of debate. For the human genome several stringent annotation approaches have resulted in the same number of potential genes, but a careful comparison revealed only limited overlap. This indicates that only the combination of different computational prediction methods and experimental evaluation of such in silico data will provide more complete genome annotations. In order to get a more complete gene content of the Drosophila melanogaster genome, we based our new D. melanogaster whole-transcriptome microarray, the Heidelberg FlyArray, on the combination of the Berkeley Drosophila Genome Project (BDGP) annotation and a novel ab initio gene prediction of lower stringency using the Fgenesh software. RESULTS: Here we provide evidence for the transcription of approximately 2,600 additional genes predicted by Fgenesh. Validation of the developmental profiling data by RT-PCR and in situ hybridization indicates a lower limit of 2,000 novel annotations, thus substantially raising the number of genes that make a fly. CONCLUSIONS: The successful design and application of this novel Drosophila microarray on the basis of our integrated in silico/wet biology approach confirms our expectation that in silico approaches alone will always tend to be incomplete. The identification of at least 2,000 novel genes highlights the importance of gathering experimental evidence to discover all genes within a genome. Moreover, as such an approach is independent of homology criteria, it will allow the discovery of novel genes unrelated to known protein families or those that have not been strictly conserved between species.
format Text
id pubmed-395735
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-395735 2004-04-24 Genome Biol Research BioMed Central 2004 2003-12-22 /pmc/articles/PMC395735/ /pubmed/14709175 http://dx.doi.org/10.1186/gb-2003-5-1-r3 Text en Copyright © 2003 Hild et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
title An integrated gene annotation and transcriptional profiling approach towards the full gene content of the Drosophila genome
topic Research