Cargando…

Identification of unannotated exons of low abundance transcripts in Drosophila melanogaster and cloning of a new serine protease gene upregulated upon injury

BACKGROUND: The sequencing of the D.melanogaster genome revealed an unexpected small number of genes (~ 14,000) indicating that mechanisms acting on generation of transcript diversity must have played a major role in the evolution of complex metazoans. Among the most extensively used mechanisms that...

Descripción completa

Detalles Bibliográficos
Autores principales: Maia, Rafaela M, Valente, Valeria, Cunha, Marco AV, Sousa, Josane F, Araujo, Daniela D, Silva, Wilson A, Zago, Marco A, Dias-Neto, Emmanuel, Souza, Sandro J, Simpson, Andrew JG, Monesi, Nadia, Ramos, Ricardo GP, Espreafico, Enilza M, Paçó-Larson, Maria L
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1949825/
https://www.ncbi.nlm.nih.gov/pubmed/17650329
http://dx.doi.org/10.1186/1471-2164-8-249
_version_ 1782134521894797312
author Maia, Rafaela M
Valente, Valeria
Cunha, Marco AV
Sousa, Josane F
Araujo, Daniela D
Silva, Wilson A
Zago, Marco A
Dias-Neto, Emmanuel
Souza, Sandro J
Simpson, Andrew JG
Monesi, Nadia
Ramos, Ricardo GP
Espreafico, Enilza M
Paçó-Larson, Maria L
author_facet Maia, Rafaela M
Valente, Valeria
Cunha, Marco AV
Sousa, Josane F
Araujo, Daniela D
Silva, Wilson A
Zago, Marco A
Dias-Neto, Emmanuel
Souza, Sandro J
Simpson, Andrew JG
Monesi, Nadia
Ramos, Ricardo GP
Espreafico, Enilza M
Paçó-Larson, Maria L
author_sort Maia, Rafaela M
collection PubMed
description BACKGROUND: The sequencing of the D.melanogaster genome revealed an unexpected small number of genes (~ 14,000) indicating that mechanisms acting on generation of transcript diversity must have played a major role in the evolution of complex metazoans. Among the most extensively used mechanisms that accounts for this diversity is alternative splicing. It is estimated that over 40% of Drosophila protein-coding genes contain one or more alternative exons. A recent transcription map of the Drosophila embryogenesis indicates that 30% of the transcribed regions are unannotated, and that 1/3 of this is estimated as missed or alternative exons of previously characterized protein-coding genes. Therefore, the identification of the variety of expressed transcripts depends on experimental data for its final validation and is continuously being performed using different approaches. We applied the Open Reading Frame Expressed Sequence Tags (ORESTES) methodology, which is capable of generating cDNA data from the central portion of rare transcripts, in order to investigate the presence of hitherto unnanotated regions of Drosophila transcriptome. RESULTS: Bioinformatic analysis of 1,303 Drosophila ORESTES clusters identified 68 sequences derived from unannotated regions in the current Drosophila genome version (4.3). Of these, a set of 38 was analysed by polyA(+ )northern blot hybridization, validating 17 (50%) new exons of low abundance transcripts. For one of these ESTs, we obtained the cDNA encompassing the complete coding sequence of a new serine protease, named SP212. The SP212 gene is part of a serine protease gene cluster located in the chromosome region 88A12-B1. This cluster includes the predicted genes CG9631, CG9649 and CG31326, which were previously identified as up-regulated after immune challenges in genomic-scale microarray analysis. In agreement with the proposal that this locus is co-regulated in response to microorganisms infection, we show here that SP212 is also up-regulated upon injury. CONCLUSION: Using the ORESTES methodology we identified 17 novel exons from low abundance Drosophila transcripts, and through a PCR approach the complete CDS of one of these transcripts was defined. Our results show that the computational identification and manual inspection are not sufficient to annotate a genome in the absence of experimentally derived data.
format Text
id pubmed-1949825
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-19498252007-08-17 Identification of unannotated exons of low abundance transcripts in Drosophila melanogaster and cloning of a new serine protease gene upregulated upon injury Maia, Rafaela M Valente, Valeria Cunha, Marco AV Sousa, Josane F Araujo, Daniela D Silva, Wilson A Zago, Marco A Dias-Neto, Emmanuel Souza, Sandro J Simpson, Andrew JG Monesi, Nadia Ramos, Ricardo GP Espreafico, Enilza M Paçó-Larson, Maria L BMC Genomics Research Article BACKGROUND: The sequencing of the D.melanogaster genome revealed an unexpected small number of genes (~ 14,000) indicating that mechanisms acting on generation of transcript diversity must have played a major role in the evolution of complex metazoans. Among the most extensively used mechanisms that accounts for this diversity is alternative splicing. It is estimated that over 40% of Drosophila protein-coding genes contain one or more alternative exons. A recent transcription map of the Drosophila embryogenesis indicates that 30% of the transcribed regions are unannotated, and that 1/3 of this is estimated as missed or alternative exons of previously characterized protein-coding genes. Therefore, the identification of the variety of expressed transcripts depends on experimental data for its final validation and is continuously being performed using different approaches. We applied the Open Reading Frame Expressed Sequence Tags (ORESTES) methodology, which is capable of generating cDNA data from the central portion of rare transcripts, in order to investigate the presence of hitherto unnanotated regions of Drosophila transcriptome. RESULTS: Bioinformatic analysis of 1,303 Drosophila ORESTES clusters identified 68 sequences derived from unannotated regions in the current Drosophila genome version (4.3). Of these, a set of 38 was analysed by polyA(+ )northern blot hybridization, validating 17 (50%) new exons of low abundance transcripts. For one of these ESTs, we obtained the cDNA encompassing the complete coding sequence of a new serine protease, named SP212. The SP212 gene is part of a serine protease gene cluster located in the chromosome region 88A12-B1. This cluster includes the predicted genes CG9631, CG9649 and CG31326, which were previously identified as up-regulated after immune challenges in genomic-scale microarray analysis. In agreement with the proposal that this locus is co-regulated in response to microorganisms infection, we show here that SP212 is also up-regulated upon injury. CONCLUSION: Using the ORESTES methodology we identified 17 novel exons from low abundance Drosophila transcripts, and through a PCR approach the complete CDS of one of these transcripts was defined. Our results show that the computational identification and manual inspection are not sufficient to annotate a genome in the absence of experimentally derived data. BioMed Central 2007-07-24 /pmc/articles/PMC1949825/ /pubmed/17650329 http://dx.doi.org/10.1186/1471-2164-8-249 Text en Copyright © 2007 Maia et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Maia, Rafaela M
Valente, Valeria
Cunha, Marco AV
Sousa, Josane F
Araujo, Daniela D
Silva, Wilson A
Zago, Marco A
Dias-Neto, Emmanuel
Souza, Sandro J
Simpson, Andrew JG
Monesi, Nadia
Ramos, Ricardo GP
Espreafico, Enilza M
Paçó-Larson, Maria L
Identification of unannotated exons of low abundance transcripts in Drosophila melanogaster and cloning of a new serine protease gene upregulated upon injury
title Identification of unannotated exons of low abundance transcripts in Drosophila melanogaster and cloning of a new serine protease gene upregulated upon injury
title_full Identification of unannotated exons of low abundance transcripts in Drosophila melanogaster and cloning of a new serine protease gene upregulated upon injury
title_fullStr Identification of unannotated exons of low abundance transcripts in Drosophila melanogaster and cloning of a new serine protease gene upregulated upon injury
title_full_unstemmed Identification of unannotated exons of low abundance transcripts in Drosophila melanogaster and cloning of a new serine protease gene upregulated upon injury
title_short Identification of unannotated exons of low abundance transcripts in Drosophila melanogaster and cloning of a new serine protease gene upregulated upon injury
title_sort identification of unannotated exons of low abundance transcripts in drosophila melanogaster and cloning of a new serine protease gene upregulated upon injury
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1949825/
https://www.ncbi.nlm.nih.gov/pubmed/17650329
http://dx.doi.org/10.1186/1471-2164-8-249
work_keys_str_mv AT maiarafaelam identificationofunannotatedexonsoflowabundancetranscriptsindrosophilamelanogasterandcloningofanewserineproteasegeneupregulateduponinjury
AT valentevaleria identificationofunannotatedexonsoflowabundancetranscriptsindrosophilamelanogasterandcloningofanewserineproteasegeneupregulateduponinjury
AT cunhamarcoav identificationofunannotatedexonsoflowabundancetranscriptsindrosophilamelanogasterandcloningofanewserineproteasegeneupregulateduponinjury
AT sousajosanef identificationofunannotatedexonsoflowabundancetranscriptsindrosophilamelanogasterandcloningofanewserineproteasegeneupregulateduponinjury
AT araujodanielad identificationofunannotatedexonsoflowabundancetranscriptsindrosophilamelanogasterandcloningofanewserineproteasegeneupregulateduponinjury
AT silvawilsona identificationofunannotatedexonsoflowabundancetranscriptsindrosophilamelanogasterandcloningofanewserineproteasegeneupregulateduponinjury
AT zagomarcoa identificationofunannotatedexonsoflowabundancetranscriptsindrosophilamelanogasterandcloningofanewserineproteasegeneupregulateduponinjury
AT diasnetoemmanuel identificationofunannotatedexonsoflowabundancetranscriptsindrosophilamelanogasterandcloningofanewserineproteasegeneupregulateduponinjury
AT souzasandroj identificationofunannotatedexonsoflowabundancetranscriptsindrosophilamelanogasterandcloningofanewserineproteasegeneupregulateduponinjury
AT simpsonandrewjg identificationofunannotatedexonsoflowabundancetranscriptsindrosophilamelanogasterandcloningofanewserineproteasegeneupregulateduponinjury
AT monesinadia identificationofunannotatedexonsoflowabundancetranscriptsindrosophilamelanogasterandcloningofanewserineproteasegeneupregulateduponinjury
AT ramosricardogp identificationofunannotatedexonsoflowabundancetranscriptsindrosophilamelanogasterandcloningofanewserineproteasegeneupregulateduponinjury
AT espreaficoenilzam identificationofunannotatedexonsoflowabundancetranscriptsindrosophilamelanogasterandcloningofanewserineproteasegeneupregulateduponinjury
AT pacolarsonmarial identificationofunannotatedexonsoflowabundancetranscriptsindrosophilamelanogasterandcloningofanewserineproteasegeneupregulateduponinjury