Cargando…

In depth annotation of the Anopheles gambiae mosquito midgut transcriptome

BACKGROUND: Genome sequencing of Anopheles gambiae was completed more than ten years ago and has accelerated research on malaria transmission. However, annotation needs to be refined and verified experimentally, as most predicted transcripts have been identified by comparative analysis with genomes...

Descripción completa

Detalles Bibliográficos
Autores principales: Padrón, Alejandro, Molina-Cruz, Alvaro, Quinones, Mariam, Ribeiro, José MC, Ramphul, Urvashi, Rodrigues, Janneth, Shen, Kui, Haile, Ashley, Ramirez, José Luis, Barillas-Mury, Carolina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4131051/
https://www.ncbi.nlm.nih.gov/pubmed/25073905
http://dx.doi.org/10.1186/1471-2164-15-636
_version_ 1782330399032082432
author Padrón, Alejandro
Molina-Cruz, Alvaro
Quinones, Mariam
Ribeiro, José MC
Ramphul, Urvashi
Rodrigues, Janneth
Shen, Kui
Haile, Ashley
Ramirez, José Luis
Barillas-Mury, Carolina
author_facet Padrón, Alejandro
Molina-Cruz, Alvaro
Quinones, Mariam
Ribeiro, José MC
Ramphul, Urvashi
Rodrigues, Janneth
Shen, Kui
Haile, Ashley
Ramirez, José Luis
Barillas-Mury, Carolina
author_sort Padrón, Alejandro
collection PubMed
description BACKGROUND: Genome sequencing of Anopheles gambiae was completed more than ten years ago and has accelerated research on malaria transmission. However, annotation needs to be refined and verified experimentally, as most predicted transcripts have been identified by comparative analysis with genomes from other species. The mosquito midgut—the first organ to interact with Plasmodium parasites—mounts effective antiplasmodial responses that limit parasite survival and disease transmission. High-throughput Illumina sequencing of the midgut transcriptome was used to identify new genes and transcripts, contributing to the refinement of An. gambiae genome annotation. RESULTS: We sequenced ~223 million reads from An. gambiae midgut cDNA libraries generated from susceptible (G3) and refractory (L35) mosquito strains. Mosquitoes were infected with either Plasmodium berghei or Plasmodium falciparum, and midguts were collected after the first or second Plasmodium infection. In total, 22,889 unique midgut transcript models were generated from both An. gambiae strain sequences combined, and 76% are potentially novel. Of these novel transcripts, 49.5% aligned with annotated genes and appear to be isoforms or pre-mRNAs of reference transcripts, while 50.5% mapped to regions between annotated genes and represent novel intergenic transcripts (NITs). Predicted models were validated for midgut expression using qRT-PCR and microarray analysis, and novel isoforms were confirmed by sequencing predicted intron-exon boundaries. Coding potential analysis revealed that 43% of total midgut transcripts appear to be long non-coding RNA (lncRNA), and functional annotation of NITs showed that 68% had no homology to current databases from other species. Reads were also analyzed using de novo assembly and predicted transcripts compared with genome mapping-based models. Finally, variant analysis of G3 and L35 midgut transcripts detected 160,742 variants with respect to the An. gambiae PEST genome, and 74% were new variants. Intergenic transcripts had a higher frequency of variation compared with non-intergenic transcripts. CONCLUSION: This in-depth Illumina sequencing and assembly of the An. gambiae midgut transcriptome doubled the number of known transcripts and tripled the number of variants known in this mosquito species. It also revealed existence of a large number of lncRNA and opens new possibilities for investigating the biological function of many newly discovered transcripts. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-636) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4131051
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41310512014-08-18 In depth annotation of the Anopheles gambiae mosquito midgut transcriptome Padrón, Alejandro Molina-Cruz, Alvaro Quinones, Mariam Ribeiro, José MC Ramphul, Urvashi Rodrigues, Janneth Shen, Kui Haile, Ashley Ramirez, José Luis Barillas-Mury, Carolina BMC Genomics Research Article BACKGROUND: Genome sequencing of Anopheles gambiae was completed more than ten years ago and has accelerated research on malaria transmission. However, annotation needs to be refined and verified experimentally, as most predicted transcripts have been identified by comparative analysis with genomes from other species. The mosquito midgut—the first organ to interact with Plasmodium parasites—mounts effective antiplasmodial responses that limit parasite survival and disease transmission. High-throughput Illumina sequencing of the midgut transcriptome was used to identify new genes and transcripts, contributing to the refinement of An. gambiae genome annotation. RESULTS: We sequenced ~223 million reads from An. gambiae midgut cDNA libraries generated from susceptible (G3) and refractory (L35) mosquito strains. Mosquitoes were infected with either Plasmodium berghei or Plasmodium falciparum, and midguts were collected after the first or second Plasmodium infection. In total, 22,889 unique midgut transcript models were generated from both An. gambiae strain sequences combined, and 76% are potentially novel. Of these novel transcripts, 49.5% aligned with annotated genes and appear to be isoforms or pre-mRNAs of reference transcripts, while 50.5% mapped to regions between annotated genes and represent novel intergenic transcripts (NITs). Predicted models were validated for midgut expression using qRT-PCR and microarray analysis, and novel isoforms were confirmed by sequencing predicted intron-exon boundaries. Coding potential analysis revealed that 43% of total midgut transcripts appear to be long non-coding RNA (lncRNA), and functional annotation of NITs showed that 68% had no homology to current databases from other species. Reads were also analyzed using de novo assembly and predicted transcripts compared with genome mapping-based models. Finally, variant analysis of G3 and L35 midgut transcripts detected 160,742 variants with respect to the An. gambiae PEST genome, and 74% were new variants. Intergenic transcripts had a higher frequency of variation compared with non-intergenic transcripts. CONCLUSION: This in-depth Illumina sequencing and assembly of the An. gambiae midgut transcriptome doubled the number of known transcripts and tripled the number of variants known in this mosquito species. It also revealed existence of a large number of lncRNA and opens new possibilities for investigating the biological function of many newly discovered transcripts. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-636) contains supplementary material, which is available to authorized users. BioMed Central 2014-07-29 /pmc/articles/PMC4131051/ /pubmed/25073905 http://dx.doi.org/10.1186/1471-2164-15-636 Text en © Padrón et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Padrón, Alejandro
Molina-Cruz, Alvaro
Quinones, Mariam
Ribeiro, José MC
Ramphul, Urvashi
Rodrigues, Janneth
Shen, Kui
Haile, Ashley
Ramirez, José Luis
Barillas-Mury, Carolina
In depth annotation of the Anopheles gambiae mosquito midgut transcriptome
title In depth annotation of the Anopheles gambiae mosquito midgut transcriptome
title_full In depth annotation of the Anopheles gambiae mosquito midgut transcriptome
title_fullStr In depth annotation of the Anopheles gambiae mosquito midgut transcriptome
title_full_unstemmed In depth annotation of the Anopheles gambiae mosquito midgut transcriptome
title_short In depth annotation of the Anopheles gambiae mosquito midgut transcriptome
title_sort in depth annotation of the anopheles gambiae mosquito midgut transcriptome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4131051/
https://www.ncbi.nlm.nih.gov/pubmed/25073905
http://dx.doi.org/10.1186/1471-2164-15-636
work_keys_str_mv AT padronalejandro indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome
AT molinacruzalvaro indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome
AT quinonesmariam indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome
AT ribeirojosemc indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome
AT ramphulurvashi indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome
AT rodriguesjanneth indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome
AT shenkui indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome
AT haileashley indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome
AT ramirezjoseluis indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome
AT barillasmurycarolina indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome