Cargando…
In depth annotation of the Anopheles gambiae mosquito midgut transcriptome
BACKGROUND: Genome sequencing of Anopheles gambiae was completed more than ten years ago and has accelerated research on malaria transmission. However, annotation needs to be refined and verified experimentally, as most predicted transcripts have been identified by comparative analysis with genomes...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4131051/ https://www.ncbi.nlm.nih.gov/pubmed/25073905 http://dx.doi.org/10.1186/1471-2164-15-636 |
_version_ | 1782330399032082432 |
---|---|
author | Padrón, Alejandro Molina-Cruz, Alvaro Quinones, Mariam Ribeiro, José MC Ramphul, Urvashi Rodrigues, Janneth Shen, Kui Haile, Ashley Ramirez, José Luis Barillas-Mury, Carolina |
author_facet | Padrón, Alejandro Molina-Cruz, Alvaro Quinones, Mariam Ribeiro, José MC Ramphul, Urvashi Rodrigues, Janneth Shen, Kui Haile, Ashley Ramirez, José Luis Barillas-Mury, Carolina |
author_sort | Padrón, Alejandro |
collection | PubMed |
description | BACKGROUND: Genome sequencing of Anopheles gambiae was completed more than ten years ago and has accelerated research on malaria transmission. However, annotation needs to be refined and verified experimentally, as most predicted transcripts have been identified by comparative analysis with genomes from other species. The mosquito midgut—the first organ to interact with Plasmodium parasites—mounts effective antiplasmodial responses that limit parasite survival and disease transmission. High-throughput Illumina sequencing of the midgut transcriptome was used to identify new genes and transcripts, contributing to the refinement of An. gambiae genome annotation. RESULTS: We sequenced ~223 million reads from An. gambiae midgut cDNA libraries generated from susceptible (G3) and refractory (L35) mosquito strains. Mosquitoes were infected with either Plasmodium berghei or Plasmodium falciparum, and midguts were collected after the first or second Plasmodium infection. In total, 22,889 unique midgut transcript models were generated from both An. gambiae strain sequences combined, and 76% are potentially novel. Of these novel transcripts, 49.5% aligned with annotated genes and appear to be isoforms or pre-mRNAs of reference transcripts, while 50.5% mapped to regions between annotated genes and represent novel intergenic transcripts (NITs). Predicted models were validated for midgut expression using qRT-PCR and microarray analysis, and novel isoforms were confirmed by sequencing predicted intron-exon boundaries. Coding potential analysis revealed that 43% of total midgut transcripts appear to be long non-coding RNA (lncRNA), and functional annotation of NITs showed that 68% had no homology to current databases from other species. Reads were also analyzed using de novo assembly and predicted transcripts compared with genome mapping-based models. Finally, variant analysis of G3 and L35 midgut transcripts detected 160,742 variants with respect to the An. gambiae PEST genome, and 74% were new variants. Intergenic transcripts had a higher frequency of variation compared with non-intergenic transcripts. CONCLUSION: This in-depth Illumina sequencing and assembly of the An. gambiae midgut transcriptome doubled the number of known transcripts and tripled the number of variants known in this mosquito species. It also revealed existence of a large number of lncRNA and opens new possibilities for investigating the biological function of many newly discovered transcripts. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-636) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4131051 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41310512014-08-18 In depth annotation of the Anopheles gambiae mosquito midgut transcriptome Padrón, Alejandro Molina-Cruz, Alvaro Quinones, Mariam Ribeiro, José MC Ramphul, Urvashi Rodrigues, Janneth Shen, Kui Haile, Ashley Ramirez, José Luis Barillas-Mury, Carolina BMC Genomics Research Article BACKGROUND: Genome sequencing of Anopheles gambiae was completed more than ten years ago and has accelerated research on malaria transmission. However, annotation needs to be refined and verified experimentally, as most predicted transcripts have been identified by comparative analysis with genomes from other species. The mosquito midgut—the first organ to interact with Plasmodium parasites—mounts effective antiplasmodial responses that limit parasite survival and disease transmission. High-throughput Illumina sequencing of the midgut transcriptome was used to identify new genes and transcripts, contributing to the refinement of An. gambiae genome annotation. RESULTS: We sequenced ~223 million reads from An. gambiae midgut cDNA libraries generated from susceptible (G3) and refractory (L35) mosquito strains. Mosquitoes were infected with either Plasmodium berghei or Plasmodium falciparum, and midguts were collected after the first or second Plasmodium infection. In total, 22,889 unique midgut transcript models were generated from both An. gambiae strain sequences combined, and 76% are potentially novel. Of these novel transcripts, 49.5% aligned with annotated genes and appear to be isoforms or pre-mRNAs of reference transcripts, while 50.5% mapped to regions between annotated genes and represent novel intergenic transcripts (NITs). Predicted models were validated for midgut expression using qRT-PCR and microarray analysis, and novel isoforms were confirmed by sequencing predicted intron-exon boundaries. Coding potential analysis revealed that 43% of total midgut transcripts appear to be long non-coding RNA (lncRNA), and functional annotation of NITs showed that 68% had no homology to current databases from other species. Reads were also analyzed using de novo assembly and predicted transcripts compared with genome mapping-based models. Finally, variant analysis of G3 and L35 midgut transcripts detected 160,742 variants with respect to the An. gambiae PEST genome, and 74% were new variants. Intergenic transcripts had a higher frequency of variation compared with non-intergenic transcripts. CONCLUSION: This in-depth Illumina sequencing and assembly of the An. gambiae midgut transcriptome doubled the number of known transcripts and tripled the number of variants known in this mosquito species. It also revealed existence of a large number of lncRNA and opens new possibilities for investigating the biological function of many newly discovered transcripts. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-636) contains supplementary material, which is available to authorized users. BioMed Central 2014-07-29 /pmc/articles/PMC4131051/ /pubmed/25073905 http://dx.doi.org/10.1186/1471-2164-15-636 Text en © Padrón et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Padrón, Alejandro Molina-Cruz, Alvaro Quinones, Mariam Ribeiro, José MC Ramphul, Urvashi Rodrigues, Janneth Shen, Kui Haile, Ashley Ramirez, José Luis Barillas-Mury, Carolina In depth annotation of the Anopheles gambiae mosquito midgut transcriptome |
title | In depth annotation of the Anopheles gambiae mosquito midgut transcriptome |
title_full | In depth annotation of the Anopheles gambiae mosquito midgut transcriptome |
title_fullStr | In depth annotation of the Anopheles gambiae mosquito midgut transcriptome |
title_full_unstemmed | In depth annotation of the Anopheles gambiae mosquito midgut transcriptome |
title_short | In depth annotation of the Anopheles gambiae mosquito midgut transcriptome |
title_sort | in depth annotation of the anopheles gambiae mosquito midgut transcriptome |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4131051/ https://www.ncbi.nlm.nih.gov/pubmed/25073905 http://dx.doi.org/10.1186/1471-2164-15-636 |
work_keys_str_mv | AT padronalejandro indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome AT molinacruzalvaro indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome AT quinonesmariam indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome AT ribeirojosemc indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome AT ramphulurvashi indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome AT rodriguesjanneth indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome AT shenkui indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome AT haileashley indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome AT ramirezjoseluis indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome AT barillasmurycarolina indepthannotationoftheanophelesgambiaemosquitomidguttranscriptome |