Cargando…

Allelic Gene Structure Variations in Anopheles gambiae Mosquitoes

BACKGROUND: Allelic gene structure variations and alternative splicing are responsible for transcript structure variations. More than 75% of human genes have structural isoforms of transcripts, but to date few studies have been conducted to verify the alternative splicing systematically. METHODOLOGY...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Jun, Ribeiro, Jose M. C., Yan, Guiyun
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2873427/
https://www.ncbi.nlm.nih.gov/pubmed/20502664
http://dx.doi.org/10.1371/journal.pone.0010699
_version_ 1782181337908641792
author Li, Jun
Ribeiro, Jose M. C.
Yan, Guiyun
author_facet Li, Jun
Ribeiro, Jose M. C.
Yan, Guiyun
author_sort Li, Jun
collection PubMed
description BACKGROUND: Allelic gene structure variations and alternative splicing are responsible for transcript structure variations. More than 75% of human genes have structural isoforms of transcripts, but to date few studies have been conducted to verify the alternative splicing systematically. METHODOLOGY/PRINCIPAL FINDINGS: The present study used expressed sequence tags (ESTs) and EST tagged SNP patterns to examine the transcript structure variations resulting from allelic gene structure variations in the major human malaria vector, Anopheles gambiae. About 80% of 236,004 available A. gambiae ESTs were successfully aligned to A. gambiae reference genomes. More than 2,340 transcript structure variation events were detected. Because the current A. gambiae annotation is incomplete, we re-annotated the A. gambiae genome with an A. gambiae-specific gene model so that the effect of variations on gene coding could be better evaluated. A total of 15,962 genes were predicted. Among them, 3,873 were novel genes and 12,089 were previously identified genes. The gene completion rate improved from 60% to 84%. Based on EST support, 82.5% of gene structures were predicted correctly. In light of the new annotation, we found that ∼78% of transcript structure variations were located within the coding sequence (CDS) regions, and >65% of variations in the CDS regions have the same open-reading-frame. The association between transcript structure isoforms and SNPs indicated that more than 28% of transcript structure variation events were contributed by different gene alleles in A. gambiae. CONCLUSIONS/SIGNIFICANCE: We successfully expanded the A. gambiae genome annotation. We predicted and analyzed transcript structure variations in A. gambiae and found that allelic gene structure variation plays a major role in transcript diversity in this important human malaria vector.
format Text
id pubmed-2873427
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28734272010-05-25 Allelic Gene Structure Variations in Anopheles gambiae Mosquitoes Li, Jun Ribeiro, Jose M. C. Yan, Guiyun PLoS One Research Article BACKGROUND: Allelic gene structure variations and alternative splicing are responsible for transcript structure variations. More than 75% of human genes have structural isoforms of transcripts, but to date few studies have been conducted to verify the alternative splicing systematically. METHODOLOGY/PRINCIPAL FINDINGS: The present study used expressed sequence tags (ESTs) and EST tagged SNP patterns to examine the transcript structure variations resulting from allelic gene structure variations in the major human malaria vector, Anopheles gambiae. About 80% of 236,004 available A. gambiae ESTs were successfully aligned to A. gambiae reference genomes. More than 2,340 transcript structure variation events were detected. Because the current A. gambiae annotation is incomplete, we re-annotated the A. gambiae genome with an A. gambiae-specific gene model so that the effect of variations on gene coding could be better evaluated. A total of 15,962 genes were predicted. Among them, 3,873 were novel genes and 12,089 were previously identified genes. The gene completion rate improved from 60% to 84%. Based on EST support, 82.5% of gene structures were predicted correctly. In light of the new annotation, we found that ∼78% of transcript structure variations were located within the coding sequence (CDS) regions, and >65% of variations in the CDS regions have the same open-reading-frame. The association between transcript structure isoforms and SNPs indicated that more than 28% of transcript structure variation events were contributed by different gene alleles in A. gambiae. CONCLUSIONS/SIGNIFICANCE: We successfully expanded the A. gambiae genome annotation. We predicted and analyzed transcript structure variations in A. gambiae and found that allelic gene structure variation plays a major role in transcript diversity in this important human malaria vector. Public Library of Science 2010-05-19 /pmc/articles/PMC2873427/ /pubmed/20502664 http://dx.doi.org/10.1371/journal.pone.0010699 Text en This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Research Article
Li, Jun
Ribeiro, Jose M. C.
Yan, Guiyun
Allelic Gene Structure Variations in Anopheles gambiae Mosquitoes
title Allelic Gene Structure Variations in Anopheles gambiae Mosquitoes
title_full Allelic Gene Structure Variations in Anopheles gambiae Mosquitoes
title_fullStr Allelic Gene Structure Variations in Anopheles gambiae Mosquitoes
title_full_unstemmed Allelic Gene Structure Variations in Anopheles gambiae Mosquitoes
title_short Allelic Gene Structure Variations in Anopheles gambiae Mosquitoes
title_sort allelic gene structure variations in anopheles gambiae mosquitoes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2873427/
https://www.ncbi.nlm.nih.gov/pubmed/20502664
http://dx.doi.org/10.1371/journal.pone.0010699
work_keys_str_mv AT lijun allelicgenestructurevariationsinanophelesgambiaemosquitoes
AT ribeirojosemc allelicgenestructurevariationsinanophelesgambiaemosquitoes
AT yanguiyun allelicgenestructurevariationsinanophelesgambiaemosquitoes