Cargando…
Allelic Gene Structure Variations in Anopheles gambiae Mosquitoes
BACKGROUND: Allelic gene structure variations and alternative splicing are responsible for transcript structure variations. More than 75% of human genes have structural isoforms of transcripts, but to date few studies have been conducted to verify the alternative splicing systematically. METHODOLOGY...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2873427/ https://www.ncbi.nlm.nih.gov/pubmed/20502664 http://dx.doi.org/10.1371/journal.pone.0010699 |
_version_ | 1782181337908641792 |
---|---|
author | Li, Jun Ribeiro, Jose M. C. Yan, Guiyun |
author_facet | Li, Jun Ribeiro, Jose M. C. Yan, Guiyun |
author_sort | Li, Jun |
collection | PubMed |
description | BACKGROUND: Allelic gene structure variations and alternative splicing are responsible for transcript structure variations. More than 75% of human genes have structural isoforms of transcripts, but to date few studies have been conducted to verify the alternative splicing systematically. METHODOLOGY/PRINCIPAL FINDINGS: The present study used expressed sequence tags (ESTs) and EST tagged SNP patterns to examine the transcript structure variations resulting from allelic gene structure variations in the major human malaria vector, Anopheles gambiae. About 80% of 236,004 available A. gambiae ESTs were successfully aligned to A. gambiae reference genomes. More than 2,340 transcript structure variation events were detected. Because the current A. gambiae annotation is incomplete, we re-annotated the A. gambiae genome with an A. gambiae-specific gene model so that the effect of variations on gene coding could be better evaluated. A total of 15,962 genes were predicted. Among them, 3,873 were novel genes and 12,089 were previously identified genes. The gene completion rate improved from 60% to 84%. Based on EST support, 82.5% of gene structures were predicted correctly. In light of the new annotation, we found that ∼78% of transcript structure variations were located within the coding sequence (CDS) regions, and >65% of variations in the CDS regions have the same open-reading-frame. The association between transcript structure isoforms and SNPs indicated that more than 28% of transcript structure variation events were contributed by different gene alleles in A. gambiae. CONCLUSIONS/SIGNIFICANCE: We successfully expanded the A. gambiae genome annotation. We predicted and analyzed transcript structure variations in A. gambiae and found that allelic gene structure variation plays a major role in transcript diversity in this important human malaria vector. |
format | Text |
id | pubmed-2873427 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-28734272010-05-25 Allelic Gene Structure Variations in Anopheles gambiae Mosquitoes Li, Jun Ribeiro, Jose M. C. Yan, Guiyun PLoS One Research Article BACKGROUND: Allelic gene structure variations and alternative splicing are responsible for transcript structure variations. More than 75% of human genes have structural isoforms of transcripts, but to date few studies have been conducted to verify the alternative splicing systematically. METHODOLOGY/PRINCIPAL FINDINGS: The present study used expressed sequence tags (ESTs) and EST tagged SNP patterns to examine the transcript structure variations resulting from allelic gene structure variations in the major human malaria vector, Anopheles gambiae. About 80% of 236,004 available A. gambiae ESTs were successfully aligned to A. gambiae reference genomes. More than 2,340 transcript structure variation events were detected. Because the current A. gambiae annotation is incomplete, we re-annotated the A. gambiae genome with an A. gambiae-specific gene model so that the effect of variations on gene coding could be better evaluated. A total of 15,962 genes were predicted. Among them, 3,873 were novel genes and 12,089 were previously identified genes. The gene completion rate improved from 60% to 84%. Based on EST support, 82.5% of gene structures were predicted correctly. In light of the new annotation, we found that ∼78% of transcript structure variations were located within the coding sequence (CDS) regions, and >65% of variations in the CDS regions have the same open-reading-frame. The association between transcript structure isoforms and SNPs indicated that more than 28% of transcript structure variation events were contributed by different gene alleles in A. gambiae. CONCLUSIONS/SIGNIFICANCE: We successfully expanded the A. gambiae genome annotation. We predicted and analyzed transcript structure variations in A. gambiae and found that allelic gene structure variation plays a major role in transcript diversity in this important human malaria vector. Public Library of Science 2010-05-19 /pmc/articles/PMC2873427/ /pubmed/20502664 http://dx.doi.org/10.1371/journal.pone.0010699 Text en This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. |
spellingShingle | Research Article Li, Jun Ribeiro, Jose M. C. Yan, Guiyun Allelic Gene Structure Variations in Anopheles gambiae Mosquitoes |
title | Allelic Gene Structure Variations in Anopheles gambiae Mosquitoes |
title_full | Allelic Gene Structure Variations in Anopheles gambiae Mosquitoes |
title_fullStr | Allelic Gene Structure Variations in Anopheles gambiae Mosquitoes |
title_full_unstemmed | Allelic Gene Structure Variations in Anopheles gambiae Mosquitoes |
title_short | Allelic Gene Structure Variations in Anopheles gambiae Mosquitoes |
title_sort | allelic gene structure variations in anopheles gambiae mosquitoes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2873427/ https://www.ncbi.nlm.nih.gov/pubmed/20502664 http://dx.doi.org/10.1371/journal.pone.0010699 |
work_keys_str_mv | AT lijun allelicgenestructurevariationsinanophelesgambiaemosquitoes AT ribeirojosemc allelicgenestructurevariationsinanophelesgambiaemosquitoes AT yanguiyun allelicgenestructurevariationsinanophelesgambiaemosquitoes |