Cargando…

Recognition of Unknown Conserved Alternatively Spliced Exons

The split structure of most mammalian protein-coding genes allows for the potential to produce multiple different mRNA and protein isoforms from a single gene locus through the process of alternative splicing (AS). We propose a computational approach called UNCOVER based on a pair hidden Markov mode...

Descripción completa

Detalles Bibliográficos
Autores principales: Ohler, Uwe, Shomron, Noam, Burge, Christopher B
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1185642/
https://www.ncbi.nlm.nih.gov/pubmed/16110330
http://dx.doi.org/10.1371/journal.pcbi.0010015
_version_ 1782124745445081088
author Ohler, Uwe
Shomron, Noam
Burge, Christopher B
author_facet Ohler, Uwe
Shomron, Noam
Burge, Christopher B
author_sort Ohler, Uwe
collection PubMed
description The split structure of most mammalian protein-coding genes allows for the potential to produce multiple different mRNA and protein isoforms from a single gene locus through the process of alternative splicing (AS). We propose a computational approach called UNCOVER based on a pair hidden Markov model to discover conserved coding exonic sequences subject to AS that have so far gone undetected. Applying UNCOVER to orthologous introns of known human and mouse genes predicts skipped exons or retained introns present in both species, while discriminating them from conserved noncoding sequences. The accuracy of the model is evaluated on a curated set of genes with known conserved AS events. The prediction of skipped exons in the ~1% of the human genome represented by the ENCODE regions leads to more than 50 new exon candidates. Five novel predicted AS exons were validated by RT-PCR and sequencing analysis of 15 introns with strong UNCOVER predictions and lacking EST evidence. These results imply that a considerable number of conserved exonic sequences and associated isoforms are still completely missing from the current annotation of known genes. UNCOVER also identifies a small number of candidates for conserved intron retention.
format Text
id pubmed-1185642
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-11856422005-08-17 Recognition of Unknown Conserved Alternatively Spliced Exons Ohler, Uwe Shomron, Noam Burge, Christopher B PLoS Comput Biol Research Article The split structure of most mammalian protein-coding genes allows for the potential to produce multiple different mRNA and protein isoforms from a single gene locus through the process of alternative splicing (AS). We propose a computational approach called UNCOVER based on a pair hidden Markov model to discover conserved coding exonic sequences subject to AS that have so far gone undetected. Applying UNCOVER to orthologous introns of known human and mouse genes predicts skipped exons or retained introns present in both species, while discriminating them from conserved noncoding sequences. The accuracy of the model is evaluated on a curated set of genes with known conserved AS events. The prediction of skipped exons in the ~1% of the human genome represented by the ENCODE regions leads to more than 50 new exon candidates. Five novel predicted AS exons were validated by RT-PCR and sequencing analysis of 15 introns with strong UNCOVER predictions and lacking EST evidence. These results imply that a considerable number of conserved exonic sequences and associated isoforms are still completely missing from the current annotation of known genes. UNCOVER also identifies a small number of candidates for conserved intron retention. Public Library of Science 2005-07 2005-07-08 /pmc/articles/PMC1185642/ /pubmed/16110330 http://dx.doi.org/10.1371/journal.pcbi.0010015 Text en Copyright: © 2005 Ohler et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ohler, Uwe
Shomron, Noam
Burge, Christopher B
Recognition of Unknown Conserved Alternatively Spliced Exons
title Recognition of Unknown Conserved Alternatively Spliced Exons
title_full Recognition of Unknown Conserved Alternatively Spliced Exons
title_fullStr Recognition of Unknown Conserved Alternatively Spliced Exons
title_full_unstemmed Recognition of Unknown Conserved Alternatively Spliced Exons
title_short Recognition of Unknown Conserved Alternatively Spliced Exons
title_sort recognition of unknown conserved alternatively spliced exons
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1185642/
https://www.ncbi.nlm.nih.gov/pubmed/16110330
http://dx.doi.org/10.1371/journal.pcbi.0010015
work_keys_str_mv AT ohleruwe recognitionofunknownconservedalternativelysplicedexons
AT shomronnoam recognitionofunknownconservedalternativelysplicedexons
AT burgechristopherb recognitionofunknownconservedalternativelysplicedexons