Cargando…
A Bioinformatics-Based Alternative mRNA Splicing Code that May Explain Some Disease Mutations Is Conserved in Animals
Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional typ...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5387049/ https://www.ncbi.nlm.nih.gov/pubmed/28443131 http://dx.doi.org/10.3389/fgene.2017.00038 |
_version_ | 1782520867357458432 |
---|---|
author | Qu, Wen Cingolani, Pablo Zeeberg, Barry R. Ruden, Douglas M. |
author_facet | Qu, Wen Cingolani, Pablo Zeeberg, Barry R. Ruden, Douglas M. |
author_sort | Qu, Wen |
collection | PubMed |
description | Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5′ or 3′ splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an “alternative-splicing code” in introns and flanking exon sequences, analogous to the genetic code, that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved from 50 plant to animal species. We also noticed the set of introns within a gene usually share the same splicing codes, thus arguing that one sub-type of splicesosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs. |
format | Online Article Text |
id | pubmed-5387049 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-53870492017-04-25 A Bioinformatics-Based Alternative mRNA Splicing Code that May Explain Some Disease Mutations Is Conserved in Animals Qu, Wen Cingolani, Pablo Zeeberg, Barry R. Ruden, Douglas M. Front Genet Genetics Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5′ or 3′ splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an “alternative-splicing code” in introns and flanking exon sequences, analogous to the genetic code, that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved from 50 plant to animal species. We also noticed the set of introns within a gene usually share the same splicing codes, thus arguing that one sub-type of splicesosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs. Frontiers Media S.A. 2017-04-11 /pmc/articles/PMC5387049/ /pubmed/28443131 http://dx.doi.org/10.3389/fgene.2017.00038 Text en Copyright © 2017 Qu, Cingolani, Zeeberg and Ruden. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Qu, Wen Cingolani, Pablo Zeeberg, Barry R. Ruden, Douglas M. A Bioinformatics-Based Alternative mRNA Splicing Code that May Explain Some Disease Mutations Is Conserved in Animals |
title | A Bioinformatics-Based Alternative mRNA Splicing Code that May Explain Some Disease Mutations Is Conserved in Animals |
title_full | A Bioinformatics-Based Alternative mRNA Splicing Code that May Explain Some Disease Mutations Is Conserved in Animals |
title_fullStr | A Bioinformatics-Based Alternative mRNA Splicing Code that May Explain Some Disease Mutations Is Conserved in Animals |
title_full_unstemmed | A Bioinformatics-Based Alternative mRNA Splicing Code that May Explain Some Disease Mutations Is Conserved in Animals |
title_short | A Bioinformatics-Based Alternative mRNA Splicing Code that May Explain Some Disease Mutations Is Conserved in Animals |
title_sort | bioinformatics-based alternative mrna splicing code that may explain some disease mutations is conserved in animals |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5387049/ https://www.ncbi.nlm.nih.gov/pubmed/28443131 http://dx.doi.org/10.3389/fgene.2017.00038 |
work_keys_str_mv | AT quwen abioinformaticsbasedalternativemrnasplicingcodethatmayexplainsomediseasemutationsisconservedinanimals AT cingolanipablo abioinformaticsbasedalternativemrnasplicingcodethatmayexplainsomediseasemutationsisconservedinanimals AT zeebergbarryr abioinformaticsbasedalternativemrnasplicingcodethatmayexplainsomediseasemutationsisconservedinanimals AT rudendouglasm abioinformaticsbasedalternativemrnasplicingcodethatmayexplainsomediseasemutationsisconservedinanimals AT quwen bioinformaticsbasedalternativemrnasplicingcodethatmayexplainsomediseasemutationsisconservedinanimals AT cingolanipablo bioinformaticsbasedalternativemrnasplicingcodethatmayexplainsomediseasemutationsisconservedinanimals AT zeebergbarryr bioinformaticsbasedalternativemrnasplicingcodethatmayexplainsomediseasemutationsisconservedinanimals AT rudendouglasm bioinformaticsbasedalternativemrnasplicingcodethatmayexplainsomediseasemutationsisconservedinanimals |