Cargando…

Identification of introns harboring functional sequence elements through positional conservation

Many human introns carry out a function, in the sense that they are critical to maintain normal cellular activity. Their identification is fundamental to understanding cellular processes and disease. However, being noncoding elements, such functional introns are poorly predicted based on traditional...

Descripción completa

Detalles Bibliográficos
Autores principales: Chorev, Michal, Joseph Bekker, Alan, Goldberger, Jacob, Carmel, Liran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5482813/
https://www.ncbi.nlm.nih.gov/pubmed/28646210
http://dx.doi.org/10.1038/s41598-017-04476-0
_version_ 1783245632006258688
author Chorev, Michal
Joseph Bekker, Alan
Goldberger, Jacob
Carmel, Liran
author_facet Chorev, Michal
Joseph Bekker, Alan
Goldberger, Jacob
Carmel, Liran
author_sort Chorev, Michal
collection PubMed
description Many human introns carry out a function, in the sense that they are critical to maintain normal cellular activity. Their identification is fundamental to understanding cellular processes and disease. However, being noncoding elements, such functional introns are poorly predicted based on traditional approaches of sequence and structure conservation. Here, we generated a dataset of human functional introns that carry out different types of functions. We showed that functional introns share common characteristics, such as higher positional conservation along the coding sequence and reduced loss rates, regardless of their specific function. A unique property of the data is that if an intron is unknown to be functional, it still does not mean that it is indeed non-functional. We developed a probabilistic framework that explicitly accounts for this unique property, and predicts which specific human introns are functional. We show that we successfully predict function even when the algorithm is trained on introns with a different type of function. This ability has many implications in studying regulatory networks, gene regulation, the effect of mutations outside exons on human disease, and on our general understanding of intron evolution and their functional exaptation in mammals.
format Online
Article
Text
id pubmed-5482813
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-54828132017-06-26 Identification of introns harboring functional sequence elements through positional conservation Chorev, Michal Joseph Bekker, Alan Goldberger, Jacob Carmel, Liran Sci Rep Article Many human introns carry out a function, in the sense that they are critical to maintain normal cellular activity. Their identification is fundamental to understanding cellular processes and disease. However, being noncoding elements, such functional introns are poorly predicted based on traditional approaches of sequence and structure conservation. Here, we generated a dataset of human functional introns that carry out different types of functions. We showed that functional introns share common characteristics, such as higher positional conservation along the coding sequence and reduced loss rates, regardless of their specific function. A unique property of the data is that if an intron is unknown to be functional, it still does not mean that it is indeed non-functional. We developed a probabilistic framework that explicitly accounts for this unique property, and predicts which specific human introns are functional. We show that we successfully predict function even when the algorithm is trained on introns with a different type of function. This ability has many implications in studying regulatory networks, gene regulation, the effect of mutations outside exons on human disease, and on our general understanding of intron evolution and their functional exaptation in mammals. Nature Publishing Group UK 2017-06-23 /pmc/articles/PMC5482813/ /pubmed/28646210 http://dx.doi.org/10.1038/s41598-017-04476-0 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Chorev, Michal
Joseph Bekker, Alan
Goldberger, Jacob
Carmel, Liran
Identification of introns harboring functional sequence elements through positional conservation
title Identification of introns harboring functional sequence elements through positional conservation
title_full Identification of introns harboring functional sequence elements through positional conservation
title_fullStr Identification of introns harboring functional sequence elements through positional conservation
title_full_unstemmed Identification of introns harboring functional sequence elements through positional conservation
title_short Identification of introns harboring functional sequence elements through positional conservation
title_sort identification of introns harboring functional sequence elements through positional conservation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5482813/
https://www.ncbi.nlm.nih.gov/pubmed/28646210
http://dx.doi.org/10.1038/s41598-017-04476-0
work_keys_str_mv AT chorevmichal identificationofintronsharboringfunctionalsequenceelementsthroughpositionalconservation
AT josephbekkeralan identificationofintronsharboringfunctionalsequenceelementsthroughpositionalconservation
AT goldbergerjacob identificationofintronsharboringfunctionalsequenceelementsthroughpositionalconservation
AT carmelliran identificationofintronsharboringfunctionalsequenceelementsthroughpositionalconservation