Cargando…

Widespread purifying selection on RNA structure in mammals

Evolutionarily conserved RNA secondary structures are a robust indicator of purifying selection and, consequently, molecular function. Evaluating their genome-wide occurrence through comparative genomics has consistently been plagued by high false-positive rates and divergent predictions. We present...

Descripción completa

Detalles Bibliográficos
Autores principales: Smith, Martin A., Gesell, Tanja, Stadler, Peter F., Mattick, John S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3783177/
https://www.ncbi.nlm.nih.gov/pubmed/23847102
http://dx.doi.org/10.1093/nar/gkt596
_version_ 1782285637170233344
author Smith, Martin A.
Gesell, Tanja
Stadler, Peter F.
Mattick, John S.
author_facet Smith, Martin A.
Gesell, Tanja
Stadler, Peter F.
Mattick, John S.
author_sort Smith, Martin A.
collection PubMed
description Evolutionarily conserved RNA secondary structures are a robust indicator of purifying selection and, consequently, molecular function. Evaluating their genome-wide occurrence through comparative genomics has consistently been plagued by high false-positive rates and divergent predictions. We present a novel benchmarking pipeline aimed at calibrating the precision of genome-wide scans for consensus RNA structure prediction. The benchmarking data obtained from two refined structure prediction algorithms, RNAz and SISSIz, were then analyzed to fine-tune the parameters of an optimized workflow for genomic sliding window screens. When applied to consistency-based multiple genome alignments of 35 mammals, our approach confidently identifies >4 million evolutionarily constrained RNA structures using a conservative sensitivity threshold that entails historically low false discovery rates for such analyses (5–22%). These predictions comprise 13.6% of the human genome, 88% of which fall outside any known sequence-constrained element, suggesting that a large proportion of the mammalian genome is functional. As an example, our findings identify both known and novel conserved RNA structure motifs in the long noncoding RNA MALAT1. This study provides an extensive set of functional transcriptomic annotations that will assist researchers in uncovering the precise mechanisms underlying the developmental ontologies of higher eukaryotes.
format Online
Article
Text
id pubmed-3783177
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-37831772013-09-30 Widespread purifying selection on RNA structure in mammals Smith, Martin A. Gesell, Tanja Stadler, Peter F. Mattick, John S. Nucleic Acids Res Genomics Evolutionarily conserved RNA secondary structures are a robust indicator of purifying selection and, consequently, molecular function. Evaluating their genome-wide occurrence through comparative genomics has consistently been plagued by high false-positive rates and divergent predictions. We present a novel benchmarking pipeline aimed at calibrating the precision of genome-wide scans for consensus RNA structure prediction. The benchmarking data obtained from two refined structure prediction algorithms, RNAz and SISSIz, were then analyzed to fine-tune the parameters of an optimized workflow for genomic sliding window screens. When applied to consistency-based multiple genome alignments of 35 mammals, our approach confidently identifies >4 million evolutionarily constrained RNA structures using a conservative sensitivity threshold that entails historically low false discovery rates for such analyses (5–22%). These predictions comprise 13.6% of the human genome, 88% of which fall outside any known sequence-constrained element, suggesting that a large proportion of the mammalian genome is functional. As an example, our findings identify both known and novel conserved RNA structure motifs in the long noncoding RNA MALAT1. This study provides an extensive set of functional transcriptomic annotations that will assist researchers in uncovering the precise mechanisms underlying the developmental ontologies of higher eukaryotes. Oxford University Press 2013-09 2013-07-11 /pmc/articles/PMC3783177/ /pubmed/23847102 http://dx.doi.org/10.1093/nar/gkt596 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genomics
Smith, Martin A.
Gesell, Tanja
Stadler, Peter F.
Mattick, John S.
Widespread purifying selection on RNA structure in mammals
title Widespread purifying selection on RNA structure in mammals
title_full Widespread purifying selection on RNA structure in mammals
title_fullStr Widespread purifying selection on RNA structure in mammals
title_full_unstemmed Widespread purifying selection on RNA structure in mammals
title_short Widespread purifying selection on RNA structure in mammals
title_sort widespread purifying selection on rna structure in mammals
topic Genomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3783177/
https://www.ncbi.nlm.nih.gov/pubmed/23847102
http://dx.doi.org/10.1093/nar/gkt596
work_keys_str_mv AT smithmartina widespreadpurifyingselectiononrnastructureinmammals
AT geselltanja widespreadpurifyingselectiononrnastructureinmammals
AT stadlerpeterf widespreadpurifyingselectiononrnastructureinmammals
AT mattickjohns widespreadpurifyingselectiononrnastructureinmammals