Cargando…

Integrated Sequence-Structure Motifs Suffice to Identify microRNA Precursors

BACKGROUND: Upwards of 1200 miRNA loci have hitherto been annotated in the human genome. The specific features defining a miRNA precursor and deciding its recognition and subsequent processing are not yet exhaustively described and miRNA loci can thus not be computationally identified with sufficien...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Xiuqin, He, Shunmin, Skogerbø, Geir, Gong, Fuzhou, Chen, Runsheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3305290/
https://www.ncbi.nlm.nih.gov/pubmed/22438883
http://dx.doi.org/10.1371/journal.pone.0032797
_version_ 1782227036369059840
author Liu, Xiuqin
He, Shunmin
Skogerbø, Geir
Gong, Fuzhou
Chen, Runsheng
author_facet Liu, Xiuqin
He, Shunmin
Skogerbø, Geir
Gong, Fuzhou
Chen, Runsheng
author_sort Liu, Xiuqin
collection PubMed
description BACKGROUND: Upwards of 1200 miRNA loci have hitherto been annotated in the human genome. The specific features defining a miRNA precursor and deciding its recognition and subsequent processing are not yet exhaustively described and miRNA loci can thus not be computationally identified with sufficient confidence. RESULTS: We rendered pre-miRNA and non-pre-miRNA hairpins as strings of integrated sequence-structure information, and used the software Teiresias to identify sequence-structure motifs (ss-motifs) of variable length in these data sets. Using only ss-motifs as features in a Support Vector Machine (SVM) algorithm for pre-miRNA identification achieved 99.2% specificity and 97.6% sensitivity on a human test data set, which is comparable to previously published algorithms employing combinations of sequence-structure and additional features. Further analysis of the ss-motif information contents revealed strongly significant deviations from those of the respective training sets, revealing important potential clues as to how the sequence and structural information of RNA hairpins are utilized by the miRNA processing apparatus. CONCLUSION: Integrated sequence-structure motifs of variable length apparently capture nearly all information required to distinguish miRNA precursors from other stem-loop structures.
format Online
Article
Text
id pubmed-3305290
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33052902012-03-21 Integrated Sequence-Structure Motifs Suffice to Identify microRNA Precursors Liu, Xiuqin He, Shunmin Skogerbø, Geir Gong, Fuzhou Chen, Runsheng PLoS One Research Article BACKGROUND: Upwards of 1200 miRNA loci have hitherto been annotated in the human genome. The specific features defining a miRNA precursor and deciding its recognition and subsequent processing are not yet exhaustively described and miRNA loci can thus not be computationally identified with sufficient confidence. RESULTS: We rendered pre-miRNA and non-pre-miRNA hairpins as strings of integrated sequence-structure information, and used the software Teiresias to identify sequence-structure motifs (ss-motifs) of variable length in these data sets. Using only ss-motifs as features in a Support Vector Machine (SVM) algorithm for pre-miRNA identification achieved 99.2% specificity and 97.6% sensitivity on a human test data set, which is comparable to previously published algorithms employing combinations of sequence-structure and additional features. Further analysis of the ss-motif information contents revealed strongly significant deviations from those of the respective training sets, revealing important potential clues as to how the sequence and structural information of RNA hairpins are utilized by the miRNA processing apparatus. CONCLUSION: Integrated sequence-structure motifs of variable length apparently capture nearly all information required to distinguish miRNA precursors from other stem-loop structures. Public Library of Science 2012-03-15 /pmc/articles/PMC3305290/ /pubmed/22438883 http://dx.doi.org/10.1371/journal.pone.0032797 Text en Liu et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Liu, Xiuqin
He, Shunmin
Skogerbø, Geir
Gong, Fuzhou
Chen, Runsheng
Integrated Sequence-Structure Motifs Suffice to Identify microRNA Precursors
title Integrated Sequence-Structure Motifs Suffice to Identify microRNA Precursors
title_full Integrated Sequence-Structure Motifs Suffice to Identify microRNA Precursors
title_fullStr Integrated Sequence-Structure Motifs Suffice to Identify microRNA Precursors
title_full_unstemmed Integrated Sequence-Structure Motifs Suffice to Identify microRNA Precursors
title_short Integrated Sequence-Structure Motifs Suffice to Identify microRNA Precursors
title_sort integrated sequence-structure motifs suffice to identify microrna precursors
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3305290/
https://www.ncbi.nlm.nih.gov/pubmed/22438883
http://dx.doi.org/10.1371/journal.pone.0032797
work_keys_str_mv AT liuxiuqin integratedsequencestructuremotifssufficetoidentifymicrornaprecursors
AT heshunmin integratedsequencestructuremotifssufficetoidentifymicrornaprecursors
AT skogerbøgeir integratedsequencestructuremotifssufficetoidentifymicrornaprecursors
AT gongfuzhou integratedsequencestructuremotifssufficetoidentifymicrornaprecursors
AT chenrunsheng integratedsequencestructuremotifssufficetoidentifymicrornaprecursors