Cargando…

A conditional random fields method for RNA sequence–structure relationship modeling and conformation sampling

Accurate tertiary structures are very important for the functional study of non-coding RNA molecules. However, predicting RNA tertiary structures is extremely challenging, because of a large conformation space to be explored and lack of an accurate scoring function differentiating the native structu...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Zhiyong, Xu, Jinbo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117333/
https://www.ncbi.nlm.nih.gov/pubmed/21685058
http://dx.doi.org/10.1093/bioinformatics/btr232
_version_ 1782206317079822336
author Wang, Zhiyong
Xu, Jinbo
author_facet Wang, Zhiyong
Xu, Jinbo
author_sort Wang, Zhiyong
collection PubMed
description Accurate tertiary structures are very important for the functional study of non-coding RNA molecules. However, predicting RNA tertiary structures is extremely challenging, because of a large conformation space to be explored and lack of an accurate scoring function differentiating the native structure from decoys. The fragment-based conformation sampling method (e.g. FARNA) bears shortcomings that the limited size of a fragment library makes it infeasible to represent all possible conformations well. A recent dynamic Bayesian network method, BARNACLE, overcomes the issue of fragment assembly. In addition, neither of these methods makes use of sequence information in sampling conformations. Here, we present a new probabilistic graphical model, conditional random fields (CRFs), to model RNA sequence–structure relationship, which enables us to accurately estimate the probability of an RNA conformation from sequence. Coupled with a novel tree-guided sampling scheme, our CRF model is then applied to RNA conformation sampling. Experimental results show that our CRF method can model RNA sequence–structure relationship well and sequence information is important for conformation sampling. Our method, named as TreeFolder, generates a much higher percentage of native-like decoys than FARNA and BARNACLE, although we use the same simple energy function as BARNACLE. Contact: zywang@ttic.edu; j3xu@ttic.edu Supplementary Information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3117333
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-31173332011-06-17 A conditional random fields method for RNA sequence–structure relationship modeling and conformation sampling Wang, Zhiyong Xu, Jinbo Bioinformatics Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria Accurate tertiary structures are very important for the functional study of non-coding RNA molecules. However, predicting RNA tertiary structures is extremely challenging, because of a large conformation space to be explored and lack of an accurate scoring function differentiating the native structure from decoys. The fragment-based conformation sampling method (e.g. FARNA) bears shortcomings that the limited size of a fragment library makes it infeasible to represent all possible conformations well. A recent dynamic Bayesian network method, BARNACLE, overcomes the issue of fragment assembly. In addition, neither of these methods makes use of sequence information in sampling conformations. Here, we present a new probabilistic graphical model, conditional random fields (CRFs), to model RNA sequence–structure relationship, which enables us to accurately estimate the probability of an RNA conformation from sequence. Coupled with a novel tree-guided sampling scheme, our CRF model is then applied to RNA conformation sampling. Experimental results show that our CRF method can model RNA sequence–structure relationship well and sequence information is important for conformation sampling. Our method, named as TreeFolder, generates a much higher percentage of native-like decoys than FARNA and BARNACLE, although we use the same simple energy function as BARNACLE. Contact: zywang@ttic.edu; j3xu@ttic.edu Supplementary Information: Supplementary data are available at Bioinformatics online. Oxford University Press 2011-07-01 2011-06-14 /pmc/articles/PMC3117333/ /pubmed/21685058 http://dx.doi.org/10.1093/bioinformatics/btr232 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria
Wang, Zhiyong
Xu, Jinbo
A conditional random fields method for RNA sequence–structure relationship modeling and conformation sampling
title A conditional random fields method for RNA sequence–structure relationship modeling and conformation sampling
title_full A conditional random fields method for RNA sequence–structure relationship modeling and conformation sampling
title_fullStr A conditional random fields method for RNA sequence–structure relationship modeling and conformation sampling
title_full_unstemmed A conditional random fields method for RNA sequence–structure relationship modeling and conformation sampling
title_short A conditional random fields method for RNA sequence–structure relationship modeling and conformation sampling
title_sort conditional random fields method for rna sequence–structure relationship modeling and conformation sampling
topic Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117333/
https://www.ncbi.nlm.nih.gov/pubmed/21685058
http://dx.doi.org/10.1093/bioinformatics/btr232
work_keys_str_mv AT wangzhiyong aconditionalrandomfieldsmethodforrnasequencestructurerelationshipmodelingandconformationsampling
AT xujinbo aconditionalrandomfieldsmethodforrnasequencestructurerelationshipmodelingandconformationsampling
AT wangzhiyong conditionalrandomfieldsmethodforrnasequencestructurerelationshipmodelingandconformationsampling
AT xujinbo conditionalrandomfieldsmethodforrnasequencestructurerelationshipmodelingandconformationsampling