Cargando…
A conditional random fields method for RNA sequence–structure relationship modeling and conformation sampling
Accurate tertiary structures are very important for the functional study of non-coding RNA molecules. However, predicting RNA tertiary structures is extremely challenging, because of a large conformation space to be explored and lack of an accurate scoring function differentiating the native structu...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117333/ https://www.ncbi.nlm.nih.gov/pubmed/21685058 http://dx.doi.org/10.1093/bioinformatics/btr232 |
_version_ | 1782206317079822336 |
---|---|
author | Wang, Zhiyong Xu, Jinbo |
author_facet | Wang, Zhiyong Xu, Jinbo |
author_sort | Wang, Zhiyong |
collection | PubMed |
description | Accurate tertiary structures are very important for the functional study of non-coding RNA molecules. However, predicting RNA tertiary structures is extremely challenging, because of a large conformation space to be explored and lack of an accurate scoring function differentiating the native structure from decoys. The fragment-based conformation sampling method (e.g. FARNA) bears shortcomings that the limited size of a fragment library makes it infeasible to represent all possible conformations well. A recent dynamic Bayesian network method, BARNACLE, overcomes the issue of fragment assembly. In addition, neither of these methods makes use of sequence information in sampling conformations. Here, we present a new probabilistic graphical model, conditional random fields (CRFs), to model RNA sequence–structure relationship, which enables us to accurately estimate the probability of an RNA conformation from sequence. Coupled with a novel tree-guided sampling scheme, our CRF model is then applied to RNA conformation sampling. Experimental results show that our CRF method can model RNA sequence–structure relationship well and sequence information is important for conformation sampling. Our method, named as TreeFolder, generates a much higher percentage of native-like decoys than FARNA and BARNACLE, although we use the same simple energy function as BARNACLE. Contact: zywang@ttic.edu; j3xu@ttic.edu Supplementary Information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-3117333 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-31173332011-06-17 A conditional random fields method for RNA sequence–structure relationship modeling and conformation sampling Wang, Zhiyong Xu, Jinbo Bioinformatics Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria Accurate tertiary structures are very important for the functional study of non-coding RNA molecules. However, predicting RNA tertiary structures is extremely challenging, because of a large conformation space to be explored and lack of an accurate scoring function differentiating the native structure from decoys. The fragment-based conformation sampling method (e.g. FARNA) bears shortcomings that the limited size of a fragment library makes it infeasible to represent all possible conformations well. A recent dynamic Bayesian network method, BARNACLE, overcomes the issue of fragment assembly. In addition, neither of these methods makes use of sequence information in sampling conformations. Here, we present a new probabilistic graphical model, conditional random fields (CRFs), to model RNA sequence–structure relationship, which enables us to accurately estimate the probability of an RNA conformation from sequence. Coupled with a novel tree-guided sampling scheme, our CRF model is then applied to RNA conformation sampling. Experimental results show that our CRF method can model RNA sequence–structure relationship well and sequence information is important for conformation sampling. Our method, named as TreeFolder, generates a much higher percentage of native-like decoys than FARNA and BARNACLE, although we use the same simple energy function as BARNACLE. Contact: zywang@ttic.edu; j3xu@ttic.edu Supplementary Information: Supplementary data are available at Bioinformatics online. Oxford University Press 2011-07-01 2011-06-14 /pmc/articles/PMC3117333/ /pubmed/21685058 http://dx.doi.org/10.1093/bioinformatics/btr232 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria Wang, Zhiyong Xu, Jinbo A conditional random fields method for RNA sequence–structure relationship modeling and conformation sampling |
title | A conditional random fields method for RNA sequence–structure relationship modeling and conformation sampling |
title_full | A conditional random fields method for RNA sequence–structure relationship modeling and conformation sampling |
title_fullStr | A conditional random fields method for RNA sequence–structure relationship modeling and conformation sampling |
title_full_unstemmed | A conditional random fields method for RNA sequence–structure relationship modeling and conformation sampling |
title_short | A conditional random fields method for RNA sequence–structure relationship modeling and conformation sampling |
title_sort | conditional random fields method for rna sequence–structure relationship modeling and conformation sampling |
topic | Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117333/ https://www.ncbi.nlm.nih.gov/pubmed/21685058 http://dx.doi.org/10.1093/bioinformatics/btr232 |
work_keys_str_mv | AT wangzhiyong aconditionalrandomfieldsmethodforrnasequencestructurerelationshipmodelingandconformationsampling AT xujinbo aconditionalrandomfieldsmethodforrnasequencestructurerelationshipmodelingandconformationsampling AT wangzhiyong conditionalrandomfieldsmethodforrnasequencestructurerelationshipmodelingandconformationsampling AT xujinbo conditionalrandomfieldsmethodforrnasequencestructurerelationshipmodelingandconformationsampling |