Cargando…
De novo discovery of structural motifs in RNA 3D structures through clustering
As functional components in three-dimensional (3D) conformation of an RNA, the RNA structural motifs provide an easy way to associate the molecular architectures with their biological mechanisms. In the past years, many computational tools have been developed to search motif instances by using the e...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5961109/ https://www.ncbi.nlm.nih.gov/pubmed/29534235 http://dx.doi.org/10.1093/nar/gky139 |
_version_ | 1783324681945743360 |
---|---|
author | Ge, Ping Islam, Shahidul Zhong, Cuncong Zhang, Shaojie |
author_facet | Ge, Ping Islam, Shahidul Zhong, Cuncong Zhang, Shaojie |
author_sort | Ge, Ping |
collection | PubMed |
description | As functional components in three-dimensional (3D) conformation of an RNA, the RNA structural motifs provide an easy way to associate the molecular architectures with their biological mechanisms. In the past years, many computational tools have been developed to search motif instances by using the existing knowledge of well-studied families. Recently, with the rapidly increasing number of resolved RNA 3D structures, there is an urgent need to discover novel motifs with the newly presented information. In this work, we classify all the loops in non-redundant RNA 3D structures to detect plausible RNA structural motif families by using a clustering pipeline. Compared with other clustering approaches, our method has two benefits: first, the underlying alignment algorithm is tolerant to the variations in 3D structures. Second, sophisticated downstream analysis has been performed to ensure the clusters are valid and easily applied to further research. The final clustering results contain many interesting new variants of known motif families, such as GNAA tetraloop, kink-turn, sarcin-ricin and T-loop. We have also discovered potential novel functional motifs conserved in ribosomal RNA, sgRNA, SRP RNA, riboswitch and ribozyme. |
format | Online Article Text |
id | pubmed-5961109 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-59611092018-06-06 De novo discovery of structural motifs in RNA 3D structures through clustering Ge, Ping Islam, Shahidul Zhong, Cuncong Zhang, Shaojie Nucleic Acids Res RNA and RNA-protein complexes As functional components in three-dimensional (3D) conformation of an RNA, the RNA structural motifs provide an easy way to associate the molecular architectures with their biological mechanisms. In the past years, many computational tools have been developed to search motif instances by using the existing knowledge of well-studied families. Recently, with the rapidly increasing number of resolved RNA 3D structures, there is an urgent need to discover novel motifs with the newly presented information. In this work, we classify all the loops in non-redundant RNA 3D structures to detect plausible RNA structural motif families by using a clustering pipeline. Compared with other clustering approaches, our method has two benefits: first, the underlying alignment algorithm is tolerant to the variations in 3D structures. Second, sophisticated downstream analysis has been performed to ensure the clusters are valid and easily applied to further research. The final clustering results contain many interesting new variants of known motif families, such as GNAA tetraloop, kink-turn, sarcin-ricin and T-loop. We have also discovered potential novel functional motifs conserved in ribosomal RNA, sgRNA, SRP RNA, riboswitch and ribozyme. Oxford University Press 2018-05-18 2018-03-09 /pmc/articles/PMC5961109/ /pubmed/29534235 http://dx.doi.org/10.1093/nar/gky139 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | RNA and RNA-protein complexes Ge, Ping Islam, Shahidul Zhong, Cuncong Zhang, Shaojie De novo discovery of structural motifs in RNA 3D structures through clustering |
title |
De novo discovery of structural motifs in RNA 3D structures through clustering |
title_full |
De novo discovery of structural motifs in RNA 3D structures through clustering |
title_fullStr |
De novo discovery of structural motifs in RNA 3D structures through clustering |
title_full_unstemmed |
De novo discovery of structural motifs in RNA 3D structures through clustering |
title_short |
De novo discovery of structural motifs in RNA 3D structures through clustering |
title_sort | de novo discovery of structural motifs in rna 3d structures through clustering |
topic | RNA and RNA-protein complexes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5961109/ https://www.ncbi.nlm.nih.gov/pubmed/29534235 http://dx.doi.org/10.1093/nar/gky139 |
work_keys_str_mv | AT geping denovodiscoveryofstructuralmotifsinrna3dstructuresthroughclustering AT islamshahidul denovodiscoveryofstructuralmotifsinrna3dstructuresthroughclustering AT zhongcuncong denovodiscoveryofstructuralmotifsinrna3dstructuresthroughclustering AT zhangshaojie denovodiscoveryofstructuralmotifsinrna3dstructuresthroughclustering |