Cargando…

Graph Theoretical Methods and Workflows for Searching and Annotation of RNA Tertiary Base Motifs and Substructures

The increasing number and complexity of structures containing RNA chains in the Protein Data Bank (PDB) have led to the need for automated structure annotation methods to replace or complement expert visual curation. This is especially true when searching for tertiary base motifs and substructures....

Descripción completa

Detalles Bibliográficos
Autores principales: Emrizal, Reeki, Hamdani, Hazrina Yusof, Firdaus-Raih, Mohd
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8395288/
https://www.ncbi.nlm.nih.gov/pubmed/34445259
http://dx.doi.org/10.3390/ijms22168553
_version_ 1783744138589503488
author Emrizal, Reeki
Hamdani, Hazrina Yusof
Firdaus-Raih, Mohd
author_facet Emrizal, Reeki
Hamdani, Hazrina Yusof
Firdaus-Raih, Mohd
author_sort Emrizal, Reeki
collection PubMed
description The increasing number and complexity of structures containing RNA chains in the Protein Data Bank (PDB) have led to the need for automated structure annotation methods to replace or complement expert visual curation. This is especially true when searching for tertiary base motifs and substructures. Such base arrangements and motifs have diverse roles that range from contributions to structural stability to more direct involvement in the molecule’s functions, such as the sites for ligand binding and catalytic activity. We review the utility of computational approaches in annotating RNA tertiary base motifs in a dataset of PDB structures, particularly the use of graph theoretical algorithms that can search for such base motifs and annotate them or find and annotate clusters of hydrogen-bond-connected bases. We also demonstrate how such graph theoretical algorithms can be integrated into a workflow that allows for functional analysis and comparisons of base arrangements and sub-structures, such as those involved in ligand binding. The capacity to carry out such automatic curations has led to the discovery of novel motifs and can give new context to known motifs as well as enable the rapid compilation of RNA 3D motifs into a database.
format Online
Article
Text
id pubmed-8395288
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83952882021-08-28 Graph Theoretical Methods and Workflows for Searching and Annotation of RNA Tertiary Base Motifs and Substructures Emrizal, Reeki Hamdani, Hazrina Yusof Firdaus-Raih, Mohd Int J Mol Sci Review The increasing number and complexity of structures containing RNA chains in the Protein Data Bank (PDB) have led to the need for automated structure annotation methods to replace or complement expert visual curation. This is especially true when searching for tertiary base motifs and substructures. Such base arrangements and motifs have diverse roles that range from contributions to structural stability to more direct involvement in the molecule’s functions, such as the sites for ligand binding and catalytic activity. We review the utility of computational approaches in annotating RNA tertiary base motifs in a dataset of PDB structures, particularly the use of graph theoretical algorithms that can search for such base motifs and annotate them or find and annotate clusters of hydrogen-bond-connected bases. We also demonstrate how such graph theoretical algorithms can be integrated into a workflow that allows for functional analysis and comparisons of base arrangements and sub-structures, such as those involved in ligand binding. The capacity to carry out such automatic curations has led to the discovery of novel motifs and can give new context to known motifs as well as enable the rapid compilation of RNA 3D motifs into a database. MDPI 2021-08-09 /pmc/articles/PMC8395288/ /pubmed/34445259 http://dx.doi.org/10.3390/ijms22168553 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Review
Emrizal, Reeki
Hamdani, Hazrina Yusof
Firdaus-Raih, Mohd
Graph Theoretical Methods and Workflows for Searching and Annotation of RNA Tertiary Base Motifs and Substructures
title Graph Theoretical Methods and Workflows for Searching and Annotation of RNA Tertiary Base Motifs and Substructures
title_full Graph Theoretical Methods and Workflows for Searching and Annotation of RNA Tertiary Base Motifs and Substructures
title_fullStr Graph Theoretical Methods and Workflows for Searching and Annotation of RNA Tertiary Base Motifs and Substructures
title_full_unstemmed Graph Theoretical Methods and Workflows for Searching and Annotation of RNA Tertiary Base Motifs and Substructures
title_short Graph Theoretical Methods and Workflows for Searching and Annotation of RNA Tertiary Base Motifs and Substructures
title_sort graph theoretical methods and workflows for searching and annotation of rna tertiary base motifs and substructures
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8395288/
https://www.ncbi.nlm.nih.gov/pubmed/34445259
http://dx.doi.org/10.3390/ijms22168553
work_keys_str_mv AT emrizalreeki graphtheoreticalmethodsandworkflowsforsearchingandannotationofrnatertiarybasemotifsandsubstructures
AT hamdanihazrinayusof graphtheoreticalmethodsandworkflowsforsearchingandannotationofrnatertiarybasemotifsandsubstructures
AT firdausraihmohd graphtheoreticalmethodsandworkflowsforsearchingandannotationofrnatertiarybasemotifsandsubstructures