Cargando…

An image processing approach to computing distances between RNA secondary structures dot plots

BACKGROUND: Computing the distance between two RNA secondary structures can contribute in understanding the functional relationship between them. When used repeatedly, such a procedure may lead to finding a query RNA structure of interest in a database of structures. Several methods are available fo...

Descripción completa

Detalles Bibliográficos
Autores principales: Ivry, Tor, Michal, Shahar, Avihoo, Assaf, Sapiro, Guillermo, Barash, Danny
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2677394/
https://www.ncbi.nlm.nih.gov/pubmed/19203377
http://dx.doi.org/10.1186/1748-7188-4-4
_version_ 1782166782230921216
author Ivry, Tor
Michal, Shahar
Avihoo, Assaf
Sapiro, Guillermo
Barash, Danny
author_facet Ivry, Tor
Michal, Shahar
Avihoo, Assaf
Sapiro, Guillermo
Barash, Danny
author_sort Ivry, Tor
collection PubMed
description BACKGROUND: Computing the distance between two RNA secondary structures can contribute in understanding the functional relationship between them. When used repeatedly, such a procedure may lead to finding a query RNA structure of interest in a database of structures. Several methods are available for computing distances between RNAs represented as strings or graphs, but none utilize the RNA representation with dot plots. Since dot plots are essentially digital images, there is a clear motivation to devise an algorithm for computing the distance between dot plots based on image processing methods. RESULTS: We have developed a new metric dubbed 'DoPloCompare', which compares two RNA structures. The method is based on comparing dot plot diagrams that represent the secondary structures. When analyzing two diagrams and motivated by image processing, the distance is based on a combination of histogram correlations and a geometrical distance measure. We introduce, describe, and illustrate the procedure by two applications that utilize this metric on RNA sequences. The first application is the RNA design problem, where the goal is to find the nucleotide sequence for a given secondary structure. Examples where our proposed distance measure outperforms others are given. The second application locates peculiar point mutations that induce significant structural alternations relative to the wild type predicted secondary structure. The approach reported in the past to solve this problem was tested on several RNA sequences with known secondary structures to affirm their prediction, as well as on a data set of ribosomal pieces. These pieces were computationally cut from a ribosome for which an experimentally derived secondary structure is available, and on each piece the prediction conveys similarity to the experimental result. Our newly proposed distance measure shows benefit in this problem as well when compared to standard methods used for assessing the distance similarity between two RNA secondary structures. CONCLUSION: Inspired by image processing and the dot plot representation for RNA secondary structure, we have managed to provide a conceptually new and potentially beneficial metric for comparing two RNA secondary structures. We illustrated our approach on the RNA design problem, as well as on an application that utilizes the distance measure to detect conformational rearranging point mutations in an RNA sequence.
format Text
id pubmed-2677394
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26773942009-05-06 An image processing approach to computing distances between RNA secondary structures dot plots Ivry, Tor Michal, Shahar Avihoo, Assaf Sapiro, Guillermo Barash, Danny Algorithms Mol Biol Research BACKGROUND: Computing the distance between two RNA secondary structures can contribute in understanding the functional relationship between them. When used repeatedly, such a procedure may lead to finding a query RNA structure of interest in a database of structures. Several methods are available for computing distances between RNAs represented as strings or graphs, but none utilize the RNA representation with dot plots. Since dot plots are essentially digital images, there is a clear motivation to devise an algorithm for computing the distance between dot plots based on image processing methods. RESULTS: We have developed a new metric dubbed 'DoPloCompare', which compares two RNA structures. The method is based on comparing dot plot diagrams that represent the secondary structures. When analyzing two diagrams and motivated by image processing, the distance is based on a combination of histogram correlations and a geometrical distance measure. We introduce, describe, and illustrate the procedure by two applications that utilize this metric on RNA sequences. The first application is the RNA design problem, where the goal is to find the nucleotide sequence for a given secondary structure. Examples where our proposed distance measure outperforms others are given. The second application locates peculiar point mutations that induce significant structural alternations relative to the wild type predicted secondary structure. The approach reported in the past to solve this problem was tested on several RNA sequences with known secondary structures to affirm their prediction, as well as on a data set of ribosomal pieces. These pieces were computationally cut from a ribosome for which an experimentally derived secondary structure is available, and on each piece the prediction conveys similarity to the experimental result. Our newly proposed distance measure shows benefit in this problem as well when compared to standard methods used for assessing the distance similarity between two RNA secondary structures. CONCLUSION: Inspired by image processing and the dot plot representation for RNA secondary structure, we have managed to provide a conceptually new and potentially beneficial metric for comparing two RNA secondary structures. We illustrated our approach on the RNA design problem, as well as on an application that utilizes the distance measure to detect conformational rearranging point mutations in an RNA sequence. BioMed Central 2009-02-09 /pmc/articles/PMC2677394/ /pubmed/19203377 http://dx.doi.org/10.1186/1748-7188-4-4 Text en Copyright © 2009 Ivry et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Ivry, Tor
Michal, Shahar
Avihoo, Assaf
Sapiro, Guillermo
Barash, Danny
An image processing approach to computing distances between RNA secondary structures dot plots
title An image processing approach to computing distances between RNA secondary structures dot plots
title_full An image processing approach to computing distances between RNA secondary structures dot plots
title_fullStr An image processing approach to computing distances between RNA secondary structures dot plots
title_full_unstemmed An image processing approach to computing distances between RNA secondary structures dot plots
title_short An image processing approach to computing distances between RNA secondary structures dot plots
title_sort image processing approach to computing distances between rna secondary structures dot plots
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2677394/
https://www.ncbi.nlm.nih.gov/pubmed/19203377
http://dx.doi.org/10.1186/1748-7188-4-4
work_keys_str_mv AT ivrytor animageprocessingapproachtocomputingdistancesbetweenrnasecondarystructuresdotplots
AT michalshahar animageprocessingapproachtocomputingdistancesbetweenrnasecondarystructuresdotplots
AT avihooassaf animageprocessingapproachtocomputingdistancesbetweenrnasecondarystructuresdotplots
AT sapiroguillermo animageprocessingapproachtocomputingdistancesbetweenrnasecondarystructuresdotplots
AT barashdanny animageprocessingapproachtocomputingdistancesbetweenrnasecondarystructuresdotplots
AT ivrytor imageprocessingapproachtocomputingdistancesbetweenrnasecondarystructuresdotplots
AT michalshahar imageprocessingapproachtocomputingdistancesbetweenrnasecondarystructuresdotplots
AT avihooassaf imageprocessingapproachtocomputingdistancesbetweenrnasecondarystructuresdotplots
AT sapiroguillermo imageprocessingapproachtocomputingdistancesbetweenrnasecondarystructuresdotplots
AT barashdanny imageprocessingapproachtocomputingdistancesbetweenrnasecondarystructuresdotplots