Cargando…

Classification and assessment tools for structural motif discovery algorithms

BACKGROUND: Motif discovery is the problem of finding recurring patterns in biological data. Patterns can be sequential, mainly when discovered in DNA sequences. They can also be structural (e.g. when discovering RNA motifs). Finding common structural patterns helps to gain a better understanding of...

Descripción completa

Detalles Bibliográficos
Autores principales: Badr, Ghada, Al-Turaiki, Isra, Mathkour, Hassan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3698030/
https://www.ncbi.nlm.nih.gov/pubmed/23902564
http://dx.doi.org/10.1186/1471-2105-14-S9-S4
_version_ 1782275227656388608
author Badr, Ghada
Al-Turaiki, Isra
Mathkour, Hassan
author_facet Badr, Ghada
Al-Turaiki, Isra
Mathkour, Hassan
author_sort Badr, Ghada
collection PubMed
description BACKGROUND: Motif discovery is the problem of finding recurring patterns in biological data. Patterns can be sequential, mainly when discovered in DNA sequences. They can also be structural (e.g. when discovering RNA motifs). Finding common structural patterns helps to gain a better understanding of the mechanism of action (e.g. post-transcriptional regulation). Unlike DNA motifs, which are sequentially conserved, RNA motifs exhibit conservation in structure, which may be common even if the sequences are different. Over the past few years, hundreds of algorithms have been developed to solve the sequential motif discovery problem, while less work has been done for the structural case. METHODS: In this paper, we survey, classify, and compare different algorithms that solve the structural motif discovery problem, where the underlying sequences may be different. We highlight their strengths and weaknesses. We start by proposing a benchmark dataset and a measurement tool that can be used to evaluate different motif discovery approaches. Then, we proceed by proposing our experimental setup. Finally, results are obtained using the proposed benchmark to compare available tools. To the best of our knowledge, this is the first attempt to compare tools solely designed for structural motif discovery. RESULTS: Results show that the accuracy of discovered motifs is relatively low. The results also suggest a complementary behavior among tools where some tools perform well on simple structures, while other tools are better for complex structures. CONCLUSIONS: We have classified and evaluated the performance of available structural motif discovery tools. In addition, we have proposed a benchmark dataset with tools that can be used to evaluate newly developed tools.
format Online
Article
Text
id pubmed-3698030
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36980302013-07-02 Classification and assessment tools for structural motif discovery algorithms Badr, Ghada Al-Turaiki, Isra Mathkour, Hassan BMC Bioinformatics Methodology Article BACKGROUND: Motif discovery is the problem of finding recurring patterns in biological data. Patterns can be sequential, mainly when discovered in DNA sequences. They can also be structural (e.g. when discovering RNA motifs). Finding common structural patterns helps to gain a better understanding of the mechanism of action (e.g. post-transcriptional regulation). Unlike DNA motifs, which are sequentially conserved, RNA motifs exhibit conservation in structure, which may be common even if the sequences are different. Over the past few years, hundreds of algorithms have been developed to solve the sequential motif discovery problem, while less work has been done for the structural case. METHODS: In this paper, we survey, classify, and compare different algorithms that solve the structural motif discovery problem, where the underlying sequences may be different. We highlight their strengths and weaknesses. We start by proposing a benchmark dataset and a measurement tool that can be used to evaluate different motif discovery approaches. Then, we proceed by proposing our experimental setup. Finally, results are obtained using the proposed benchmark to compare available tools. To the best of our knowledge, this is the first attempt to compare tools solely designed for structural motif discovery. RESULTS: Results show that the accuracy of discovered motifs is relatively low. The results also suggest a complementary behavior among tools where some tools perform well on simple structures, while other tools are better for complex structures. CONCLUSIONS: We have classified and evaluated the performance of available structural motif discovery tools. In addition, we have proposed a benchmark dataset with tools that can be used to evaluate newly developed tools. BioMed Central 2013-06-28 /pmc/articles/PMC3698030/ /pubmed/23902564 http://dx.doi.org/10.1186/1471-2105-14-S9-S4 Text en Copyright © 2013 Badr et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Badr, Ghada
Al-Turaiki, Isra
Mathkour, Hassan
Classification and assessment tools for structural motif discovery algorithms
title Classification and assessment tools for structural motif discovery algorithms
title_full Classification and assessment tools for structural motif discovery algorithms
title_fullStr Classification and assessment tools for structural motif discovery algorithms
title_full_unstemmed Classification and assessment tools for structural motif discovery algorithms
title_short Classification and assessment tools for structural motif discovery algorithms
title_sort classification and assessment tools for structural motif discovery algorithms
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3698030/
https://www.ncbi.nlm.nih.gov/pubmed/23902564
http://dx.doi.org/10.1186/1471-2105-14-S9-S4
work_keys_str_mv AT badrghada classificationandassessmenttoolsforstructuralmotifdiscoveryalgorithms
AT alturaikiisra classificationandassessmenttoolsforstructuralmotifdiscoveryalgorithms
AT mathkourhassan classificationandassessmenttoolsforstructuralmotifdiscoveryalgorithms