Cargando…

Directed acyclic graph kernels for structural RNA analysis

BACKGROUND: Recent discoveries of a large variety of important roles for non-coding RNAs (ncRNAs) have been reported by numerous researchers. In order to analyze ncRNAs by kernel methods including support vector machines, we propose stem kernels as an extension of string kernels for measuring the si...

Descripción completa

Detalles Bibliográficos
Autores principales: Sato, Kengo, Mituyama, Toutai, Asai, Kiyoshi, Sakakibara, Yasubumi
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2515856/
https://www.ncbi.nlm.nih.gov/pubmed/18647390
http://dx.doi.org/10.1186/1471-2105-9-318
_version_ 1782158445399506944
author Sato, Kengo
Mituyama, Toutai
Asai, Kiyoshi
Sakakibara, Yasubumi
author_facet Sato, Kengo
Mituyama, Toutai
Asai, Kiyoshi
Sakakibara, Yasubumi
author_sort Sato, Kengo
collection PubMed
description BACKGROUND: Recent discoveries of a large variety of important roles for non-coding RNAs (ncRNAs) have been reported by numerous researchers. In order to analyze ncRNAs by kernel methods including support vector machines, we propose stem kernels as an extension of string kernels for measuring the similarities between two RNA sequences from the viewpoint of secondary structures. However, applying stem kernels directly to large data sets of ncRNAs is impractical due to their computational complexity. RESULTS: We have developed a new technique based on directed acyclic graphs (DAGs) derived from base-pairing probability matrices of RNA sequences that significantly increases the computation speed of stem kernels. Furthermore, we propose profile-profile stem kernels for multiple alignments of RNA sequences which utilize base-pairing probability matrices for multiple alignments instead of those for individual sequences. Our kernels outperformed the existing methods with respect to the detection of known ncRNAs and kernel hierarchical clustering. CONCLUSION: Stem kernels can be utilized as a reliable similarity measure of structural RNAs, and can be used in various kernel-based applications.
format Text
id pubmed-2515856
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25158562008-08-14 Directed acyclic graph kernels for structural RNA analysis Sato, Kengo Mituyama, Toutai Asai, Kiyoshi Sakakibara, Yasubumi BMC Bioinformatics Methodology Article BACKGROUND: Recent discoveries of a large variety of important roles for non-coding RNAs (ncRNAs) have been reported by numerous researchers. In order to analyze ncRNAs by kernel methods including support vector machines, we propose stem kernels as an extension of string kernels for measuring the similarities between two RNA sequences from the viewpoint of secondary structures. However, applying stem kernels directly to large data sets of ncRNAs is impractical due to their computational complexity. RESULTS: We have developed a new technique based on directed acyclic graphs (DAGs) derived from base-pairing probability matrices of RNA sequences that significantly increases the computation speed of stem kernels. Furthermore, we propose profile-profile stem kernels for multiple alignments of RNA sequences which utilize base-pairing probability matrices for multiple alignments instead of those for individual sequences. Our kernels outperformed the existing methods with respect to the detection of known ncRNAs and kernel hierarchical clustering. CONCLUSION: Stem kernels can be utilized as a reliable similarity measure of structural RNAs, and can be used in various kernel-based applications. BioMed Central 2008-07-22 /pmc/articles/PMC2515856/ /pubmed/18647390 http://dx.doi.org/10.1186/1471-2105-9-318 Text en Copyright © 2008 Sato et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Sato, Kengo
Mituyama, Toutai
Asai, Kiyoshi
Sakakibara, Yasubumi
Directed acyclic graph kernels for structural RNA analysis
title Directed acyclic graph kernels for structural RNA analysis
title_full Directed acyclic graph kernels for structural RNA analysis
title_fullStr Directed acyclic graph kernels for structural RNA analysis
title_full_unstemmed Directed acyclic graph kernels for structural RNA analysis
title_short Directed acyclic graph kernels for structural RNA analysis
title_sort directed acyclic graph kernels for structural rna analysis
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2515856/
https://www.ncbi.nlm.nih.gov/pubmed/18647390
http://dx.doi.org/10.1186/1471-2105-9-318
work_keys_str_mv AT satokengo directedacyclicgraphkernelsforstructuralrnaanalysis
AT mituyamatoutai directedacyclicgraphkernelsforstructuralrnaanalysis
AT asaikiyoshi directedacyclicgraphkernelsforstructuralrnaanalysis
AT sakakibarayasubumi directedacyclicgraphkernelsforstructuralrnaanalysis