Cargando…

Robust and accurate prediction of noncoding RNAs from aligned sequences

BACKGROUND: Computational prediction of noncoding RNAs (ncRNAs) is an important task in the post-genomic era. One common approach is to utilize the profile information contained in alignment data rather than single sequences. However, this strategy involves the possibility that the quality of input...

Descripción completa

Detalles Bibliográficos
Autores principales: Saito, Yutaka, Sato, Kengo, Sakakibara, Yasubumi
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2957686/
https://www.ncbi.nlm.nih.gov/pubmed/21106125
http://dx.doi.org/10.1186/1471-2105-11-S7-S3
_version_ 1782188249459982336
author Saito, Yutaka
Sato, Kengo
Sakakibara, Yasubumi
author_facet Saito, Yutaka
Sato, Kengo
Sakakibara, Yasubumi
author_sort Saito, Yutaka
collection PubMed
description BACKGROUND: Computational prediction of noncoding RNAs (ncRNAs) is an important task in the post-genomic era. One common approach is to utilize the profile information contained in alignment data rather than single sequences. However, this strategy involves the possibility that the quality of input alignments can influence the performance of prediction methods. Therefore, the evaluation of the robustness against alignment errors is necessary as well as the development of accurate prediction methods. RESULTS: We describe a new method, called Profile BPLA kernel, which predicts ncRNAs from alignment data in combination with support vector machines (SVMs). Profile BPLA kernel is an extension of base-pairing profile local alignment (BPLA) kernel which we previously developed for the prediction from single sequences. By utilizing the profile information of alignment data, the proposed kernel can achieve better accuracy than the original BPLA kernel. We show that Profile BPLA kernel outperforms the existing prediction methods which also utilize the profile information using the high-quality structural alignment dataset. In addition to these standard benchmark tests, we extensively evaluate the robustness of Profile BPLA kernel against errors in input alignments. We consider two different types of error: first, that all sequences in an alignment are actually ncRNAs but are aligned ignoring their secondary structures; second, that an alignment contains unrelated sequences which are not ncRNAs but still aligned. In both cases, the effects on the performance of Profile BPLA kernel are surprisingly small. Especially for the latter case, we demonstrate that Profile BPLA kernel is more robust compared to the existing prediction methods. CONCLUSIONS: Profile BPLA kernel provides a promising way for identifying ncRNAs from alignment data. It is more accurate than the existing prediction methods, and can keep its performance under the practical situations in which the quality of input alignments is not necessarily high.
format Text
id pubmed-2957686
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29576862010-10-21 Robust and accurate prediction of noncoding RNAs from aligned sequences Saito, Yutaka Sato, Kengo Sakakibara, Yasubumi BMC Bioinformatics Proceedings BACKGROUND: Computational prediction of noncoding RNAs (ncRNAs) is an important task in the post-genomic era. One common approach is to utilize the profile information contained in alignment data rather than single sequences. However, this strategy involves the possibility that the quality of input alignments can influence the performance of prediction methods. Therefore, the evaluation of the robustness against alignment errors is necessary as well as the development of accurate prediction methods. RESULTS: We describe a new method, called Profile BPLA kernel, which predicts ncRNAs from alignment data in combination with support vector machines (SVMs). Profile BPLA kernel is an extension of base-pairing profile local alignment (BPLA) kernel which we previously developed for the prediction from single sequences. By utilizing the profile information of alignment data, the proposed kernel can achieve better accuracy than the original BPLA kernel. We show that Profile BPLA kernel outperforms the existing prediction methods which also utilize the profile information using the high-quality structural alignment dataset. In addition to these standard benchmark tests, we extensively evaluate the robustness of Profile BPLA kernel against errors in input alignments. We consider two different types of error: first, that all sequences in an alignment are actually ncRNAs but are aligned ignoring their secondary structures; second, that an alignment contains unrelated sequences which are not ncRNAs but still aligned. In both cases, the effects on the performance of Profile BPLA kernel are surprisingly small. Especially for the latter case, we demonstrate that Profile BPLA kernel is more robust compared to the existing prediction methods. CONCLUSIONS: Profile BPLA kernel provides a promising way for identifying ncRNAs from alignment data. It is more accurate than the existing prediction methods, and can keep its performance under the practical situations in which the quality of input alignments is not necessarily high. BioMed Central 2010-10-15 /pmc/articles/PMC2957686/ /pubmed/21106125 http://dx.doi.org/10.1186/1471-2105-11-S7-S3 Text en Copyright ©2010 Saito et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Saito, Yutaka
Sato, Kengo
Sakakibara, Yasubumi
Robust and accurate prediction of noncoding RNAs from aligned sequences
title Robust and accurate prediction of noncoding RNAs from aligned sequences
title_full Robust and accurate prediction of noncoding RNAs from aligned sequences
title_fullStr Robust and accurate prediction of noncoding RNAs from aligned sequences
title_full_unstemmed Robust and accurate prediction of noncoding RNAs from aligned sequences
title_short Robust and accurate prediction of noncoding RNAs from aligned sequences
title_sort robust and accurate prediction of noncoding rnas from aligned sequences
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2957686/
https://www.ncbi.nlm.nih.gov/pubmed/21106125
http://dx.doi.org/10.1186/1471-2105-11-S7-S3
work_keys_str_mv AT saitoyutaka robustandaccuratepredictionofnoncodingrnasfromalignedsequences
AT satokengo robustandaccuratepredictionofnoncodingrnasfromalignedsequences
AT sakakibarayasubumi robustandaccuratepredictionofnoncodingrnasfromalignedsequences