Cargando…

ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search

BACKGROUND: Recent advances in RNA structure probing technologies, including the ones based on high-throughput sequencing, have improved the accuracy of thermodynamic folding with quantitative nucleotide-resolution structural information. RESULTS: In this paper, we present a novel approach, ProbeAli...

Descripción completa

Detalles Bibliográficos
Autores principales: Ge, Ping, Zhong, Cuncong, Zhang, Shaojie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4168714/
https://www.ncbi.nlm.nih.gov/pubmed/25253206
http://dx.doi.org/10.1186/1471-2105-15-S9-S15
_version_ 1782335604848066560
author Ge, Ping
Zhong, Cuncong
Zhang, Shaojie
author_facet Ge, Ping
Zhong, Cuncong
Zhang, Shaojie
author_sort Ge, Ping
collection PubMed
description BACKGROUND: Recent advances in RNA structure probing technologies, including the ones based on high-throughput sequencing, have improved the accuracy of thermodynamic folding with quantitative nucleotide-resolution structural information. RESULTS: In this paper, we present a novel approach, ProbeAlign, to incorporate the reactivities from high-throughput RNA structure probing into ncRNA homology search for functional annotation. To reduce the overhead of structure alignment on large-scale data, the specific pairing patterns in the query sequences are ignored. On the other hand, the partial structural information of the target sequences embedded in probing data is retrieved to guide the alignment. Thus the structure alignment problem is transformed into a sequence alignment problem with additional reactivity information. The benchmark results show that the prediction accuracy of ProbeAlign outperforms filter-based CMsearch with high computational efficiency. The application of ProbeAlign to the FragSeq data, which is based on genome-wide structure probing, has demonstrated its capability to search ncRNAs in a large-scale dataset from high-throughput sequencing. CONCLUSIONS: By incorporating high-throughput sequencing-based structure probing information, ProbeAlign can improve the accuracy and efficiency of ncRNA homology search. It is a promising tool for ncRNA functional annotation on genome-wide datasets. AVAILABILITY: The source code of ProbeAlign is available at http://genome.ucf.edu/ProbeAlign.
format Online
Article
Text
id pubmed-4168714
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41687142014-10-02 ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search Ge, Ping Zhong, Cuncong Zhang, Shaojie BMC Bioinformatics Proceedings BACKGROUND: Recent advances in RNA structure probing technologies, including the ones based on high-throughput sequencing, have improved the accuracy of thermodynamic folding with quantitative nucleotide-resolution structural information. RESULTS: In this paper, we present a novel approach, ProbeAlign, to incorporate the reactivities from high-throughput RNA structure probing into ncRNA homology search for functional annotation. To reduce the overhead of structure alignment on large-scale data, the specific pairing patterns in the query sequences are ignored. On the other hand, the partial structural information of the target sequences embedded in probing data is retrieved to guide the alignment. Thus the structure alignment problem is transformed into a sequence alignment problem with additional reactivity information. The benchmark results show that the prediction accuracy of ProbeAlign outperforms filter-based CMsearch with high computational efficiency. The application of ProbeAlign to the FragSeq data, which is based on genome-wide structure probing, has demonstrated its capability to search ncRNAs in a large-scale dataset from high-throughput sequencing. CONCLUSIONS: By incorporating high-throughput sequencing-based structure probing information, ProbeAlign can improve the accuracy and efficiency of ncRNA homology search. It is a promising tool for ncRNA functional annotation on genome-wide datasets. AVAILABILITY: The source code of ProbeAlign is available at http://genome.ucf.edu/ProbeAlign. BioMed Central 2014-09-10 /pmc/articles/PMC4168714/ /pubmed/25253206 http://dx.doi.org/10.1186/1471-2105-15-S9-S15 Text en Copyright © 2014 Ge et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Ge, Ping
Zhong, Cuncong
Zhang, Shaojie
ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search
title ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search
title_full ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search
title_fullStr ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search
title_full_unstemmed ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search
title_short ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search
title_sort probealign: incorporating high-throughput sequencing-based structure probing information into ncrna homology search
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4168714/
https://www.ncbi.nlm.nih.gov/pubmed/25253206
http://dx.doi.org/10.1186/1471-2105-15-S9-S15
work_keys_str_mv AT geping probealignincorporatinghighthroughputsequencingbasedstructureprobinginformationintoncrnahomologysearch
AT zhongcuncong probealignincorporatinghighthroughputsequencingbasedstructureprobinginformationintoncrnahomologysearch
AT zhangshaojie probealignincorporatinghighthroughputsequencingbasedstructureprobinginformationintoncrnahomologysearch