Cargando…
ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search
BACKGROUND: Recent advances in RNA structure probing technologies, including the ones based on high-throughput sequencing, have improved the accuracy of thermodynamic folding with quantitative nucleotide-resolution structural information. RESULTS: In this paper, we present a novel approach, ProbeAli...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4168714/ https://www.ncbi.nlm.nih.gov/pubmed/25253206 http://dx.doi.org/10.1186/1471-2105-15-S9-S15 |
_version_ | 1782335604848066560 |
---|---|
author | Ge, Ping Zhong, Cuncong Zhang, Shaojie |
author_facet | Ge, Ping Zhong, Cuncong Zhang, Shaojie |
author_sort | Ge, Ping |
collection | PubMed |
description | BACKGROUND: Recent advances in RNA structure probing technologies, including the ones based on high-throughput sequencing, have improved the accuracy of thermodynamic folding with quantitative nucleotide-resolution structural information. RESULTS: In this paper, we present a novel approach, ProbeAlign, to incorporate the reactivities from high-throughput RNA structure probing into ncRNA homology search for functional annotation. To reduce the overhead of structure alignment on large-scale data, the specific pairing patterns in the query sequences are ignored. On the other hand, the partial structural information of the target sequences embedded in probing data is retrieved to guide the alignment. Thus the structure alignment problem is transformed into a sequence alignment problem with additional reactivity information. The benchmark results show that the prediction accuracy of ProbeAlign outperforms filter-based CMsearch with high computational efficiency. The application of ProbeAlign to the FragSeq data, which is based on genome-wide structure probing, has demonstrated its capability to search ncRNAs in a large-scale dataset from high-throughput sequencing. CONCLUSIONS: By incorporating high-throughput sequencing-based structure probing information, ProbeAlign can improve the accuracy and efficiency of ncRNA homology search. It is a promising tool for ncRNA functional annotation on genome-wide datasets. AVAILABILITY: The source code of ProbeAlign is available at http://genome.ucf.edu/ProbeAlign. |
format | Online Article Text |
id | pubmed-4168714 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41687142014-10-02 ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search Ge, Ping Zhong, Cuncong Zhang, Shaojie BMC Bioinformatics Proceedings BACKGROUND: Recent advances in RNA structure probing technologies, including the ones based on high-throughput sequencing, have improved the accuracy of thermodynamic folding with quantitative nucleotide-resolution structural information. RESULTS: In this paper, we present a novel approach, ProbeAlign, to incorporate the reactivities from high-throughput RNA structure probing into ncRNA homology search for functional annotation. To reduce the overhead of structure alignment on large-scale data, the specific pairing patterns in the query sequences are ignored. On the other hand, the partial structural information of the target sequences embedded in probing data is retrieved to guide the alignment. Thus the structure alignment problem is transformed into a sequence alignment problem with additional reactivity information. The benchmark results show that the prediction accuracy of ProbeAlign outperforms filter-based CMsearch with high computational efficiency. The application of ProbeAlign to the FragSeq data, which is based on genome-wide structure probing, has demonstrated its capability to search ncRNAs in a large-scale dataset from high-throughput sequencing. CONCLUSIONS: By incorporating high-throughput sequencing-based structure probing information, ProbeAlign can improve the accuracy and efficiency of ncRNA homology search. It is a promising tool for ncRNA functional annotation on genome-wide datasets. AVAILABILITY: The source code of ProbeAlign is available at http://genome.ucf.edu/ProbeAlign. BioMed Central 2014-09-10 /pmc/articles/PMC4168714/ /pubmed/25253206 http://dx.doi.org/10.1186/1471-2105-15-S9-S15 Text en Copyright © 2014 Ge et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Proceedings Ge, Ping Zhong, Cuncong Zhang, Shaojie ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search |
title | ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search |
title_full | ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search |
title_fullStr | ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search |
title_full_unstemmed | ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search |
title_short | ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search |
title_sort | probealign: incorporating high-throughput sequencing-based structure probing information into ncrna homology search |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4168714/ https://www.ncbi.nlm.nih.gov/pubmed/25253206 http://dx.doi.org/10.1186/1471-2105-15-S9-S15 |
work_keys_str_mv | AT geping probealignincorporatinghighthroughputsequencingbasedstructureprobinginformationintoncrnahomologysearch AT zhongcuncong probealignincorporatinghighthroughputsequencingbasedstructureprobinginformationintoncrnahomologysearch AT zhangshaojie probealignincorporatinghighthroughputsequencingbasedstructureprobinginformationintoncrnahomologysearch |