Cargando…

Finding sRNA generative locales from high-throughput sequencing data with NiBLS

BACKGROUND: Next-generation sequencing technologies allow researchers to obtain millions of sequence reads in a single experiment. One important use of the technology is the sequencing of small non-coding regulatory RNAs and the identification of the genomic locales from which they originate. Curren...

Descripción completa

Detalles Bibliográficos
Autores principales: MacLean, Daniel, Moulton, Vincent, Studholme, David J
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2837031/
https://www.ncbi.nlm.nih.gov/pubmed/20167070
http://dx.doi.org/10.1186/1471-2105-11-93
_version_ 1782178763990106112
author MacLean, Daniel
Moulton, Vincent
Studholme, David J
author_facet MacLean, Daniel
Moulton, Vincent
Studholme, David J
author_sort MacLean, Daniel
collection PubMed
description BACKGROUND: Next-generation sequencing technologies allow researchers to obtain millions of sequence reads in a single experiment. One important use of the technology is the sequencing of small non-coding regulatory RNAs and the identification of the genomic locales from which they originate. Currently, there is a paucity of methods for finding small RNA generative locales. RESULTS: We describe and implement an algorithm that can determine small RNA generative locales from high-throughput sequencing data. The algorithm creates a network, or graph, of the small RNAs by creating links between them depending on their proximity on the target genome. For each of the sub-networks in the resulting graph the clustering coefficient, a measure of the interconnectedness of the subnetwork, is used to identify the generative locales. We test the algorithm over a wide range of parameters using RFAM sequences as positive controls and demonstrate that the algorithm has good sensitivity and specificity in a range of Arabidopsis and mouse small RNA sequence sets and that the locales it generates are robust to differences in the choice of parameters. CONCLUSIONS: NiBLS is a fast, reliable and sensitive method for determining small RNA locales in high-throughput sequence data that is generally applicable to all classes of small RNA.
format Text
id pubmed-2837031
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28370312010-03-12 Finding sRNA generative locales from high-throughput sequencing data with NiBLS MacLean, Daniel Moulton, Vincent Studholme, David J BMC Bioinformatics Methodology article BACKGROUND: Next-generation sequencing technologies allow researchers to obtain millions of sequence reads in a single experiment. One important use of the technology is the sequencing of small non-coding regulatory RNAs and the identification of the genomic locales from which they originate. Currently, there is a paucity of methods for finding small RNA generative locales. RESULTS: We describe and implement an algorithm that can determine small RNA generative locales from high-throughput sequencing data. The algorithm creates a network, or graph, of the small RNAs by creating links between them depending on their proximity on the target genome. For each of the sub-networks in the resulting graph the clustering coefficient, a measure of the interconnectedness of the subnetwork, is used to identify the generative locales. We test the algorithm over a wide range of parameters using RFAM sequences as positive controls and demonstrate that the algorithm has good sensitivity and specificity in a range of Arabidopsis and mouse small RNA sequence sets and that the locales it generates are robust to differences in the choice of parameters. CONCLUSIONS: NiBLS is a fast, reliable and sensitive method for determining small RNA locales in high-throughput sequence data that is generally applicable to all classes of small RNA. BioMed Central 2010-02-18 /pmc/articles/PMC2837031/ /pubmed/20167070 http://dx.doi.org/10.1186/1471-2105-11-93 Text en Copyright ©2010 MacLean et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology article
MacLean, Daniel
Moulton, Vincent
Studholme, David J
Finding sRNA generative locales from high-throughput sequencing data with NiBLS
title Finding sRNA generative locales from high-throughput sequencing data with NiBLS
title_full Finding sRNA generative locales from high-throughput sequencing data with NiBLS
title_fullStr Finding sRNA generative locales from high-throughput sequencing data with NiBLS
title_full_unstemmed Finding sRNA generative locales from high-throughput sequencing data with NiBLS
title_short Finding sRNA generative locales from high-throughput sequencing data with NiBLS
title_sort finding srna generative locales from high-throughput sequencing data with nibls
topic Methodology article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2837031/
https://www.ncbi.nlm.nih.gov/pubmed/20167070
http://dx.doi.org/10.1186/1471-2105-11-93
work_keys_str_mv AT macleandaniel findingsrnagenerativelocalesfromhighthroughputsequencingdatawithnibls
AT moultonvincent findingsrnagenerativelocalesfromhighthroughputsequencingdatawithnibls
AT studholmedavidj findingsrnagenerativelocalesfromhighthroughputsequencingdatawithnibls