Cargando…
Finding sRNA generative locales from high-throughput sequencing data with NiBLS
BACKGROUND: Next-generation sequencing technologies allow researchers to obtain millions of sequence reads in a single experiment. One important use of the technology is the sequencing of small non-coding regulatory RNAs and the identification of the genomic locales from which they originate. Curren...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2837031/ https://www.ncbi.nlm.nih.gov/pubmed/20167070 http://dx.doi.org/10.1186/1471-2105-11-93 |
_version_ | 1782178763990106112 |
---|---|
author | MacLean, Daniel Moulton, Vincent Studholme, David J |
author_facet | MacLean, Daniel Moulton, Vincent Studholme, David J |
author_sort | MacLean, Daniel |
collection | PubMed |
description | BACKGROUND: Next-generation sequencing technologies allow researchers to obtain millions of sequence reads in a single experiment. One important use of the technology is the sequencing of small non-coding regulatory RNAs and the identification of the genomic locales from which they originate. Currently, there is a paucity of methods for finding small RNA generative locales. RESULTS: We describe and implement an algorithm that can determine small RNA generative locales from high-throughput sequencing data. The algorithm creates a network, or graph, of the small RNAs by creating links between them depending on their proximity on the target genome. For each of the sub-networks in the resulting graph the clustering coefficient, a measure of the interconnectedness of the subnetwork, is used to identify the generative locales. We test the algorithm over a wide range of parameters using RFAM sequences as positive controls and demonstrate that the algorithm has good sensitivity and specificity in a range of Arabidopsis and mouse small RNA sequence sets and that the locales it generates are robust to differences in the choice of parameters. CONCLUSIONS: NiBLS is a fast, reliable and sensitive method for determining small RNA locales in high-throughput sequence data that is generally applicable to all classes of small RNA. |
format | Text |
id | pubmed-2837031 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-28370312010-03-12 Finding sRNA generative locales from high-throughput sequencing data with NiBLS MacLean, Daniel Moulton, Vincent Studholme, David J BMC Bioinformatics Methodology article BACKGROUND: Next-generation sequencing technologies allow researchers to obtain millions of sequence reads in a single experiment. One important use of the technology is the sequencing of small non-coding regulatory RNAs and the identification of the genomic locales from which they originate. Currently, there is a paucity of methods for finding small RNA generative locales. RESULTS: We describe and implement an algorithm that can determine small RNA generative locales from high-throughput sequencing data. The algorithm creates a network, or graph, of the small RNAs by creating links between them depending on their proximity on the target genome. For each of the sub-networks in the resulting graph the clustering coefficient, a measure of the interconnectedness of the subnetwork, is used to identify the generative locales. We test the algorithm over a wide range of parameters using RFAM sequences as positive controls and demonstrate that the algorithm has good sensitivity and specificity in a range of Arabidopsis and mouse small RNA sequence sets and that the locales it generates are robust to differences in the choice of parameters. CONCLUSIONS: NiBLS is a fast, reliable and sensitive method for determining small RNA locales in high-throughput sequence data that is generally applicable to all classes of small RNA. BioMed Central 2010-02-18 /pmc/articles/PMC2837031/ /pubmed/20167070 http://dx.doi.org/10.1186/1471-2105-11-93 Text en Copyright ©2010 MacLean et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology article MacLean, Daniel Moulton, Vincent Studholme, David J Finding sRNA generative locales from high-throughput sequencing data with NiBLS |
title | Finding sRNA generative locales from high-throughput sequencing data with NiBLS |
title_full | Finding sRNA generative locales from high-throughput sequencing data with NiBLS |
title_fullStr | Finding sRNA generative locales from high-throughput sequencing data with NiBLS |
title_full_unstemmed | Finding sRNA generative locales from high-throughput sequencing data with NiBLS |
title_short | Finding sRNA generative locales from high-throughput sequencing data with NiBLS |
title_sort | finding srna generative locales from high-throughput sequencing data with nibls |
topic | Methodology article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2837031/ https://www.ncbi.nlm.nih.gov/pubmed/20167070 http://dx.doi.org/10.1186/1471-2105-11-93 |
work_keys_str_mv | AT macleandaniel findingsrnagenerativelocalesfromhighthroughputsequencingdatawithnibls AT moultonvincent findingsrnagenerativelocalesfromhighthroughputsequencingdatawithnibls AT studholmedavidj findingsrnagenerativelocalesfromhighthroughputsequencingdatawithnibls |