Cargando…

Comprehensive processing of high-throughput small RNA sequencing data including quality checking, normalization, and differential expression analysis using the UEA sRNA Workbench

Recently, high-throughput sequencing (HTS) has revealed compelling details about the small RNA (sRNA) population in eukaryotes. These 20 to 25 nt noncoding RNAs can influence gene expression by acting as guides for the sequence-specific regulatory mechanism known as RNA silencing. The increase in se...

Descripción completa

Detalles Bibliográficos
Autores principales: Beckers, Matthew, Mohorianu, Irina, Stocks, Matthew, Applegate, Christopher, Dalmay, Tamas, Moulton, Vincent
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5435855/
https://www.ncbi.nlm.nih.gov/pubmed/28289155
http://dx.doi.org/10.1261/rna.059360.116
_version_ 1783237291326570496
author Beckers, Matthew
Mohorianu, Irina
Stocks, Matthew
Applegate, Christopher
Dalmay, Tamas
Moulton, Vincent
author_facet Beckers, Matthew
Mohorianu, Irina
Stocks, Matthew
Applegate, Christopher
Dalmay, Tamas
Moulton, Vincent
author_sort Beckers, Matthew
collection PubMed
description Recently, high-throughput sequencing (HTS) has revealed compelling details about the small RNA (sRNA) population in eukaryotes. These 20 to 25 nt noncoding RNAs can influence gene expression by acting as guides for the sequence-specific regulatory mechanism known as RNA silencing. The increase in sequencing depth and number of samples per project enables a better understanding of the role sRNAs play by facilitating the study of expression patterns. However, the intricacy of the biological hypotheses coupled with a lack of appropriate tools often leads to inadequate mining of the available data and thus, an incomplete description of the biological mechanisms involved. To enable a comprehensive study of differential expression in sRNA data sets, we present a new interactive pipeline that guides researchers through the various stages of data preprocessing and analysis. This includes various tools, some of which we specifically developed for sRNA analysis, for quality checking and normalization of sRNA samples as well as tools for the detection of differentially expressed sRNAs and identification of the resulting expression patterns. The pipeline is available within the UEA sRNA Workbench, a user-friendly software package for the processing of sRNA data sets. We demonstrate the use of the pipeline on a H. sapiens data set; additional examples on a B. terrestris data set and on an A. thaliana data set are described in the Supplemental Information. A comparison with existing approaches is also included, which exemplifies some of the issues that need to be addressed for sRNA analysis and how the new pipeline may be used to do this.
format Online
Article
Text
id pubmed-5435855
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-54358552017-06-02 Comprehensive processing of high-throughput small RNA sequencing data including quality checking, normalization, and differential expression analysis using the UEA sRNA Workbench Beckers, Matthew Mohorianu, Irina Stocks, Matthew Applegate, Christopher Dalmay, Tamas Moulton, Vincent RNA Bioinformatics Recently, high-throughput sequencing (HTS) has revealed compelling details about the small RNA (sRNA) population in eukaryotes. These 20 to 25 nt noncoding RNAs can influence gene expression by acting as guides for the sequence-specific regulatory mechanism known as RNA silencing. The increase in sequencing depth and number of samples per project enables a better understanding of the role sRNAs play by facilitating the study of expression patterns. However, the intricacy of the biological hypotheses coupled with a lack of appropriate tools often leads to inadequate mining of the available data and thus, an incomplete description of the biological mechanisms involved. To enable a comprehensive study of differential expression in sRNA data sets, we present a new interactive pipeline that guides researchers through the various stages of data preprocessing and analysis. This includes various tools, some of which we specifically developed for sRNA analysis, for quality checking and normalization of sRNA samples as well as tools for the detection of differentially expressed sRNAs and identification of the resulting expression patterns. The pipeline is available within the UEA sRNA Workbench, a user-friendly software package for the processing of sRNA data sets. We demonstrate the use of the pipeline on a H. sapiens data set; additional examples on a B. terrestris data set and on an A. thaliana data set are described in the Supplemental Information. A comparison with existing approaches is also included, which exemplifies some of the issues that need to be addressed for sRNA analysis and how the new pipeline may be used to do this. Cold Spring Harbor Laboratory Press 2017-06 /pmc/articles/PMC5435855/ /pubmed/28289155 http://dx.doi.org/10.1261/rna.059360.116 Text en © 2017 Beckers et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society http://creativecommons.org/licenses/by/4.0/ This article, published in RNA, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.
spellingShingle Bioinformatics
Beckers, Matthew
Mohorianu, Irina
Stocks, Matthew
Applegate, Christopher
Dalmay, Tamas
Moulton, Vincent
Comprehensive processing of high-throughput small RNA sequencing data including quality checking, normalization, and differential expression analysis using the UEA sRNA Workbench
title Comprehensive processing of high-throughput small RNA sequencing data including quality checking, normalization, and differential expression analysis using the UEA sRNA Workbench
title_full Comprehensive processing of high-throughput small RNA sequencing data including quality checking, normalization, and differential expression analysis using the UEA sRNA Workbench
title_fullStr Comprehensive processing of high-throughput small RNA sequencing data including quality checking, normalization, and differential expression analysis using the UEA sRNA Workbench
title_full_unstemmed Comprehensive processing of high-throughput small RNA sequencing data including quality checking, normalization, and differential expression analysis using the UEA sRNA Workbench
title_short Comprehensive processing of high-throughput small RNA sequencing data including quality checking, normalization, and differential expression analysis using the UEA sRNA Workbench
title_sort comprehensive processing of high-throughput small rna sequencing data including quality checking, normalization, and differential expression analysis using the uea srna workbench
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5435855/
https://www.ncbi.nlm.nih.gov/pubmed/28289155
http://dx.doi.org/10.1261/rna.059360.116
work_keys_str_mv AT beckersmatthew comprehensiveprocessingofhighthroughputsmallrnasequencingdataincludingqualitycheckingnormalizationanddifferentialexpressionanalysisusingtheueasrnaworkbench
AT mohorianuirina comprehensiveprocessingofhighthroughputsmallrnasequencingdataincludingqualitycheckingnormalizationanddifferentialexpressionanalysisusingtheueasrnaworkbench
AT stocksmatthew comprehensiveprocessingofhighthroughputsmallrnasequencingdataincludingqualitycheckingnormalizationanddifferentialexpressionanalysisusingtheueasrnaworkbench
AT applegatechristopher comprehensiveprocessingofhighthroughputsmallrnasequencingdataincludingqualitycheckingnormalizationanddifferentialexpressionanalysisusingtheueasrnaworkbench
AT dalmaytamas comprehensiveprocessingofhighthroughputsmallrnasequencingdataincludingqualitycheckingnormalizationanddifferentialexpressionanalysisusingtheueasrnaworkbench
AT moultonvincent comprehensiveprocessingofhighthroughputsmallrnasequencingdataincludingqualitycheckingnormalizationanddifferentialexpressionanalysisusingtheueasrnaworkbench