Cargando…

Comparison of alternative approaches for analysing multi-level RNA-seq data

RNA sequencing (RNA-seq) is widely used for RNA quantification in the environmental, biological and medical sciences. It enables the description of genome-wide patterns of expression and the identification of regulatory interactions and networks. The aim of RNA-seq data analyses is to achieve rigoro...

Descripción completa

Detalles Bibliográficos
Autores principales: Mohorianu, Irina, Bretman, Amanda, Smith, Damian T., Fowler, Emily K., Dalmay, Tamas, Chapman, Tracey
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5549751/
https://www.ncbi.nlm.nih.gov/pubmed/28792517
http://dx.doi.org/10.1371/journal.pone.0182694
_version_ 1783256025701285888
author Mohorianu, Irina
Bretman, Amanda
Smith, Damian T.
Fowler, Emily K.
Dalmay, Tamas
Chapman, Tracey
author_facet Mohorianu, Irina
Bretman, Amanda
Smith, Damian T.
Fowler, Emily K.
Dalmay, Tamas
Chapman, Tracey
author_sort Mohorianu, Irina
collection PubMed
description RNA sequencing (RNA-seq) is widely used for RNA quantification in the environmental, biological and medical sciences. It enables the description of genome-wide patterns of expression and the identification of regulatory interactions and networks. The aim of RNA-seq data analyses is to achieve rigorous quantification of genes/transcripts to allow a reliable prediction of differential expression (DE), despite variation in levels of noise and inherent biases in sequencing data. This can be especially challenging for datasets in which gene expression differences are subtle, as in the behavioural transcriptomics test dataset from D. melanogaster that we used here. We investigated the power of existing approaches for quality checking mRNA-seq data and explored additional, quantitative quality checks. To accommodate nested, multi-level experimental designs, we incorporated sample layout into our analyses. We employed a subsampling without replacement-based normalization and an identification of DE that accounted for the hierarchy and amplitude of effect sizes within samples, then evaluated the resulting differential expression call in comparison to existing approaches. In a final step to test for broader applicability, we applied our approaches to a published set of H. sapiens mRNA-seq samples, The dataset-tailored methods improved sample comparability and delivered a robust prediction of subtle gene expression changes. The proposed approaches have the potential to improve key steps in the analysis of RNA-seq data by incorporating the structure and characteristics of biological experiments.
format Online
Article
Text
id pubmed-5549751
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-55497512017-08-12 Comparison of alternative approaches for analysing multi-level RNA-seq data Mohorianu, Irina Bretman, Amanda Smith, Damian T. Fowler, Emily K. Dalmay, Tamas Chapman, Tracey PLoS One Research Article RNA sequencing (RNA-seq) is widely used for RNA quantification in the environmental, biological and medical sciences. It enables the description of genome-wide patterns of expression and the identification of regulatory interactions and networks. The aim of RNA-seq data analyses is to achieve rigorous quantification of genes/transcripts to allow a reliable prediction of differential expression (DE), despite variation in levels of noise and inherent biases in sequencing data. This can be especially challenging for datasets in which gene expression differences are subtle, as in the behavioural transcriptomics test dataset from D. melanogaster that we used here. We investigated the power of existing approaches for quality checking mRNA-seq data and explored additional, quantitative quality checks. To accommodate nested, multi-level experimental designs, we incorporated sample layout into our analyses. We employed a subsampling without replacement-based normalization and an identification of DE that accounted for the hierarchy and amplitude of effect sizes within samples, then evaluated the resulting differential expression call in comparison to existing approaches. In a final step to test for broader applicability, we applied our approaches to a published set of H. sapiens mRNA-seq samples, The dataset-tailored methods improved sample comparability and delivered a robust prediction of subtle gene expression changes. The proposed approaches have the potential to improve key steps in the analysis of RNA-seq data by incorporating the structure and characteristics of biological experiments. Public Library of Science 2017-08-08 /pmc/articles/PMC5549751/ /pubmed/28792517 http://dx.doi.org/10.1371/journal.pone.0182694 Text en © 2017 Mohorianu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Mohorianu, Irina
Bretman, Amanda
Smith, Damian T.
Fowler, Emily K.
Dalmay, Tamas
Chapman, Tracey
Comparison of alternative approaches for analysing multi-level RNA-seq data
title Comparison of alternative approaches for analysing multi-level RNA-seq data
title_full Comparison of alternative approaches for analysing multi-level RNA-seq data
title_fullStr Comparison of alternative approaches for analysing multi-level RNA-seq data
title_full_unstemmed Comparison of alternative approaches for analysing multi-level RNA-seq data
title_short Comparison of alternative approaches for analysing multi-level RNA-seq data
title_sort comparison of alternative approaches for analysing multi-level rna-seq data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5549751/
https://www.ncbi.nlm.nih.gov/pubmed/28792517
http://dx.doi.org/10.1371/journal.pone.0182694
work_keys_str_mv AT mohorianuirina comparisonofalternativeapproachesforanalysingmultilevelrnaseqdata
AT bretmanamanda comparisonofalternativeapproachesforanalysingmultilevelrnaseqdata
AT smithdamiant comparisonofalternativeapproachesforanalysingmultilevelrnaseqdata
AT fowleremilyk comparisonofalternativeapproachesforanalysingmultilevelrnaseqdata
AT dalmaytamas comparisonofalternativeapproachesforanalysingmultilevelrnaseqdata
AT chapmantracey comparisonofalternativeapproachesforanalysingmultilevelrnaseqdata