Cargando…

Robustness of differential gene expression analysis of RNA-seq

RNA-sequencing (RNA-seq) is a relatively new technology that lacks standardisation. RNA-seq can be used for Differential Gene Expression (DGE) analysis, however, no consensus exists as to which methodology ensures robust and reproducible results. Indeed, it is broadly acknowledged that DGE methods p...

Descripción completa

Detalles Bibliográficos
Autores principales: Stupnikov, A., McInerney, C.E., Savage, K.I., McIntosh, S.A., Emmert-Streib, F., Kennedy, R., Salto-Tellez, M., Prise, K.M., McArt, D.G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8214188/
https://www.ncbi.nlm.nih.gov/pubmed/34188784
http://dx.doi.org/10.1016/j.csbj.2021.05.040
_version_ 1783710011047804928
author Stupnikov, A.
McInerney, C.E.
Savage, K.I.
McIntosh, S.A.
Emmert-Streib, F.
Kennedy, R.
Salto-Tellez, M.
Prise, K.M.
McArt, D.G.
author_facet Stupnikov, A.
McInerney, C.E.
Savage, K.I.
McIntosh, S.A.
Emmert-Streib, F.
Kennedy, R.
Salto-Tellez, M.
Prise, K.M.
McArt, D.G.
author_sort Stupnikov, A.
collection PubMed
description RNA-sequencing (RNA-seq) is a relatively new technology that lacks standardisation. RNA-seq can be used for Differential Gene Expression (DGE) analysis, however, no consensus exists as to which methodology ensures robust and reproducible results. Indeed, it is broadly acknowledged that DGE methods provide disparate results. Despite obstacles, RNA-seq assays are in advanced development for clinical use but further optimisation will be needed. Herein, five DGE models (DESeq2, voom + limma, edgeR, EBSeq, NOISeq) for gene-level detection were investigated for robustness to sequencing alterations using a controlled analysis of fixed count matrices. Two breast cancer datasets were analysed with full and reduced sample sizes. DGE model robustness was compared between filtering regimes and for different expression levels (high, low) using unbiased metrics. Test sensitivity estimated as relative False Discovery Rate (FDR), concordance between model outputs and comparisons of a ’population’ of slopes of relative FDRs across different library sizes, generated using linear regressions, were examined. Patterns of relative DGE model robustness proved dataset-agnostic and reliable for drawing conclusions when sample sizes were sufficiently large. Overall, the non-parametric method NOISeq was the most robust followed by edgeR, voom, EBSeq and DESeq2. Our rigorous appraisal provides information for method selection for molecular diagnostics. Metrics may prove useful towards improving the standardisation of RNA-seq for precision medicine.
format Online
Article
Text
id pubmed-8214188
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-82141882021-06-28 Robustness of differential gene expression analysis of RNA-seq Stupnikov, A. McInerney, C.E. Savage, K.I. McIntosh, S.A. Emmert-Streib, F. Kennedy, R. Salto-Tellez, M. Prise, K.M. McArt, D.G. Comput Struct Biotechnol J Research Article RNA-sequencing (RNA-seq) is a relatively new technology that lacks standardisation. RNA-seq can be used for Differential Gene Expression (DGE) analysis, however, no consensus exists as to which methodology ensures robust and reproducible results. Indeed, it is broadly acknowledged that DGE methods provide disparate results. Despite obstacles, RNA-seq assays are in advanced development for clinical use but further optimisation will be needed. Herein, five DGE models (DESeq2, voom + limma, edgeR, EBSeq, NOISeq) for gene-level detection were investigated for robustness to sequencing alterations using a controlled analysis of fixed count matrices. Two breast cancer datasets were analysed with full and reduced sample sizes. DGE model robustness was compared between filtering regimes and for different expression levels (high, low) using unbiased metrics. Test sensitivity estimated as relative False Discovery Rate (FDR), concordance between model outputs and comparisons of a ’population’ of slopes of relative FDRs across different library sizes, generated using linear regressions, were examined. Patterns of relative DGE model robustness proved dataset-agnostic and reliable for drawing conclusions when sample sizes were sufficiently large. Overall, the non-parametric method NOISeq was the most robust followed by edgeR, voom, EBSeq and DESeq2. Our rigorous appraisal provides information for method selection for molecular diagnostics. Metrics may prove useful towards improving the standardisation of RNA-seq for precision medicine. Research Network of Computational and Structural Biotechnology 2021-05-26 /pmc/articles/PMC8214188/ /pubmed/34188784 http://dx.doi.org/10.1016/j.csbj.2021.05.040 Text en © 2021 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Stupnikov, A.
McInerney, C.E.
Savage, K.I.
McIntosh, S.A.
Emmert-Streib, F.
Kennedy, R.
Salto-Tellez, M.
Prise, K.M.
McArt, D.G.
Robustness of differential gene expression analysis of RNA-seq
title Robustness of differential gene expression analysis of RNA-seq
title_full Robustness of differential gene expression analysis of RNA-seq
title_fullStr Robustness of differential gene expression analysis of RNA-seq
title_full_unstemmed Robustness of differential gene expression analysis of RNA-seq
title_short Robustness of differential gene expression analysis of RNA-seq
title_sort robustness of differential gene expression analysis of rna-seq
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8214188/
https://www.ncbi.nlm.nih.gov/pubmed/34188784
http://dx.doi.org/10.1016/j.csbj.2021.05.040
work_keys_str_mv AT stupnikova robustnessofdifferentialgeneexpressionanalysisofrnaseq
AT mcinerneyce robustnessofdifferentialgeneexpressionanalysisofrnaseq
AT savageki robustnessofdifferentialgeneexpressionanalysisofrnaseq
AT mcintoshsa robustnessofdifferentialgeneexpressionanalysisofrnaseq
AT emmertstreibf robustnessofdifferentialgeneexpressionanalysisofrnaseq
AT kennedyr robustnessofdifferentialgeneexpressionanalysisofrnaseq
AT saltotellezm robustnessofdifferentialgeneexpressionanalysisofrnaseq
AT prisekm robustnessofdifferentialgeneexpressionanalysisofrnaseq
AT mcartdg robustnessofdifferentialgeneexpressionanalysisofrnaseq