Cargando…
Robustness of differential gene expression analysis of RNA-seq
RNA-sequencing (RNA-seq) is a relatively new technology that lacks standardisation. RNA-seq can be used for Differential Gene Expression (DGE) analysis, however, no consensus exists as to which methodology ensures robust and reproducible results. Indeed, it is broadly acknowledged that DGE methods p...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8214188/ https://www.ncbi.nlm.nih.gov/pubmed/34188784 http://dx.doi.org/10.1016/j.csbj.2021.05.040 |
_version_ | 1783710011047804928 |
---|---|
author | Stupnikov, A. McInerney, C.E. Savage, K.I. McIntosh, S.A. Emmert-Streib, F. Kennedy, R. Salto-Tellez, M. Prise, K.M. McArt, D.G. |
author_facet | Stupnikov, A. McInerney, C.E. Savage, K.I. McIntosh, S.A. Emmert-Streib, F. Kennedy, R. Salto-Tellez, M. Prise, K.M. McArt, D.G. |
author_sort | Stupnikov, A. |
collection | PubMed |
description | RNA-sequencing (RNA-seq) is a relatively new technology that lacks standardisation. RNA-seq can be used for Differential Gene Expression (DGE) analysis, however, no consensus exists as to which methodology ensures robust and reproducible results. Indeed, it is broadly acknowledged that DGE methods provide disparate results. Despite obstacles, RNA-seq assays are in advanced development for clinical use but further optimisation will be needed. Herein, five DGE models (DESeq2, voom + limma, edgeR, EBSeq, NOISeq) for gene-level detection were investigated for robustness to sequencing alterations using a controlled analysis of fixed count matrices. Two breast cancer datasets were analysed with full and reduced sample sizes. DGE model robustness was compared between filtering regimes and for different expression levels (high, low) using unbiased metrics. Test sensitivity estimated as relative False Discovery Rate (FDR), concordance between model outputs and comparisons of a ’population’ of slopes of relative FDRs across different library sizes, generated using linear regressions, were examined. Patterns of relative DGE model robustness proved dataset-agnostic and reliable for drawing conclusions when sample sizes were sufficiently large. Overall, the non-parametric method NOISeq was the most robust followed by edgeR, voom, EBSeq and DESeq2. Our rigorous appraisal provides information for method selection for molecular diagnostics. Metrics may prove useful towards improving the standardisation of RNA-seq for precision medicine. |
format | Online Article Text |
id | pubmed-8214188 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-82141882021-06-28 Robustness of differential gene expression analysis of RNA-seq Stupnikov, A. McInerney, C.E. Savage, K.I. McIntosh, S.A. Emmert-Streib, F. Kennedy, R. Salto-Tellez, M. Prise, K.M. McArt, D.G. Comput Struct Biotechnol J Research Article RNA-sequencing (RNA-seq) is a relatively new technology that lacks standardisation. RNA-seq can be used for Differential Gene Expression (DGE) analysis, however, no consensus exists as to which methodology ensures robust and reproducible results. Indeed, it is broadly acknowledged that DGE methods provide disparate results. Despite obstacles, RNA-seq assays are in advanced development for clinical use but further optimisation will be needed. Herein, five DGE models (DESeq2, voom + limma, edgeR, EBSeq, NOISeq) for gene-level detection were investigated for robustness to sequencing alterations using a controlled analysis of fixed count matrices. Two breast cancer datasets were analysed with full and reduced sample sizes. DGE model robustness was compared between filtering regimes and for different expression levels (high, low) using unbiased metrics. Test sensitivity estimated as relative False Discovery Rate (FDR), concordance between model outputs and comparisons of a ’population’ of slopes of relative FDRs across different library sizes, generated using linear regressions, were examined. Patterns of relative DGE model robustness proved dataset-agnostic and reliable for drawing conclusions when sample sizes were sufficiently large. Overall, the non-parametric method NOISeq was the most robust followed by edgeR, voom, EBSeq and DESeq2. Our rigorous appraisal provides information for method selection for molecular diagnostics. Metrics may prove useful towards improving the standardisation of RNA-seq for precision medicine. Research Network of Computational and Structural Biotechnology 2021-05-26 /pmc/articles/PMC8214188/ /pubmed/34188784 http://dx.doi.org/10.1016/j.csbj.2021.05.040 Text en © 2021 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Research Article Stupnikov, A. McInerney, C.E. Savage, K.I. McIntosh, S.A. Emmert-Streib, F. Kennedy, R. Salto-Tellez, M. Prise, K.M. McArt, D.G. Robustness of differential gene expression analysis of RNA-seq |
title | Robustness of differential gene expression analysis of RNA-seq |
title_full | Robustness of differential gene expression analysis of RNA-seq |
title_fullStr | Robustness of differential gene expression analysis of RNA-seq |
title_full_unstemmed | Robustness of differential gene expression analysis of RNA-seq |
title_short | Robustness of differential gene expression analysis of RNA-seq |
title_sort | robustness of differential gene expression analysis of rna-seq |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8214188/ https://www.ncbi.nlm.nih.gov/pubmed/34188784 http://dx.doi.org/10.1016/j.csbj.2021.05.040 |
work_keys_str_mv | AT stupnikova robustnessofdifferentialgeneexpressionanalysisofrnaseq AT mcinerneyce robustnessofdifferentialgeneexpressionanalysisofrnaseq AT savageki robustnessofdifferentialgeneexpressionanalysisofrnaseq AT mcintoshsa robustnessofdifferentialgeneexpressionanalysisofrnaseq AT emmertstreibf robustnessofdifferentialgeneexpressionanalysisofrnaseq AT kennedyr robustnessofdifferentialgeneexpressionanalysisofrnaseq AT saltotellezm robustnessofdifferentialgeneexpressionanalysisofrnaseq AT prisekm robustnessofdifferentialgeneexpressionanalysisofrnaseq AT mcartdg robustnessofdifferentialgeneexpressionanalysisofrnaseq |