Cargando…
Stability of methods for differential expression analysis of RNA-seq data
BACKGROUND: As RNA-seq becomes the assay of choice for measuring gene expression levels, differential expression analysis has received extensive attentions of researchers. To date, for the evaluation of DE methods, most attention has been paid on validity. Yet another important aspect of DE methods,...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6330476/ https://www.ncbi.nlm.nih.gov/pubmed/30634899 http://dx.doi.org/10.1186/s12864-018-5390-6 |
_version_ | 1783386982780502016 |
---|---|
author | Lin, Bingqing Pang, Zhen |
author_facet | Lin, Bingqing Pang, Zhen |
author_sort | Lin, Bingqing |
collection | PubMed |
description | BACKGROUND: As RNA-seq becomes the assay of choice for measuring gene expression levels, differential expression analysis has received extensive attentions of researchers. To date, for the evaluation of DE methods, most attention has been paid on validity. Yet another important aspect of DE methods, stability, is overlooked and has not been studied to the best of our knowledge. RESULTS: In this study, we empirically show the need of assessing stability of DE methods and propose a stability metric, called Area Under the Correlation curve (AUCOR), that generates the perturbed datasets by a mixture distribution and combines the information of similarities between sets of selected features from these perturbed datasets and the original dataset. CONCLUSION: Empirical results support that AUCOR can effectively rank the DE methods in terms of stability for given RNA-seq datasets. In addition, we explore how biological or technical factors from experiments and data analysis affect the stability of DE methods. AUCOR is implemented in the open-source R package AUCOR, with source code freely available at https://github.com/linbingqing/stableDE. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-5390-6) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6330476 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-63304762019-01-16 Stability of methods for differential expression analysis of RNA-seq data Lin, Bingqing Pang, Zhen BMC Genomics Methodology Article BACKGROUND: As RNA-seq becomes the assay of choice for measuring gene expression levels, differential expression analysis has received extensive attentions of researchers. To date, for the evaluation of DE methods, most attention has been paid on validity. Yet another important aspect of DE methods, stability, is overlooked and has not been studied to the best of our knowledge. RESULTS: In this study, we empirically show the need of assessing stability of DE methods and propose a stability metric, called Area Under the Correlation curve (AUCOR), that generates the perturbed datasets by a mixture distribution and combines the information of similarities between sets of selected features from these perturbed datasets and the original dataset. CONCLUSION: Empirical results support that AUCOR can effectively rank the DE methods in terms of stability for given RNA-seq datasets. In addition, we explore how biological or technical factors from experiments and data analysis affect the stability of DE methods. AUCOR is implemented in the open-source R package AUCOR, with source code freely available at https://github.com/linbingqing/stableDE. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-5390-6) contains supplementary material, which is available to authorized users. BioMed Central 2019-01-11 /pmc/articles/PMC6330476/ /pubmed/30634899 http://dx.doi.org/10.1186/s12864-018-5390-6 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Lin, Bingqing Pang, Zhen Stability of methods for differential expression analysis of RNA-seq data |
title | Stability of methods for differential expression analysis of RNA-seq data |
title_full | Stability of methods for differential expression analysis of RNA-seq data |
title_fullStr | Stability of methods for differential expression analysis of RNA-seq data |
title_full_unstemmed | Stability of methods for differential expression analysis of RNA-seq data |
title_short | Stability of methods for differential expression analysis of RNA-seq data |
title_sort | stability of methods for differential expression analysis of rna-seq data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6330476/ https://www.ncbi.nlm.nih.gov/pubmed/30634899 http://dx.doi.org/10.1186/s12864-018-5390-6 |
work_keys_str_mv | AT linbingqing stabilityofmethodsfordifferentialexpressionanalysisofrnaseqdata AT pangzhen stabilityofmethodsfordifferentialexpressionanalysisofrnaseqdata |