Cargando…

Stability of methods for differential expression analysis of RNA-seq data

BACKGROUND: As RNA-seq becomes the assay of choice for measuring gene expression levels, differential expression analysis has received extensive attentions of researchers. To date, for the evaluation of DE methods, most attention has been paid on validity. Yet another important aspect of DE methods,...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Bingqing, Pang, Zhen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6330476/
https://www.ncbi.nlm.nih.gov/pubmed/30634899
http://dx.doi.org/10.1186/s12864-018-5390-6
_version_ 1783386982780502016
author Lin, Bingqing
Pang, Zhen
author_facet Lin, Bingqing
Pang, Zhen
author_sort Lin, Bingqing
collection PubMed
description BACKGROUND: As RNA-seq becomes the assay of choice for measuring gene expression levels, differential expression analysis has received extensive attentions of researchers. To date, for the evaluation of DE methods, most attention has been paid on validity. Yet another important aspect of DE methods, stability, is overlooked and has not been studied to the best of our knowledge. RESULTS: In this study, we empirically show the need of assessing stability of DE methods and propose a stability metric, called Area Under the Correlation curve (AUCOR), that generates the perturbed datasets by a mixture distribution and combines the information of similarities between sets of selected features from these perturbed datasets and the original dataset. CONCLUSION: Empirical results support that AUCOR can effectively rank the DE methods in terms of stability for given RNA-seq datasets. In addition, we explore how biological or technical factors from experiments and data analysis affect the stability of DE methods. AUCOR is implemented in the open-source R package AUCOR, with source code freely available at https://github.com/linbingqing/stableDE. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-5390-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6330476
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63304762019-01-16 Stability of methods for differential expression analysis of RNA-seq data Lin, Bingqing Pang, Zhen BMC Genomics Methodology Article BACKGROUND: As RNA-seq becomes the assay of choice for measuring gene expression levels, differential expression analysis has received extensive attentions of researchers. To date, for the evaluation of DE methods, most attention has been paid on validity. Yet another important aspect of DE methods, stability, is overlooked and has not been studied to the best of our knowledge. RESULTS: In this study, we empirically show the need of assessing stability of DE methods and propose a stability metric, called Area Under the Correlation curve (AUCOR), that generates the perturbed datasets by a mixture distribution and combines the information of similarities between sets of selected features from these perturbed datasets and the original dataset. CONCLUSION: Empirical results support that AUCOR can effectively rank the DE methods in terms of stability for given RNA-seq datasets. In addition, we explore how biological or technical factors from experiments and data analysis affect the stability of DE methods. AUCOR is implemented in the open-source R package AUCOR, with source code freely available at https://github.com/linbingqing/stableDE. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-5390-6) contains supplementary material, which is available to authorized users. BioMed Central 2019-01-11 /pmc/articles/PMC6330476/ /pubmed/30634899 http://dx.doi.org/10.1186/s12864-018-5390-6 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Lin, Bingqing
Pang, Zhen
Stability of methods for differential expression analysis of RNA-seq data
title Stability of methods for differential expression analysis of RNA-seq data
title_full Stability of methods for differential expression analysis of RNA-seq data
title_fullStr Stability of methods for differential expression analysis of RNA-seq data
title_full_unstemmed Stability of methods for differential expression analysis of RNA-seq data
title_short Stability of methods for differential expression analysis of RNA-seq data
title_sort stability of methods for differential expression analysis of rna-seq data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6330476/
https://www.ncbi.nlm.nih.gov/pubmed/30634899
http://dx.doi.org/10.1186/s12864-018-5390-6
work_keys_str_mv AT linbingqing stabilityofmethodsfordifferentialexpressionanalysisofrnaseqdata
AT pangzhen stabilityofmethodsfordifferentialexpressionanalysisofrnaseqdata