Cargando…

Exon-level estimates improve the detection of differentially expressed genes in RNA-seq studies

Detection of differentially expressed genes (DEGs) between different biological conditions is a key data analysis step of most RNA-sequencing studies. Conventionally, computational tools have used gene-level read counts as input to test for differential gene expression between sample condition group...

Descripción completa

Detalles Bibliográficos
Autores principales: Mehmood, Arfa, Laiho, Asta, Elo, Laura L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Taylor & Francis 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8582999/
https://www.ncbi.nlm.nih.gov/pubmed/33522408
http://dx.doi.org/10.1080/15476286.2020.1868151
_version_ 1784597114834124800
author Mehmood, Arfa
Laiho, Asta
Elo, Laura L.
author_facet Mehmood, Arfa
Laiho, Asta
Elo, Laura L.
author_sort Mehmood, Arfa
collection PubMed
description Detection of differentially expressed genes (DEGs) between different biological conditions is a key data analysis step of most RNA-sequencing studies. Conventionally, computational tools have used gene-level read counts as input to test for differential gene expression between sample condition groups. Recently, it has been suggested that statistical testing could be performed with increased power at a lower feature level prior to aggregating the results to the gene level. In this study, we systematically compared the performance of calling the DEGs when using read count data at different levels (gene, transcript, and exon) as input, in the context of two publicly available data sets. Additionally, we tested two different methods for aggregating the lower feature-level p-values to gene-level: Lancaster and empirical Brown’s method. Our results show that detection of DEGs is improved compared to the conventional gene-level approach regardless of the lower feature-level used for statistical testing. The overall best balance between accuracy and false discovery rate was obtained using the exon-level approach with empirical Brown’s aggregation method, which we provide as a freely available Bioconductor package EBSEA (https://bioconductor.org/packages/release/bioc/html/EBSEA.html).
format Online
Article
Text
id pubmed-8582999
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Taylor & Francis
record_format MEDLINE/PubMed
spelling pubmed-85829992021-11-12 Exon-level estimates improve the detection of differentially expressed genes in RNA-seq studies Mehmood, Arfa Laiho, Asta Elo, Laura L. RNA Biol Research Paper Detection of differentially expressed genes (DEGs) between different biological conditions is a key data analysis step of most RNA-sequencing studies. Conventionally, computational tools have used gene-level read counts as input to test for differential gene expression between sample condition groups. Recently, it has been suggested that statistical testing could be performed with increased power at a lower feature level prior to aggregating the results to the gene level. In this study, we systematically compared the performance of calling the DEGs when using read count data at different levels (gene, transcript, and exon) as input, in the context of two publicly available data sets. Additionally, we tested two different methods for aggregating the lower feature-level p-values to gene-level: Lancaster and empirical Brown’s method. Our results show that detection of DEGs is improved compared to the conventional gene-level approach regardless of the lower feature-level used for statistical testing. The overall best balance between accuracy and false discovery rate was obtained using the exon-level approach with empirical Brown’s aggregation method, which we provide as a freely available Bioconductor package EBSEA (https://bioconductor.org/packages/release/bioc/html/EBSEA.html). Taylor & Francis 2021-01-30 /pmc/articles/PMC8582999/ /pubmed/33522408 http://dx.doi.org/10.1080/15476286.2020.1868151 Text en © 2021 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Paper
Mehmood, Arfa
Laiho, Asta
Elo, Laura L.
Exon-level estimates improve the detection of differentially expressed genes in RNA-seq studies
title Exon-level estimates improve the detection of differentially expressed genes in RNA-seq studies
title_full Exon-level estimates improve the detection of differentially expressed genes in RNA-seq studies
title_fullStr Exon-level estimates improve the detection of differentially expressed genes in RNA-seq studies
title_full_unstemmed Exon-level estimates improve the detection of differentially expressed genes in RNA-seq studies
title_short Exon-level estimates improve the detection of differentially expressed genes in RNA-seq studies
title_sort exon-level estimates improve the detection of differentially expressed genes in rna-seq studies
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8582999/
https://www.ncbi.nlm.nih.gov/pubmed/33522408
http://dx.doi.org/10.1080/15476286.2020.1868151
work_keys_str_mv AT mehmoodarfa exonlevelestimatesimprovethedetectionofdifferentiallyexpressedgenesinrnaseqstudies
AT laihoasta exonlevelestimatesimprovethedetectionofdifferentiallyexpressedgenesinrnaseqstudies
AT elolaural exonlevelestimatesimprovethedetectionofdifferentiallyexpressedgenesinrnaseqstudies