Cargando…

Detecting differentially expressed genes for syndromes by considering change in mean and dispersion simultaneously

BACKGROUND: Using next-generation sequencing technology to measure gene expression, an empirically intriguing question concerns the identification of differentially expressed genes across treatment groups. Existing methods aim to identify genes whose mean expressions differ among treatment groups by...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Chenchen, Ji, Tieming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6148965/
https://www.ncbi.nlm.nih.gov/pubmed/30236056
http://dx.doi.org/10.1186/s12859-018-2354-4
_version_ 1783356790668263424
author Ma, Chenchen
Ji, Tieming
author_facet Ma, Chenchen
Ji, Tieming
author_sort Ma, Chenchen
collection PubMed
description BACKGROUND: Using next-generation sequencing technology to measure gene expression, an empirically intriguing question concerns the identification of differentially expressed genes across treatment groups. Existing methods aim to identify genes whose mean expressions differ among treatment groups by assuming equal dispersion across all groups. For syndromes, however, various combinations of gene expression alterations can result in the same disease, leading to greater heteroscedasticity in the biological replicates in the disease group compared to the normal group. Traditional methods that only consider changes in the mean will fail to fully analyze gene expression in such a scenario. In addition, sequencing technology is relatively expensive; most labs can only afford a few replicates per treatment group, which poses further challenges to reliably estimating the mean and dispersion under each treatment condition. RESULTS: We designed an empirical Bayes method and a pooled permutation test to simultaneously consider the change in mean and dispersion across treatment groups. We further computed confidence intervals based on Bayes estimates to identify differentially expressed genes that are unique to each disease sample as well as those that are common across all disease samples. We illustrated our method by applying it to gene expression data from a large offspring syndrome experiment, which motivated this study. We compared our method to competing approaches through simulation studies that mimicked the real datasets to demonstrate the effectiveness of our proposed method. CONCLUSIONS: We will show that, compared to popular methods that only aim to find the difference in the mean, our method can capture greater variation in the disease group to effectively identify differentially expressed genes for syndromes.
format Online
Article
Text
id pubmed-6148965
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-61489652018-09-24 Detecting differentially expressed genes for syndromes by considering change in mean and dispersion simultaneously Ma, Chenchen Ji, Tieming BMC Bioinformatics Research Article BACKGROUND: Using next-generation sequencing technology to measure gene expression, an empirically intriguing question concerns the identification of differentially expressed genes across treatment groups. Existing methods aim to identify genes whose mean expressions differ among treatment groups by assuming equal dispersion across all groups. For syndromes, however, various combinations of gene expression alterations can result in the same disease, leading to greater heteroscedasticity in the biological replicates in the disease group compared to the normal group. Traditional methods that only consider changes in the mean will fail to fully analyze gene expression in such a scenario. In addition, sequencing technology is relatively expensive; most labs can only afford a few replicates per treatment group, which poses further challenges to reliably estimating the mean and dispersion under each treatment condition. RESULTS: We designed an empirical Bayes method and a pooled permutation test to simultaneously consider the change in mean and dispersion across treatment groups. We further computed confidence intervals based on Bayes estimates to identify differentially expressed genes that are unique to each disease sample as well as those that are common across all disease samples. We illustrated our method by applying it to gene expression data from a large offspring syndrome experiment, which motivated this study. We compared our method to competing approaches through simulation studies that mimicked the real datasets to demonstrate the effectiveness of our proposed method. CONCLUSIONS: We will show that, compared to popular methods that only aim to find the difference in the mean, our method can capture greater variation in the disease group to effectively identify differentially expressed genes for syndromes. BioMed Central 2018-09-20 /pmc/articles/PMC6148965/ /pubmed/30236056 http://dx.doi.org/10.1186/s12859-018-2354-4 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Ma, Chenchen
Ji, Tieming
Detecting differentially expressed genes for syndromes by considering change in mean and dispersion simultaneously
title Detecting differentially expressed genes for syndromes by considering change in mean and dispersion simultaneously
title_full Detecting differentially expressed genes for syndromes by considering change in mean and dispersion simultaneously
title_fullStr Detecting differentially expressed genes for syndromes by considering change in mean and dispersion simultaneously
title_full_unstemmed Detecting differentially expressed genes for syndromes by considering change in mean and dispersion simultaneously
title_short Detecting differentially expressed genes for syndromes by considering change in mean and dispersion simultaneously
title_sort detecting differentially expressed genes for syndromes by considering change in mean and dispersion simultaneously
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6148965/
https://www.ncbi.nlm.nih.gov/pubmed/30236056
http://dx.doi.org/10.1186/s12859-018-2354-4
work_keys_str_mv AT machenchen detectingdifferentiallyexpressedgenesforsyndromesbyconsideringchangeinmeananddispersionsimultaneously
AT jitieming detectingdifferentiallyexpressedgenesforsyndromesbyconsideringchangeinmeananddispersionsimultaneously