Cargando…

Robust Modeling of Differential Gene Expression Data Using Normal/Independent Distributions: A Bayesian Approach

In this paper, the problem of identifying differentially expressed genes under different conditions using gene expression microarray data, in the presence of outliers, is discussed. For this purpose, the robust modeling of gene expression data using some powerful distributions known as normal/indepe...

Descripción completa

Detalles Bibliográficos
Autores principales: Ganjali, Mojtaba, Baghfalaki, Taban, Berridge, Damon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4409222/
https://www.ncbi.nlm.nih.gov/pubmed/25910040
http://dx.doi.org/10.1371/journal.pone.0123791
_version_ 1782368173411008512
author Ganjali, Mojtaba
Baghfalaki, Taban
Berridge, Damon
author_facet Ganjali, Mojtaba
Baghfalaki, Taban
Berridge, Damon
author_sort Ganjali, Mojtaba
collection PubMed
description In this paper, the problem of identifying differentially expressed genes under different conditions using gene expression microarray data, in the presence of outliers, is discussed. For this purpose, the robust modeling of gene expression data using some powerful distributions known as normal/independent distributions is considered. These distributions include the Student’s t and normal distributions which have been used previously, but also include extensions such as the slash, the contaminated normal and the Laplace distributions. The purpose of this paper is to identify differentially expressed genes by considering these distributional assumptions instead of the normal distribution. A Bayesian approach using the Markov Chain Monte Carlo method is adopted for parameter estimation. Two publicly available gene expression data sets are analyzed using the proposed approach. The use of the robust models for detecting differentially expressed genes is investigated. This investigation shows that the choice of model for differentiating gene expression data is very important. This is due to the small number of replicates for each gene and the existence of outlying data. Comparison of the performance of these models is made using different statistical criteria and the ROC curve. The method is illustrated using some simulation studies. We demonstrate the flexibility of these robust models in identifying differentially expressed genes.
format Online
Article
Text
id pubmed-4409222
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44092222015-05-12 Robust Modeling of Differential Gene Expression Data Using Normal/Independent Distributions: A Bayesian Approach Ganjali, Mojtaba Baghfalaki, Taban Berridge, Damon PLoS One Research Article In this paper, the problem of identifying differentially expressed genes under different conditions using gene expression microarray data, in the presence of outliers, is discussed. For this purpose, the robust modeling of gene expression data using some powerful distributions known as normal/independent distributions is considered. These distributions include the Student’s t and normal distributions which have been used previously, but also include extensions such as the slash, the contaminated normal and the Laplace distributions. The purpose of this paper is to identify differentially expressed genes by considering these distributional assumptions instead of the normal distribution. A Bayesian approach using the Markov Chain Monte Carlo method is adopted for parameter estimation. Two publicly available gene expression data sets are analyzed using the proposed approach. The use of the robust models for detecting differentially expressed genes is investigated. This investigation shows that the choice of model for differentiating gene expression data is very important. This is due to the small number of replicates for each gene and the existence of outlying data. Comparison of the performance of these models is made using different statistical criteria and the ROC curve. The method is illustrated using some simulation studies. We demonstrate the flexibility of these robust models in identifying differentially expressed genes. Public Library of Science 2015-04-24 /pmc/articles/PMC4409222/ /pubmed/25910040 http://dx.doi.org/10.1371/journal.pone.0123791 Text en © 2015 Ganjali et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ganjali, Mojtaba
Baghfalaki, Taban
Berridge, Damon
Robust Modeling of Differential Gene Expression Data Using Normal/Independent Distributions: A Bayesian Approach
title Robust Modeling of Differential Gene Expression Data Using Normal/Independent Distributions: A Bayesian Approach
title_full Robust Modeling of Differential Gene Expression Data Using Normal/Independent Distributions: A Bayesian Approach
title_fullStr Robust Modeling of Differential Gene Expression Data Using Normal/Independent Distributions: A Bayesian Approach
title_full_unstemmed Robust Modeling of Differential Gene Expression Data Using Normal/Independent Distributions: A Bayesian Approach
title_short Robust Modeling of Differential Gene Expression Data Using Normal/Independent Distributions: A Bayesian Approach
title_sort robust modeling of differential gene expression data using normal/independent distributions: a bayesian approach
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4409222/
https://www.ncbi.nlm.nih.gov/pubmed/25910040
http://dx.doi.org/10.1371/journal.pone.0123791
work_keys_str_mv AT ganjalimojtaba robustmodelingofdifferentialgeneexpressiondatausingnormalindependentdistributionsabayesianapproach
AT baghfalakitaban robustmodelingofdifferentialgeneexpressiondatausingnormalindependentdistributionsabayesianapproach
AT berridgedamon robustmodelingofdifferentialgeneexpressiondatausingnormalindependentdistributionsabayesianapproach