Cargando…

A Bayesian model for classifying all differentially expressed proteins simultaneously in 2D PAGE gels

BACKGROUND: Two-dimensional polyacrylamide gel electrophoresis (2D PAGE) is commonly used to identify differentially expressed proteins under two or more experimental or observational conditions. Wu et al (2009) developed a univariate probabilistic model which was used to identify differential expre...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Steven H, Black, Michael A, North, Robyn A, Rodrigo, Allen G
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3505467/
https://www.ncbi.nlm.nih.gov/pubmed/22712439
http://dx.doi.org/10.1186/1471-2105-13-137
_version_ 1782250760422031360
author Wu, Steven H
Black, Michael A
North, Robyn A
Rodrigo, Allen G
author_facet Wu, Steven H
Black, Michael A
North, Robyn A
Rodrigo, Allen G
author_sort Wu, Steven H
collection PubMed
description BACKGROUND: Two-dimensional polyacrylamide gel electrophoresis (2D PAGE) is commonly used to identify differentially expressed proteins under two or more experimental or observational conditions. Wu et al (2009) developed a univariate probabilistic model which was used to identify differential expression between Case and Control groups, by applying a Likelihood Ratio Test (LRT) to each protein on a 2D PAGE. In contrast to commonly used statistical approaches, this model takes into account the two possible causes of missing values in 2D PAGE: either (1) the non-expression of a protein; or (2) a level of expression that falls below the limit of detection. RESULTS: We develop a global Bayesian model which extends the previously described model. Unlike the univariate approach, the model reported here is able treat all differentially expressed proteins simultaneously. Whereas each protein is modelled by the univariate likelihood function previously described, several global distributions are used to model the underlying relationship between the parameters associated with individual proteins. These global distributions are able to combine information from each protein to give more accurate estimates of the true parameters. In our implementation of the procedure, all parameters are recovered by Markov chain Monte Carlo (MCMC) integration. The 95% highest posterior density (HPD) intervals for the marginal posterior distributions are used to determine whether differences in protein expression are due to differences in mean expression intensities, and/or differences in the probabilities of expression. CONCLUSIONS: Simulation analyses showed that the global model is able to accurately recover the underlying global distributions, and identify more differentially expressed proteins than the simple application of a LRT. Additionally, simulations also indicate that the probability of incorrectly identifying a protein as differentially expressed (i.e., the False Discovery Rate) is very low. The source code is available at https://github.com/stevenhwu/BIDE-2D.
format Online
Article
Text
id pubmed-3505467
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35054672012-11-29 A Bayesian model for classifying all differentially expressed proteins simultaneously in 2D PAGE gels Wu, Steven H Black, Michael A North, Robyn A Rodrigo, Allen G BMC Bioinformatics Methodology Article BACKGROUND: Two-dimensional polyacrylamide gel electrophoresis (2D PAGE) is commonly used to identify differentially expressed proteins under two or more experimental or observational conditions. Wu et al (2009) developed a univariate probabilistic model which was used to identify differential expression between Case and Control groups, by applying a Likelihood Ratio Test (LRT) to each protein on a 2D PAGE. In contrast to commonly used statistical approaches, this model takes into account the two possible causes of missing values in 2D PAGE: either (1) the non-expression of a protein; or (2) a level of expression that falls below the limit of detection. RESULTS: We develop a global Bayesian model which extends the previously described model. Unlike the univariate approach, the model reported here is able treat all differentially expressed proteins simultaneously. Whereas each protein is modelled by the univariate likelihood function previously described, several global distributions are used to model the underlying relationship between the parameters associated with individual proteins. These global distributions are able to combine information from each protein to give more accurate estimates of the true parameters. In our implementation of the procedure, all parameters are recovered by Markov chain Monte Carlo (MCMC) integration. The 95% highest posterior density (HPD) intervals for the marginal posterior distributions are used to determine whether differences in protein expression are due to differences in mean expression intensities, and/or differences in the probabilities of expression. CONCLUSIONS: Simulation analyses showed that the global model is able to accurately recover the underlying global distributions, and identify more differentially expressed proteins than the simple application of a LRT. Additionally, simulations also indicate that the probability of incorrectly identifying a protein as differentially expressed (i.e., the False Discovery Rate) is very low. The source code is available at https://github.com/stevenhwu/BIDE-2D. BioMed Central 2012-06-19 /pmc/articles/PMC3505467/ /pubmed/22712439 http://dx.doi.org/10.1186/1471-2105-13-137 Text en Copyright ©2012 Wu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Wu, Steven H
Black, Michael A
North, Robyn A
Rodrigo, Allen G
A Bayesian model for classifying all differentially expressed proteins simultaneously in 2D PAGE gels
title A Bayesian model for classifying all differentially expressed proteins simultaneously in 2D PAGE gels
title_full A Bayesian model for classifying all differentially expressed proteins simultaneously in 2D PAGE gels
title_fullStr A Bayesian model for classifying all differentially expressed proteins simultaneously in 2D PAGE gels
title_full_unstemmed A Bayesian model for classifying all differentially expressed proteins simultaneously in 2D PAGE gels
title_short A Bayesian model for classifying all differentially expressed proteins simultaneously in 2D PAGE gels
title_sort bayesian model for classifying all differentially expressed proteins simultaneously in 2d page gels
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3505467/
https://www.ncbi.nlm.nih.gov/pubmed/22712439
http://dx.doi.org/10.1186/1471-2105-13-137
work_keys_str_mv AT wustevenh abayesianmodelforclassifyingalldifferentiallyexpressedproteinssimultaneouslyin2dpagegels
AT blackmichaela abayesianmodelforclassifyingalldifferentiallyexpressedproteinssimultaneouslyin2dpagegels
AT northrobyna abayesianmodelforclassifyingalldifferentiallyexpressedproteinssimultaneouslyin2dpagegels
AT rodrigoalleng abayesianmodelforclassifyingalldifferentiallyexpressedproteinssimultaneouslyin2dpagegels
AT wustevenh bayesianmodelforclassifyingalldifferentiallyexpressedproteinssimultaneouslyin2dpagegels
AT blackmichaela bayesianmodelforclassifyingalldifferentiallyexpressedproteinssimultaneouslyin2dpagegels
AT northrobyna bayesianmodelforclassifyingalldifferentiallyexpressedproteinssimultaneouslyin2dpagegels
AT rodrigoalleng bayesianmodelforclassifyingalldifferentiallyexpressedproteinssimultaneouslyin2dpagegels