Cargando…

Application of Bayesian networks to GAW20 genetic and blood lipid data

BACKGROUND: Bayesian networks have been proposed as a way to identify possible causal relationships between measured variables based on their conditional dependencies and independencies. We explored the use of Bayesian network analyses applied to the GAW20 data to identify possible causal relationsh...

Descripción completa

Detalles Bibliográficos
Autores principales: Howey, Richard A. J., Cordell, Heather J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6157285/
https://www.ncbi.nlm.nih.gov/pubmed/30275876
http://dx.doi.org/10.1186/s12919-018-0116-y
_version_ 1783358248922906624
author Howey, Richard A. J.
Cordell, Heather J.
author_facet Howey, Richard A. J.
Cordell, Heather J.
author_sort Howey, Richard A. J.
collection PubMed
description BACKGROUND: Bayesian networks have been proposed as a way to identify possible causal relationships between measured variables based on their conditional dependencies and independencies. We explored the use of Bayesian network analyses applied to the GAW20 data to identify possible causal relationships between differential methylation of cytosine-phosphate-guanine dinucleotides (CpGs), single-nucleotide polymorphisms (SNPs), and blood lipid trait (triglycerides [TGs]). METHODS: After initial exploratory linear regression analyses, 2 Bayesian networks analyses were performed. First, we used the real data and modeled the effects of 4 CpGs previously found to be associated with TGs in the Genetics of Lipid Lowering Drugs and Diet Network Study (GOLDN). Second, we used the simulated data and modeled the effect of a fictional lipid modifying drug with 5 known causal SNPs and 5 corresponding CpGs. RESULTS: In the real data we show that relationships are present between the CpGs, TGs, and other variables—age, sex, and center. In the simulated data, we show, using linear regression, that no CpGs and only 1 SNP were associated with a change in TG levels, and, using Bayesian network analysis, that relationships are present between the change in TG levels and most SNPs, but not with CpGs. CONCLUSIONS: Even when the causal relationships between variables are known, as with the simulated data, if the relationships are not strong then it is challenging to reproduce them in a Bayesian network.
format Online
Article
Text
id pubmed-6157285
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-61572852018-10-01 Application of Bayesian networks to GAW20 genetic and blood lipid data Howey, Richard A. J. Cordell, Heather J. BMC Proc Proceedings BACKGROUND: Bayesian networks have been proposed as a way to identify possible causal relationships between measured variables based on their conditional dependencies and independencies. We explored the use of Bayesian network analyses applied to the GAW20 data to identify possible causal relationships between differential methylation of cytosine-phosphate-guanine dinucleotides (CpGs), single-nucleotide polymorphisms (SNPs), and blood lipid trait (triglycerides [TGs]). METHODS: After initial exploratory linear regression analyses, 2 Bayesian networks analyses were performed. First, we used the real data and modeled the effects of 4 CpGs previously found to be associated with TGs in the Genetics of Lipid Lowering Drugs and Diet Network Study (GOLDN). Second, we used the simulated data and modeled the effect of a fictional lipid modifying drug with 5 known causal SNPs and 5 corresponding CpGs. RESULTS: In the real data we show that relationships are present between the CpGs, TGs, and other variables—age, sex, and center. In the simulated data, we show, using linear regression, that no CpGs and only 1 SNP were associated with a change in TG levels, and, using Bayesian network analysis, that relationships are present between the change in TG levels and most SNPs, but not with CpGs. CONCLUSIONS: Even when the causal relationships between variables are known, as with the simulated data, if the relationships are not strong then it is challenging to reproduce them in a Bayesian network. BioMed Central 2018-09-17 /pmc/articles/PMC6157285/ /pubmed/30275876 http://dx.doi.org/10.1186/s12919-018-0116-y Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Howey, Richard A. J.
Cordell, Heather J.
Application of Bayesian networks to GAW20 genetic and blood lipid data
title Application of Bayesian networks to GAW20 genetic and blood lipid data
title_full Application of Bayesian networks to GAW20 genetic and blood lipid data
title_fullStr Application of Bayesian networks to GAW20 genetic and blood lipid data
title_full_unstemmed Application of Bayesian networks to GAW20 genetic and blood lipid data
title_short Application of Bayesian networks to GAW20 genetic and blood lipid data
title_sort application of bayesian networks to gaw20 genetic and blood lipid data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6157285/
https://www.ncbi.nlm.nih.gov/pubmed/30275876
http://dx.doi.org/10.1186/s12919-018-0116-y
work_keys_str_mv AT howeyrichardaj applicationofbayesiannetworkstogaw20geneticandbloodlipiddata
AT cordellheatherj applicationofbayesiannetworkstogaw20geneticandbloodlipiddata