Cargando…

Detecting gene-by-smoking interactions in a genome-wide association study of early-onset coronary heart disease using random forests

BACKGROUND: Genome-wide association studies are often limited in their ability to attain their full potential due to the sheer volume of information created. We sought to use the random forest algorithm to identify single-nucleotide polymorphisms (SNPs) that may be involved in gene-by-smoking intera...

Descripción completa

Detalles Bibliográficos
Autores principales: Maenner, Matthew J, Denlinger, Loren C, Langton, Asher, Meyers, Kristin J, Engelman, Corinne D, Skinner, Halcyon G
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795991/
https://www.ncbi.nlm.nih.gov/pubmed/20018084
_version_ 1782175487290769408
author Maenner, Matthew J
Denlinger, Loren C
Langton, Asher
Meyers, Kristin J
Engelman, Corinne D
Skinner, Halcyon G
author_facet Maenner, Matthew J
Denlinger, Loren C
Langton, Asher
Meyers, Kristin J
Engelman, Corinne D
Skinner, Halcyon G
author_sort Maenner, Matthew J
collection PubMed
description BACKGROUND: Genome-wide association studies are often limited in their ability to attain their full potential due to the sheer volume of information created. We sought to use the random forest algorithm to identify single-nucleotide polymorphisms (SNPs) that may be involved in gene-by-smoking interactions related to the early-onset of coronary heart disease. METHODS: Using data from the Framingham Heart Study, our analysis used a case-only design in which the outcome of interest was age of onset of early coronary heart disease. RESULTS: Smoking status was dichotomized as ever versus never. The single SNP with the highest importance score assigned by random forests was rs2011345. This SNP was not associated with age alone in the control subjects. Using generalized estimating equations to adjust for sex and account for familial correlation, there was evidence of an interaction between rs2011345 and smoking status. CONCLUSION: The results of this analysis suggest that random forests may be a useful tool for identifying SNPs taking part in gene-by-environment interactions in genome-wide association studies.
format Text
id pubmed-2795991
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27959912009-12-18 Detecting gene-by-smoking interactions in a genome-wide association study of early-onset coronary heart disease using random forests Maenner, Matthew J Denlinger, Loren C Langton, Asher Meyers, Kristin J Engelman, Corinne D Skinner, Halcyon G BMC Proc Proceedings BACKGROUND: Genome-wide association studies are often limited in their ability to attain their full potential due to the sheer volume of information created. We sought to use the random forest algorithm to identify single-nucleotide polymorphisms (SNPs) that may be involved in gene-by-smoking interactions related to the early-onset of coronary heart disease. METHODS: Using data from the Framingham Heart Study, our analysis used a case-only design in which the outcome of interest was age of onset of early coronary heart disease. RESULTS: Smoking status was dichotomized as ever versus never. The single SNP with the highest importance score assigned by random forests was rs2011345. This SNP was not associated with age alone in the control subjects. Using generalized estimating equations to adjust for sex and account for familial correlation, there was evidence of an interaction between rs2011345 and smoking status. CONCLUSION: The results of this analysis suggest that random forests may be a useful tool for identifying SNPs taking part in gene-by-environment interactions in genome-wide association studies. BioMed Central 2009-12-15 /pmc/articles/PMC2795991/ /pubmed/20018084 Text en Copyright ©2009 Maenner et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Maenner, Matthew J
Denlinger, Loren C
Langton, Asher
Meyers, Kristin J
Engelman, Corinne D
Skinner, Halcyon G
Detecting gene-by-smoking interactions in a genome-wide association study of early-onset coronary heart disease using random forests
title Detecting gene-by-smoking interactions in a genome-wide association study of early-onset coronary heart disease using random forests
title_full Detecting gene-by-smoking interactions in a genome-wide association study of early-onset coronary heart disease using random forests
title_fullStr Detecting gene-by-smoking interactions in a genome-wide association study of early-onset coronary heart disease using random forests
title_full_unstemmed Detecting gene-by-smoking interactions in a genome-wide association study of early-onset coronary heart disease using random forests
title_short Detecting gene-by-smoking interactions in a genome-wide association study of early-onset coronary heart disease using random forests
title_sort detecting gene-by-smoking interactions in a genome-wide association study of early-onset coronary heart disease using random forests
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795991/
https://www.ncbi.nlm.nih.gov/pubmed/20018084
work_keys_str_mv AT maennermatthewj detectinggenebysmokinginteractionsinagenomewideassociationstudyofearlyonsetcoronaryheartdiseaseusingrandomforests
AT denlingerlorenc detectinggenebysmokinginteractionsinagenomewideassociationstudyofearlyonsetcoronaryheartdiseaseusingrandomforests
AT langtonasher detectinggenebysmokinginteractionsinagenomewideassociationstudyofearlyonsetcoronaryheartdiseaseusingrandomforests
AT meyerskristinj detectinggenebysmokinginteractionsinagenomewideassociationstudyofearlyonsetcoronaryheartdiseaseusingrandomforests
AT engelmancorinned detectinggenebysmokinginteractionsinagenomewideassociationstudyofearlyonsetcoronaryheartdiseaseusingrandomforests
AT skinnerhalcyong detectinggenebysmokinginteractionsinagenomewideassociationstudyofearlyonsetcoronaryheartdiseaseusingrandomforests