Cargando…

Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects

Random forests (RF) is one of a broad class of machine learning methods that are able to deal with large-scale data without model specification, which makes it an attractive method for genome-wide association studies (GWAS). The performance of RF and other association methods in the presence of inte...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Yoonhee, Wojciechowski, Robert, Sung, Heejong, Mathias, Rasika A, Wang, Li, Klein, Alison P, Lenroot, Rhoshel K, Malley, James, Bailey-Wilson, Joan E
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795965/
https://www.ncbi.nlm.nih.gov/pubmed/20018058
_version_ 1782175481173377024
author Kim, Yoonhee
Wojciechowski, Robert
Sung, Heejong
Mathias, Rasika A
Wang, Li
Klein, Alison P
Lenroot, Rhoshel K
Malley, James
Bailey-Wilson, Joan E
author_facet Kim, Yoonhee
Wojciechowski, Robert
Sung, Heejong
Mathias, Rasika A
Wang, Li
Klein, Alison P
Lenroot, Rhoshel K
Malley, James
Bailey-Wilson, Joan E
author_sort Kim, Yoonhee
collection PubMed
description Random forests (RF) is one of a broad class of machine learning methods that are able to deal with large-scale data without model specification, which makes it an attractive method for genome-wide association studies (GWAS). The performance of RF and other association methods in the presence of interactions was evaluated using the simulated data from Genetic Analysis Workshop 16 Problem 3, with knowledge of the major causative markers, risk factors, and their interactions in the simulated traits. There was good power to detect the environmental risk factors using RF, trend tests, or regression analyses but the power to detect the effects of the causal markers was poor for all methods. The causal marker that had an interactive effect with smoking did show moderate evidence of association in the RF and regression analyses, suggesting that RF may perform well at detecting such interactions in larger, more highly powered datasets.
format Text
id pubmed-2795965
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27959652009-12-18 Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects Kim, Yoonhee Wojciechowski, Robert Sung, Heejong Mathias, Rasika A Wang, Li Klein, Alison P Lenroot, Rhoshel K Malley, James Bailey-Wilson, Joan E BMC Proc Proceedings Random forests (RF) is one of a broad class of machine learning methods that are able to deal with large-scale data without model specification, which makes it an attractive method for genome-wide association studies (GWAS). The performance of RF and other association methods in the presence of interactions was evaluated using the simulated data from Genetic Analysis Workshop 16 Problem 3, with knowledge of the major causative markers, risk factors, and their interactions in the simulated traits. There was good power to detect the environmental risk factors using RF, trend tests, or regression analyses but the power to detect the effects of the causal markers was poor for all methods. The causal marker that had an interactive effect with smoking did show moderate evidence of association in the RF and regression analyses, suggesting that RF may perform well at detecting such interactions in larger, more highly powered datasets. BioMed Central 2009-12-15 /pmc/articles/PMC2795965/ /pubmed/20018058 Text en Copyright ©2009 Kim et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Kim, Yoonhee
Wojciechowski, Robert
Sung, Heejong
Mathias, Rasika A
Wang, Li
Klein, Alison P
Lenroot, Rhoshel K
Malley, James
Bailey-Wilson, Joan E
Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects
title Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects
title_full Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects
title_fullStr Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects
title_full_unstemmed Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects
title_short Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects
title_sort evaluation of random forests performance for genome-wide association studies in the presence of interaction effects
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795965/
https://www.ncbi.nlm.nih.gov/pubmed/20018058
work_keys_str_mv AT kimyoonhee evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects
AT wojciechowskirobert evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects
AT sungheejong evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects
AT mathiasrasikaa evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects
AT wangli evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects
AT kleinalisonp evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects
AT lenrootrhoshelk evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects
AT malleyjames evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects
AT baileywilsonjoane evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects