Cargando…
Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects
Random forests (RF) is one of a broad class of machine learning methods that are able to deal with large-scale data without model specification, which makes it an attractive method for genome-wide association studies (GWAS). The performance of RF and other association methods in the presence of inte...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795965/ https://www.ncbi.nlm.nih.gov/pubmed/20018058 |
_version_ | 1782175481173377024 |
---|---|
author | Kim, Yoonhee Wojciechowski, Robert Sung, Heejong Mathias, Rasika A Wang, Li Klein, Alison P Lenroot, Rhoshel K Malley, James Bailey-Wilson, Joan E |
author_facet | Kim, Yoonhee Wojciechowski, Robert Sung, Heejong Mathias, Rasika A Wang, Li Klein, Alison P Lenroot, Rhoshel K Malley, James Bailey-Wilson, Joan E |
author_sort | Kim, Yoonhee |
collection | PubMed |
description | Random forests (RF) is one of a broad class of machine learning methods that are able to deal with large-scale data without model specification, which makes it an attractive method for genome-wide association studies (GWAS). The performance of RF and other association methods in the presence of interactions was evaluated using the simulated data from Genetic Analysis Workshop 16 Problem 3, with knowledge of the major causative markers, risk factors, and their interactions in the simulated traits. There was good power to detect the environmental risk factors using RF, trend tests, or regression analyses but the power to detect the effects of the causal markers was poor for all methods. The causal marker that had an interactive effect with smoking did show moderate evidence of association in the RF and regression analyses, suggesting that RF may perform well at detecting such interactions in larger, more highly powered datasets. |
format | Text |
id | pubmed-2795965 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-27959652009-12-18 Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects Kim, Yoonhee Wojciechowski, Robert Sung, Heejong Mathias, Rasika A Wang, Li Klein, Alison P Lenroot, Rhoshel K Malley, James Bailey-Wilson, Joan E BMC Proc Proceedings Random forests (RF) is one of a broad class of machine learning methods that are able to deal with large-scale data without model specification, which makes it an attractive method for genome-wide association studies (GWAS). The performance of RF and other association methods in the presence of interactions was evaluated using the simulated data from Genetic Analysis Workshop 16 Problem 3, with knowledge of the major causative markers, risk factors, and their interactions in the simulated traits. There was good power to detect the environmental risk factors using RF, trend tests, or regression analyses but the power to detect the effects of the causal markers was poor for all methods. The causal marker that had an interactive effect with smoking did show moderate evidence of association in the RF and regression analyses, suggesting that RF may perform well at detecting such interactions in larger, more highly powered datasets. BioMed Central 2009-12-15 /pmc/articles/PMC2795965/ /pubmed/20018058 Text en Copyright ©2009 Kim et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Kim, Yoonhee Wojciechowski, Robert Sung, Heejong Mathias, Rasika A Wang, Li Klein, Alison P Lenroot, Rhoshel K Malley, James Bailey-Wilson, Joan E Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects |
title | Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects |
title_full | Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects |
title_fullStr | Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects |
title_full_unstemmed | Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects |
title_short | Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects |
title_sort | evaluation of random forests performance for genome-wide association studies in the presence of interaction effects |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795965/ https://www.ncbi.nlm.nih.gov/pubmed/20018058 |
work_keys_str_mv | AT kimyoonhee evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects AT wojciechowskirobert evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects AT sungheejong evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects AT mathiasrasikaa evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects AT wangli evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects AT kleinalisonp evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects AT lenrootrhoshelk evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects AT malleyjames evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects AT baileywilsonjoane evaluationofrandomforestsperformanceforgenomewideassociationstudiesinthepresenceofinteractioneffects |