Cargando…
Identification of genes and haplotypes that predict rheumatoid arthritis using random forests
Random forest (RF) analysis of genetic data does not require specification of the mode of inheritance, and provides measures of variable importance that incorporate interaction effects. In this paper we describe RF-based approaches for assessment of gene and haplotype importance, and apply these app...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795969/ https://www.ncbi.nlm.nih.gov/pubmed/20018062 |
_version_ | 1782175482100318208 |
---|---|
author | Tang, Rui Sinnwell, Jason P Li, Jia Rider, David N de Andrade, Mariza Biernacka, Joanna M |
author_facet | Tang, Rui Sinnwell, Jason P Li, Jia Rider, David N de Andrade, Mariza Biernacka, Joanna M |
author_sort | Tang, Rui |
collection | PubMed |
description | Random forest (RF) analysis of genetic data does not require specification of the mode of inheritance, and provides measures of variable importance that incorporate interaction effects. In this paper we describe RF-based approaches for assessment of gene and haplotype importance, and apply these approaches to a subset of the North American Rheumatoid Arthritis Consortium case-control data provided by Genetic Analysis Workshop 16. The RF analyses of 37 genes identified many of the same genes as logistic regression, but also suggested importance of certain single-nucleotide polymorphism and genes that were not ranked highly by logistic regression. A new permutation method did not reveal strong evidence of gene-gene interaction effects in these data. Although RFs are a promising approach for genetic data analysis, extensions beyond simple single-nucleotide polymorphism analyses and modifications to improve computational feasibility are needed. |
format | Text |
id | pubmed-2795969 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-27959692009-12-18 Identification of genes and haplotypes that predict rheumatoid arthritis using random forests Tang, Rui Sinnwell, Jason P Li, Jia Rider, David N de Andrade, Mariza Biernacka, Joanna M BMC Proc Proceedings Random forest (RF) analysis of genetic data does not require specification of the mode of inheritance, and provides measures of variable importance that incorporate interaction effects. In this paper we describe RF-based approaches for assessment of gene and haplotype importance, and apply these approaches to a subset of the North American Rheumatoid Arthritis Consortium case-control data provided by Genetic Analysis Workshop 16. The RF analyses of 37 genes identified many of the same genes as logistic regression, but also suggested importance of certain single-nucleotide polymorphism and genes that were not ranked highly by logistic regression. A new permutation method did not reveal strong evidence of gene-gene interaction effects in these data. Although RFs are a promising approach for genetic data analysis, extensions beyond simple single-nucleotide polymorphism analyses and modifications to improve computational feasibility are needed. BioMed Central 2009-12-15 /pmc/articles/PMC2795969/ /pubmed/20018062 Text en Copyright ©2009 Tang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Tang, Rui Sinnwell, Jason P Li, Jia Rider, David N de Andrade, Mariza Biernacka, Joanna M Identification of genes and haplotypes that predict rheumatoid arthritis using random forests |
title | Identification of genes and haplotypes that predict rheumatoid arthritis using random forests |
title_full | Identification of genes and haplotypes that predict rheumatoid arthritis using random forests |
title_fullStr | Identification of genes and haplotypes that predict rheumatoid arthritis using random forests |
title_full_unstemmed | Identification of genes and haplotypes that predict rheumatoid arthritis using random forests |
title_short | Identification of genes and haplotypes that predict rheumatoid arthritis using random forests |
title_sort | identification of genes and haplotypes that predict rheumatoid arthritis using random forests |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795969/ https://www.ncbi.nlm.nih.gov/pubmed/20018062 |
work_keys_str_mv | AT tangrui identificationofgenesandhaplotypesthatpredictrheumatoidarthritisusingrandomforests AT sinnwelljasonp identificationofgenesandhaplotypesthatpredictrheumatoidarthritisusingrandomforests AT lijia identificationofgenesandhaplotypesthatpredictrheumatoidarthritisusingrandomforests AT riderdavidn identificationofgenesandhaplotypesthatpredictrheumatoidarthritisusingrandomforests AT deandrademariza identificationofgenesandhaplotypesthatpredictrheumatoidarthritisusingrandomforests AT biernackajoannam identificationofgenesandhaplotypesthatpredictrheumatoidarthritisusingrandomforests |