Cargando…

Identification of genes and haplotypes that predict rheumatoid arthritis using random forests

Random forest (RF) analysis of genetic data does not require specification of the mode of inheritance, and provides measures of variable importance that incorporate interaction effects. In this paper we describe RF-based approaches for assessment of gene and haplotype importance, and apply these app...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Rui, Sinnwell, Jason P, Li, Jia, Rider, David N, de Andrade, Mariza, Biernacka, Joanna M
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795969/
https://www.ncbi.nlm.nih.gov/pubmed/20018062
_version_ 1782175482100318208
author Tang, Rui
Sinnwell, Jason P
Li, Jia
Rider, David N
de Andrade, Mariza
Biernacka, Joanna M
author_facet Tang, Rui
Sinnwell, Jason P
Li, Jia
Rider, David N
de Andrade, Mariza
Biernacka, Joanna M
author_sort Tang, Rui
collection PubMed
description Random forest (RF) analysis of genetic data does not require specification of the mode of inheritance, and provides measures of variable importance that incorporate interaction effects. In this paper we describe RF-based approaches for assessment of gene and haplotype importance, and apply these approaches to a subset of the North American Rheumatoid Arthritis Consortium case-control data provided by Genetic Analysis Workshop 16. The RF analyses of 37 genes identified many of the same genes as logistic regression, but also suggested importance of certain single-nucleotide polymorphism and genes that were not ranked highly by logistic regression. A new permutation method did not reveal strong evidence of gene-gene interaction effects in these data. Although RFs are a promising approach for genetic data analysis, extensions beyond simple single-nucleotide polymorphism analyses and modifications to improve computational feasibility are needed.
format Text
id pubmed-2795969
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27959692009-12-18 Identification of genes and haplotypes that predict rheumatoid arthritis using random forests Tang, Rui Sinnwell, Jason P Li, Jia Rider, David N de Andrade, Mariza Biernacka, Joanna M BMC Proc Proceedings Random forest (RF) analysis of genetic data does not require specification of the mode of inheritance, and provides measures of variable importance that incorporate interaction effects. In this paper we describe RF-based approaches for assessment of gene and haplotype importance, and apply these approaches to a subset of the North American Rheumatoid Arthritis Consortium case-control data provided by Genetic Analysis Workshop 16. The RF analyses of 37 genes identified many of the same genes as logistic regression, but also suggested importance of certain single-nucleotide polymorphism and genes that were not ranked highly by logistic regression. A new permutation method did not reveal strong evidence of gene-gene interaction effects in these data. Although RFs are a promising approach for genetic data analysis, extensions beyond simple single-nucleotide polymorphism analyses and modifications to improve computational feasibility are needed. BioMed Central 2009-12-15 /pmc/articles/PMC2795969/ /pubmed/20018062 Text en Copyright ©2009 Tang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Tang, Rui
Sinnwell, Jason P
Li, Jia
Rider, David N
de Andrade, Mariza
Biernacka, Joanna M
Identification of genes and haplotypes that predict rheumatoid arthritis using random forests
title Identification of genes and haplotypes that predict rheumatoid arthritis using random forests
title_full Identification of genes and haplotypes that predict rheumatoid arthritis using random forests
title_fullStr Identification of genes and haplotypes that predict rheumatoid arthritis using random forests
title_full_unstemmed Identification of genes and haplotypes that predict rheumatoid arthritis using random forests
title_short Identification of genes and haplotypes that predict rheumatoid arthritis using random forests
title_sort identification of genes and haplotypes that predict rheumatoid arthritis using random forests
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795969/
https://www.ncbi.nlm.nih.gov/pubmed/20018062
work_keys_str_mv AT tangrui identificationofgenesandhaplotypesthatpredictrheumatoidarthritisusingrandomforests
AT sinnwelljasonp identificationofgenesandhaplotypesthatpredictrheumatoidarthritisusingrandomforests
AT lijia identificationofgenesandhaplotypesthatpredictrheumatoidarthritisusingrandomforests
AT riderdavidn identificationofgenesandhaplotypesthatpredictrheumatoidarthritisusingrandomforests
AT deandrademariza identificationofgenesandhaplotypesthatpredictrheumatoidarthritisusingrandomforests
AT biernackajoannam identificationofgenesandhaplotypesthatpredictrheumatoidarthritisusingrandomforests