Cargando…

Evaluation of single-nucleotide polymorphism imputation using random forests

Genome-wide association studies (GWAS) have helped to reveal genetic mechanisms of complex diseases. Although commonly used genotyping technology enables us to determine up to a million single-nucleotide polymorphisms (SNPs), causative variants are typically not genotyped directly. A favored approac...

Descripción completa

Detalles Bibliográficos
Autores principales: Schwarz, Daniel F, Szymczak, Silke, Ziegler, Andreas, König, Inke R
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795966/
https://www.ncbi.nlm.nih.gov/pubmed/20018059
_version_ 1782175481414549504
author Schwarz, Daniel F
Szymczak, Silke
Ziegler, Andreas
König, Inke R
author_facet Schwarz, Daniel F
Szymczak, Silke
Ziegler, Andreas
König, Inke R
author_sort Schwarz, Daniel F
collection PubMed
description Genome-wide association studies (GWAS) have helped to reveal genetic mechanisms of complex diseases. Although commonly used genotyping technology enables us to determine up to a million single-nucleotide polymorphisms (SNPs), causative variants are typically not genotyped directly. A favored approach to increase the power of genome-wide association studies is to impute the untyped SNPs using more complete genotype data of a reference population. Random forests (RF) provides an internal method for replacing missing genotypes. A forest of classification trees is used to determine similarities of probands regarding their genotypes. These proximities are then used to impute genotypes of untyped SNPs. We evaluated this approach using genotype data of the Framingham Heart Study provided as Problem 2 for Genetic Analysis Workshop 16 and the Caucasian HapMap samples as reference population. Our results indicate that RFs are faster but less accurate than alternative approaches for imputing untyped SNPs.
format Text
id pubmed-2795966
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27959662009-12-18 Evaluation of single-nucleotide polymorphism imputation using random forests Schwarz, Daniel F Szymczak, Silke Ziegler, Andreas König, Inke R BMC Proc Proceedings Genome-wide association studies (GWAS) have helped to reveal genetic mechanisms of complex diseases. Although commonly used genotyping technology enables us to determine up to a million single-nucleotide polymorphisms (SNPs), causative variants are typically not genotyped directly. A favored approach to increase the power of genome-wide association studies is to impute the untyped SNPs using more complete genotype data of a reference population. Random forests (RF) provides an internal method for replacing missing genotypes. A forest of classification trees is used to determine similarities of probands regarding their genotypes. These proximities are then used to impute genotypes of untyped SNPs. We evaluated this approach using genotype data of the Framingham Heart Study provided as Problem 2 for Genetic Analysis Workshop 16 and the Caucasian HapMap samples as reference population. Our results indicate that RFs are faster but less accurate than alternative approaches for imputing untyped SNPs. BioMed Central 2009-12-15 /pmc/articles/PMC2795966/ /pubmed/20018059 Text en Copyright ©2009 Schwarz et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Schwarz, Daniel F
Szymczak, Silke
Ziegler, Andreas
König, Inke R
Evaluation of single-nucleotide polymorphism imputation using random forests
title Evaluation of single-nucleotide polymorphism imputation using random forests
title_full Evaluation of single-nucleotide polymorphism imputation using random forests
title_fullStr Evaluation of single-nucleotide polymorphism imputation using random forests
title_full_unstemmed Evaluation of single-nucleotide polymorphism imputation using random forests
title_short Evaluation of single-nucleotide polymorphism imputation using random forests
title_sort evaluation of single-nucleotide polymorphism imputation using random forests
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795966/
https://www.ncbi.nlm.nih.gov/pubmed/20018059
work_keys_str_mv AT schwarzdanielf evaluationofsinglenucleotidepolymorphismimputationusingrandomforests
AT szymczaksilke evaluationofsinglenucleotidepolymorphismimputationusingrandomforests
AT zieglerandreas evaluationofsinglenucleotidepolymorphismimputationusingrandomforests
AT koniginker evaluationofsinglenucleotidepolymorphismimputationusingrandomforests