Cargando…
Evaluation of single-nucleotide polymorphism imputation using random forests
Genome-wide association studies (GWAS) have helped to reveal genetic mechanisms of complex diseases. Although commonly used genotyping technology enables us to determine up to a million single-nucleotide polymorphisms (SNPs), causative variants are typically not genotyped directly. A favored approac...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795966/ https://www.ncbi.nlm.nih.gov/pubmed/20018059 |
_version_ | 1782175481414549504 |
---|---|
author | Schwarz, Daniel F Szymczak, Silke Ziegler, Andreas König, Inke R |
author_facet | Schwarz, Daniel F Szymczak, Silke Ziegler, Andreas König, Inke R |
author_sort | Schwarz, Daniel F |
collection | PubMed |
description | Genome-wide association studies (GWAS) have helped to reveal genetic mechanisms of complex diseases. Although commonly used genotyping technology enables us to determine up to a million single-nucleotide polymorphisms (SNPs), causative variants are typically not genotyped directly. A favored approach to increase the power of genome-wide association studies is to impute the untyped SNPs using more complete genotype data of a reference population. Random forests (RF) provides an internal method for replacing missing genotypes. A forest of classification trees is used to determine similarities of probands regarding their genotypes. These proximities are then used to impute genotypes of untyped SNPs. We evaluated this approach using genotype data of the Framingham Heart Study provided as Problem 2 for Genetic Analysis Workshop 16 and the Caucasian HapMap samples as reference population. Our results indicate that RFs are faster but less accurate than alternative approaches for imputing untyped SNPs. |
format | Text |
id | pubmed-2795966 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-27959662009-12-18 Evaluation of single-nucleotide polymorphism imputation using random forests Schwarz, Daniel F Szymczak, Silke Ziegler, Andreas König, Inke R BMC Proc Proceedings Genome-wide association studies (GWAS) have helped to reveal genetic mechanisms of complex diseases. Although commonly used genotyping technology enables us to determine up to a million single-nucleotide polymorphisms (SNPs), causative variants are typically not genotyped directly. A favored approach to increase the power of genome-wide association studies is to impute the untyped SNPs using more complete genotype data of a reference population. Random forests (RF) provides an internal method for replacing missing genotypes. A forest of classification trees is used to determine similarities of probands regarding their genotypes. These proximities are then used to impute genotypes of untyped SNPs. We evaluated this approach using genotype data of the Framingham Heart Study provided as Problem 2 for Genetic Analysis Workshop 16 and the Caucasian HapMap samples as reference population. Our results indicate that RFs are faster but less accurate than alternative approaches for imputing untyped SNPs. BioMed Central 2009-12-15 /pmc/articles/PMC2795966/ /pubmed/20018059 Text en Copyright ©2009 Schwarz et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Schwarz, Daniel F Szymczak, Silke Ziegler, Andreas König, Inke R Evaluation of single-nucleotide polymorphism imputation using random forests |
title | Evaluation of single-nucleotide polymorphism imputation using random forests |
title_full | Evaluation of single-nucleotide polymorphism imputation using random forests |
title_fullStr | Evaluation of single-nucleotide polymorphism imputation using random forests |
title_full_unstemmed | Evaluation of single-nucleotide polymorphism imputation using random forests |
title_short | Evaluation of single-nucleotide polymorphism imputation using random forests |
title_sort | evaluation of single-nucleotide polymorphism imputation using random forests |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795966/ https://www.ncbi.nlm.nih.gov/pubmed/20018059 |
work_keys_str_mv | AT schwarzdanielf evaluationofsinglenucleotidepolymorphismimputationusingrandomforests AT szymczaksilke evaluationofsinglenucleotidepolymorphismimputationusingrandomforests AT zieglerandreas evaluationofsinglenucleotidepolymorphismimputationusingrandomforests AT koniginker evaluationofsinglenucleotidepolymorphismimputationusingrandomforests |