Cargando…

Assessing Accuracy of Genotype Imputation in American Indians

BACKGROUND: Genotype imputation is commonly used in genetic association studies to test untyped variants using information on linkage disequilibrium (LD) with typed markers. Imputing genotypes requires a suitable reference population in which the LD pattern is known, most often one selected from Hap...

Descripción completa

Detalles Bibliográficos
Autores principales: Malhotra, Alka, Kobes, Sayuko, Bogardus, Clifton, Knowler, William C., Baier, Leslie J., Hanson, Robert L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4094523/
https://www.ncbi.nlm.nih.gov/pubmed/25014012
http://dx.doi.org/10.1371/journal.pone.0102544
_version_ 1782325845576122368
author Malhotra, Alka
Kobes, Sayuko
Bogardus, Clifton
Knowler, William C.
Baier, Leslie J.
Hanson, Robert L.
author_facet Malhotra, Alka
Kobes, Sayuko
Bogardus, Clifton
Knowler, William C.
Baier, Leslie J.
Hanson, Robert L.
author_sort Malhotra, Alka
collection PubMed
description BACKGROUND: Genotype imputation is commonly used in genetic association studies to test untyped variants using information on linkage disequilibrium (LD) with typed markers. Imputing genotypes requires a suitable reference population in which the LD pattern is known, most often one selected from HapMap. However, some populations, such as American Indians, are not represented in HapMap. In the present study, we assessed accuracy of imputation using HapMap reference populations in a genome-wide association study in Pima Indians. RESULTS: Data from six randomly selected chromosomes were used. Genotypes in the study population were masked (either 1% or 20% of SNPs available for a given chromosome). The masked genotypes were then imputed using the software Markov Chain Haplotyping Algorithm. Using four HapMap reference populations, average genotype error rates ranged from 7.86% for Mexican Americans to 22.30% for Yoruba. In contrast, use of the original Pima Indian data as a reference resulted in an average error rate of 1.73%. CONCLUSIONS: Our results suggest that the use of HapMap reference populations results in substantial inaccuracy in the imputation of genotypes in American Indians. A possible solution would be to densely genotype or sequence a reference American Indian population.
format Online
Article
Text
id pubmed-4094523
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-40945232014-07-15 Assessing Accuracy of Genotype Imputation in American Indians Malhotra, Alka Kobes, Sayuko Bogardus, Clifton Knowler, William C. Baier, Leslie J. Hanson, Robert L. PLoS One Research Article BACKGROUND: Genotype imputation is commonly used in genetic association studies to test untyped variants using information on linkage disequilibrium (LD) with typed markers. Imputing genotypes requires a suitable reference population in which the LD pattern is known, most often one selected from HapMap. However, some populations, such as American Indians, are not represented in HapMap. In the present study, we assessed accuracy of imputation using HapMap reference populations in a genome-wide association study in Pima Indians. RESULTS: Data from six randomly selected chromosomes were used. Genotypes in the study population were masked (either 1% or 20% of SNPs available for a given chromosome). The masked genotypes were then imputed using the software Markov Chain Haplotyping Algorithm. Using four HapMap reference populations, average genotype error rates ranged from 7.86% for Mexican Americans to 22.30% for Yoruba. In contrast, use of the original Pima Indian data as a reference resulted in an average error rate of 1.73%. CONCLUSIONS: Our results suggest that the use of HapMap reference populations results in substantial inaccuracy in the imputation of genotypes in American Indians. A possible solution would be to densely genotype or sequence a reference American Indian population. Public Library of Science 2014-07-11 /pmc/articles/PMC4094523/ /pubmed/25014012 http://dx.doi.org/10.1371/journal.pone.0102544 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Research Article
Malhotra, Alka
Kobes, Sayuko
Bogardus, Clifton
Knowler, William C.
Baier, Leslie J.
Hanson, Robert L.
Assessing Accuracy of Genotype Imputation in American Indians
title Assessing Accuracy of Genotype Imputation in American Indians
title_full Assessing Accuracy of Genotype Imputation in American Indians
title_fullStr Assessing Accuracy of Genotype Imputation in American Indians
title_full_unstemmed Assessing Accuracy of Genotype Imputation in American Indians
title_short Assessing Accuracy of Genotype Imputation in American Indians
title_sort assessing accuracy of genotype imputation in american indians
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4094523/
https://www.ncbi.nlm.nih.gov/pubmed/25014012
http://dx.doi.org/10.1371/journal.pone.0102544
work_keys_str_mv AT malhotraalka assessingaccuracyofgenotypeimputationinamericanindians
AT kobessayuko assessingaccuracyofgenotypeimputationinamericanindians
AT bogardusclifton assessingaccuracyofgenotypeimputationinamericanindians
AT knowlerwilliamc assessingaccuracyofgenotypeimputationinamericanindians
AT baierlesliej assessingaccuracyofgenotypeimputationinamericanindians
AT hansonrobertl assessingaccuracyofgenotypeimputationinamericanindians