Cargando…

Evaluation of the imputation performance of the program IMPUTE in an admixed sample from Mexico City using several model designs

BACKGROUND: We explored the imputation performance of the program IMPUTE in an admixed sample from Mexico City. The following issues were evaluated: (a) the impact of different reference panels (HapMap vs. 1000 Genomes) on imputation; (b) potential differences in imputation performance between singl...

Descripción completa

Detalles Bibliográficos
Autores principales: Krithika, S, Valladares-Salgado, Adán, Peralta, Jesus, La Peña, Jorge Escobedo-de, Kumate-Rodríguez, Jesus, Cruz, Miguel, Parra, Esteban J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3436779/
https://www.ncbi.nlm.nih.gov/pubmed/22549150
http://dx.doi.org/10.1186/1755-8794-5-12
_version_ 1782242695266172928
author Krithika, S
Valladares-Salgado, Adán
Peralta, Jesus
La Peña, Jorge Escobedo-de
Kumate-Rodríguez, Jesus
Cruz, Miguel
Parra, Esteban J
author_facet Krithika, S
Valladares-Salgado, Adán
Peralta, Jesus
La Peña, Jorge Escobedo-de
Kumate-Rodríguez, Jesus
Cruz, Miguel
Parra, Esteban J
author_sort Krithika, S
collection PubMed
description BACKGROUND: We explored the imputation performance of the program IMPUTE in an admixed sample from Mexico City. The following issues were evaluated: (a) the impact of different reference panels (HapMap vs. 1000 Genomes) on imputation; (b) potential differences in imputation performance between single-step vs. two-step (phasing and imputation) approaches; (c) the effect of different INFO score thresholds on imputation performance and (d) imputation performance in common vs. rare markers. METHODS: The sample from Mexico City comprised 1,310 individuals genotyped with the Affymetrix 5.0 array. We randomly masked 5% of the markers directly genotyped on chromosome 12 (n = 1,046) and compared the imputed genotypes with the microarray genotype calls. Imputation was carried out with the program IMPUTE. The concordance rates between the imputed and observed genotypes were used as a measure of imputation accuracy and the proportion of non-missing genotypes as a measure of imputation efficacy. RESULTS: The single-step imputation approach produced slightly higher concordance rates than the two-step strategy (99.1% vs. 98.4% when using the HapMap phase II combined panel), but at the expense of a lower proportion of non-missing genotypes (85.5% vs. 90.1%). The 1,000 Genomes reference sample produced similar concordance rates to the HapMap phase II panel (98.4% for both datasets, using the two-step strategy). However, the 1000 Genomes reference sample increased substantially the proportion of non-missing genotypes (94.7% vs. 90.1%). Rare variants (<1%) had lower imputation accuracy and efficacy than common markers. CONCLUSIONS: The program IMPUTE had an excellent imputation performance for common alleles in an admixed sample from Mexico City, which has primarily Native American (62%) and European (33%) contributions. Genotype concordances were higher than 98.4% using all the imputation strategies, in spite of the fact that no Native American samples are present in the HapMap and 1000 Genomes reference panels. The best balance of imputation accuracy and efficiency was obtained with the 1,000 Genomes panel. Rare variants were not captured effectively by any of the available panels, emphasizing the need to be cautious in the interpretation of association results for imputed rare variants.
format Online
Article
Text
id pubmed-3436779
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34367792012-09-08 Evaluation of the imputation performance of the program IMPUTE in an admixed sample from Mexico City using several model designs Krithika, S Valladares-Salgado, Adán Peralta, Jesus La Peña, Jorge Escobedo-de Kumate-Rodríguez, Jesus Cruz, Miguel Parra, Esteban J BMC Med Genomics Research Article BACKGROUND: We explored the imputation performance of the program IMPUTE in an admixed sample from Mexico City. The following issues were evaluated: (a) the impact of different reference panels (HapMap vs. 1000 Genomes) on imputation; (b) potential differences in imputation performance between single-step vs. two-step (phasing and imputation) approaches; (c) the effect of different INFO score thresholds on imputation performance and (d) imputation performance in common vs. rare markers. METHODS: The sample from Mexico City comprised 1,310 individuals genotyped with the Affymetrix 5.0 array. We randomly masked 5% of the markers directly genotyped on chromosome 12 (n = 1,046) and compared the imputed genotypes with the microarray genotype calls. Imputation was carried out with the program IMPUTE. The concordance rates between the imputed and observed genotypes were used as a measure of imputation accuracy and the proportion of non-missing genotypes as a measure of imputation efficacy. RESULTS: The single-step imputation approach produced slightly higher concordance rates than the two-step strategy (99.1% vs. 98.4% when using the HapMap phase II combined panel), but at the expense of a lower proportion of non-missing genotypes (85.5% vs. 90.1%). The 1,000 Genomes reference sample produced similar concordance rates to the HapMap phase II panel (98.4% for both datasets, using the two-step strategy). However, the 1000 Genomes reference sample increased substantially the proportion of non-missing genotypes (94.7% vs. 90.1%). Rare variants (<1%) had lower imputation accuracy and efficacy than common markers. CONCLUSIONS: The program IMPUTE had an excellent imputation performance for common alleles in an admixed sample from Mexico City, which has primarily Native American (62%) and European (33%) contributions. Genotype concordances were higher than 98.4% using all the imputation strategies, in spite of the fact that no Native American samples are present in the HapMap and 1000 Genomes reference panels. The best balance of imputation accuracy and efficiency was obtained with the 1,000 Genomes panel. Rare variants were not captured effectively by any of the available panels, emphasizing the need to be cautious in the interpretation of association results for imputed rare variants. BioMed Central 2012-05-01 /pmc/articles/PMC3436779/ /pubmed/22549150 http://dx.doi.org/10.1186/1755-8794-5-12 Text en Copyright ©2012 Krithika et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Krithika, S
Valladares-Salgado, Adán
Peralta, Jesus
La Peña, Jorge Escobedo-de
Kumate-Rodríguez, Jesus
Cruz, Miguel
Parra, Esteban J
Evaluation of the imputation performance of the program IMPUTE in an admixed sample from Mexico City using several model designs
title Evaluation of the imputation performance of the program IMPUTE in an admixed sample from Mexico City using several model designs
title_full Evaluation of the imputation performance of the program IMPUTE in an admixed sample from Mexico City using several model designs
title_fullStr Evaluation of the imputation performance of the program IMPUTE in an admixed sample from Mexico City using several model designs
title_full_unstemmed Evaluation of the imputation performance of the program IMPUTE in an admixed sample from Mexico City using several model designs
title_short Evaluation of the imputation performance of the program IMPUTE in an admixed sample from Mexico City using several model designs
title_sort evaluation of the imputation performance of the program impute in an admixed sample from mexico city using several model designs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3436779/
https://www.ncbi.nlm.nih.gov/pubmed/22549150
http://dx.doi.org/10.1186/1755-8794-5-12
work_keys_str_mv AT krithikas evaluationoftheimputationperformanceoftheprogramimputeinanadmixedsamplefrommexicocityusingseveralmodeldesigns
AT valladaressalgadoadan evaluationoftheimputationperformanceoftheprogramimputeinanadmixedsamplefrommexicocityusingseveralmodeldesigns
AT peraltajesus evaluationoftheimputationperformanceoftheprogramimputeinanadmixedsamplefrommexicocityusingseveralmodeldesigns
AT lapenajorgeescobedode evaluationoftheimputationperformanceoftheprogramimputeinanadmixedsamplefrommexicocityusingseveralmodeldesigns
AT kumaterodriguezjesus evaluationoftheimputationperformanceoftheprogramimputeinanadmixedsamplefrommexicocityusingseveralmodeldesigns
AT cruzmiguel evaluationoftheimputationperformanceoftheprogramimputeinanadmixedsamplefrommexicocityusingseveralmodeldesigns
AT parraestebanj evaluationoftheimputationperformanceoftheprogramimputeinanadmixedsamplefrommexicocityusingseveralmodeldesigns