Cargando…

Single versus multiple imputation for genotypic data

Due to the growing need to combine data across multiple studies and to impute untyped markers based on a reference sample, several analytical tools for imputation and analysis of missing genotypes have been developed. Current imputation methods rely on single imputation, which ignores the variation...

Descripción completa

Detalles Bibliográficos
Autores principales: Fridley, Brooke L, McDonnell, Shannon K, Rabe, Kari G, Tang, Rui, Biernacka, Joanna M, Sinnwell, Jason P, Rider, David N, Goode, Ellen L
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795971/
https://www.ncbi.nlm.nih.gov/pubmed/20018064
_version_ 1782175482590003200
author Fridley, Brooke L
McDonnell, Shannon K
Rabe, Kari G
Tang, Rui
Biernacka, Joanna M
Sinnwell, Jason P
Rider, David N
Goode, Ellen L
author_facet Fridley, Brooke L
McDonnell, Shannon K
Rabe, Kari G
Tang, Rui
Biernacka, Joanna M
Sinnwell, Jason P
Rider, David N
Goode, Ellen L
author_sort Fridley, Brooke L
collection PubMed
description Due to the growing need to combine data across multiple studies and to impute untyped markers based on a reference sample, several analytical tools for imputation and analysis of missing genotypes have been developed. Current imputation methods rely on single imputation, which ignores the variation in estimation due to imputation. An alternative to single imputation is multiple imputation. In this paper, we assess the variation in imputation by completing both single and multiple imputations of genotypic data using MACH, a commonly used hidden Markov model imputation method. Using data from the North American Rheumatoid Arthritis Consortium genome-wide study, the use of single and multiple imputation was assessed in four regions of chromosome 1 with varying levels of linkage disequilibrium and association signals. Two scenarios for missing genotypic data were assessed: imputation of untyped markers and combination of genotypic data from two studies. This limited study involving four regions indicates that, contrary to expectations, multiple imputations may not be necessary.
format Text
id pubmed-2795971
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27959712009-12-18 Single versus multiple imputation for genotypic data Fridley, Brooke L McDonnell, Shannon K Rabe, Kari G Tang, Rui Biernacka, Joanna M Sinnwell, Jason P Rider, David N Goode, Ellen L BMC Proc Proceedings Due to the growing need to combine data across multiple studies and to impute untyped markers based on a reference sample, several analytical tools for imputation and analysis of missing genotypes have been developed. Current imputation methods rely on single imputation, which ignores the variation in estimation due to imputation. An alternative to single imputation is multiple imputation. In this paper, we assess the variation in imputation by completing both single and multiple imputations of genotypic data using MACH, a commonly used hidden Markov model imputation method. Using data from the North American Rheumatoid Arthritis Consortium genome-wide study, the use of single and multiple imputation was assessed in four regions of chromosome 1 with varying levels of linkage disequilibrium and association signals. Two scenarios for missing genotypic data were assessed: imputation of untyped markers and combination of genotypic data from two studies. This limited study involving four regions indicates that, contrary to expectations, multiple imputations may not be necessary. BioMed Central 2009-12-15 /pmc/articles/PMC2795971/ /pubmed/20018064 Text en Copyright ©2009 Fridley et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Fridley, Brooke L
McDonnell, Shannon K
Rabe, Kari G
Tang, Rui
Biernacka, Joanna M
Sinnwell, Jason P
Rider, David N
Goode, Ellen L
Single versus multiple imputation for genotypic data
title Single versus multiple imputation for genotypic data
title_full Single versus multiple imputation for genotypic data
title_fullStr Single versus multiple imputation for genotypic data
title_full_unstemmed Single versus multiple imputation for genotypic data
title_short Single versus multiple imputation for genotypic data
title_sort single versus multiple imputation for genotypic data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795971/
https://www.ncbi.nlm.nih.gov/pubmed/20018064
work_keys_str_mv AT fridleybrookel singleversusmultipleimputationforgenotypicdata
AT mcdonnellshannonk singleversusmultipleimputationforgenotypicdata
AT rabekarig singleversusmultipleimputationforgenotypicdata
AT tangrui singleversusmultipleimputationforgenotypicdata
AT biernackajoannam singleversusmultipleimputationforgenotypicdata
AT sinnwelljasonp singleversusmultipleimputationforgenotypicdata
AT riderdavidn singleversusmultipleimputationforgenotypicdata
AT goodeellenl singleversusmultipleimputationforgenotypicdata