Cargando…
Single versus multiple imputation for genotypic data
Due to the growing need to combine data across multiple studies and to impute untyped markers based on a reference sample, several analytical tools for imputation and analysis of missing genotypes have been developed. Current imputation methods rely on single imputation, which ignores the variation...
Autores principales: | , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795971/ https://www.ncbi.nlm.nih.gov/pubmed/20018064 |
_version_ | 1782175482590003200 |
---|---|
author | Fridley, Brooke L McDonnell, Shannon K Rabe, Kari G Tang, Rui Biernacka, Joanna M Sinnwell, Jason P Rider, David N Goode, Ellen L |
author_facet | Fridley, Brooke L McDonnell, Shannon K Rabe, Kari G Tang, Rui Biernacka, Joanna M Sinnwell, Jason P Rider, David N Goode, Ellen L |
author_sort | Fridley, Brooke L |
collection | PubMed |
description | Due to the growing need to combine data across multiple studies and to impute untyped markers based on a reference sample, several analytical tools for imputation and analysis of missing genotypes have been developed. Current imputation methods rely on single imputation, which ignores the variation in estimation due to imputation. An alternative to single imputation is multiple imputation. In this paper, we assess the variation in imputation by completing both single and multiple imputations of genotypic data using MACH, a commonly used hidden Markov model imputation method. Using data from the North American Rheumatoid Arthritis Consortium genome-wide study, the use of single and multiple imputation was assessed in four regions of chromosome 1 with varying levels of linkage disequilibrium and association signals. Two scenarios for missing genotypic data were assessed: imputation of untyped markers and combination of genotypic data from two studies. This limited study involving four regions indicates that, contrary to expectations, multiple imputations may not be necessary. |
format | Text |
id | pubmed-2795971 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-27959712009-12-18 Single versus multiple imputation for genotypic data Fridley, Brooke L McDonnell, Shannon K Rabe, Kari G Tang, Rui Biernacka, Joanna M Sinnwell, Jason P Rider, David N Goode, Ellen L BMC Proc Proceedings Due to the growing need to combine data across multiple studies and to impute untyped markers based on a reference sample, several analytical tools for imputation and analysis of missing genotypes have been developed. Current imputation methods rely on single imputation, which ignores the variation in estimation due to imputation. An alternative to single imputation is multiple imputation. In this paper, we assess the variation in imputation by completing both single and multiple imputations of genotypic data using MACH, a commonly used hidden Markov model imputation method. Using data from the North American Rheumatoid Arthritis Consortium genome-wide study, the use of single and multiple imputation was assessed in four regions of chromosome 1 with varying levels of linkage disequilibrium and association signals. Two scenarios for missing genotypic data were assessed: imputation of untyped markers and combination of genotypic data from two studies. This limited study involving four regions indicates that, contrary to expectations, multiple imputations may not be necessary. BioMed Central 2009-12-15 /pmc/articles/PMC2795971/ /pubmed/20018064 Text en Copyright ©2009 Fridley et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Fridley, Brooke L McDonnell, Shannon K Rabe, Kari G Tang, Rui Biernacka, Joanna M Sinnwell, Jason P Rider, David N Goode, Ellen L Single versus multiple imputation for genotypic data |
title | Single versus multiple imputation for genotypic data |
title_full | Single versus multiple imputation for genotypic data |
title_fullStr | Single versus multiple imputation for genotypic data |
title_full_unstemmed | Single versus multiple imputation for genotypic data |
title_short | Single versus multiple imputation for genotypic data |
title_sort | single versus multiple imputation for genotypic data |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795971/ https://www.ncbi.nlm.nih.gov/pubmed/20018064 |
work_keys_str_mv | AT fridleybrookel singleversusmultipleimputationforgenotypicdata AT mcdonnellshannonk singleversusmultipleimputationforgenotypicdata AT rabekarig singleversusmultipleimputationforgenotypicdata AT tangrui singleversusmultipleimputationforgenotypicdata AT biernackajoannam singleversusmultipleimputationforgenotypicdata AT sinnwelljasonp singleversusmultipleimputationforgenotypicdata AT riderdavidn singleversusmultipleimputationforgenotypicdata AT goodeellenl singleversusmultipleimputationforgenotypicdata |