Cargando…

Imputation methods for missing data for polygenic models

Methods to handle missing data have been an area of statistical research for many years. Little has been done within the context of pedigree analysis. In this paper we present two methods for imputing missing data for polygenic models using family data. The imputation schemes take into account famil...

Descripción completa

Detalles Bibliográficos
Autores principales:	Fridley, Brooke, Rabe, Kari, Andrade, Mariza de
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2003
Materias:	Proceedings
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1866478/ https://www.ncbi.nlm.nih.gov/pubmed/14975110 http://dx.doi.org/10.1186/1471-2156-4-S1-S42

_version_	1782133279281905664
author	Fridley, Brooke Rabe, Kari Andrade, Mariza de
author_facet	Fridley, Brooke Rabe, Kari Andrade, Mariza de
author_sort	Fridley, Brooke
collection	PubMed
description	Methods to handle missing data have been an area of statistical research for many years. Little has been done within the context of pedigree analysis. In this paper we present two methods for imputing missing data for polygenic models using family data. The imputation schemes take into account familial relationships and use the observed familial information for the imputation. A traditional multiple imputation approach and multiple imputation or data augmentation approach within a Gibbs sampler for the handling of missing data for a polygenic model are presented. We used both the Genetic Analysis Workshop 13 simulated missing phenotype and the complete phenotype data sets as the means to illustrate the two methods. We looked at the phenotypic trait systolic blood pressure and the covariate gender at time point 11 (1970) for Cohort 1 and time point 1 (1971) for Cohort 2. Comparing the results for three replicates of complete and missing data incorporating multiple imputation, we find that multiple imputation via a Gibbs sampler produces more accurate results. Thus, we recommend the Gibbs sampler for imputation purposes because of the ease with which it can be extended to more complicated models, the consistency of the results, and the accountability of the variation due to imputation.
format	Text
id	pubmed-1866478
institution	National Center for Biotechnology Information
language	English
publishDate	2003
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-18664782007-05-11 Imputation methods for missing data for polygenic models Fridley, Brooke Rabe, Kari Andrade, Mariza de BMC Genet Proceedings Methods to handle missing data have been an area of statistical research for many years. Little has been done within the context of pedigree analysis. In this paper we present two methods for imputing missing data for polygenic models using family data. The imputation schemes take into account familial relationships and use the observed familial information for the imputation. A traditional multiple imputation approach and multiple imputation or data augmentation approach within a Gibbs sampler for the handling of missing data for a polygenic model are presented. We used both the Genetic Analysis Workshop 13 simulated missing phenotype and the complete phenotype data sets as the means to illustrate the two methods. We looked at the phenotypic trait systolic blood pressure and the covariate gender at time point 11 (1970) for Cohort 1 and time point 1 (1971) for Cohort 2. Comparing the results for three replicates of complete and missing data incorporating multiple imputation, we find that multiple imputation via a Gibbs sampler produces more accurate results. Thus, we recommend the Gibbs sampler for imputation purposes because of the ease with which it can be extended to more complicated models, the consistency of the results, and the accountability of the variation due to imputation. BioMed Central 2003-12-31 /pmc/articles/PMC1866478/ /pubmed/14975110 http://dx.doi.org/10.1186/1471-2156-4-S1-S42 Text en Copyright © 2003 Fridley et al; licensee BioMed Central Ltd http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Proceedings Fridley, Brooke Rabe, Kari Andrade, Mariza de Imputation methods for missing data for polygenic models
title	Imputation methods for missing data for polygenic models
title_full	Imputation methods for missing data for polygenic models
title_fullStr	Imputation methods for missing data for polygenic models
title_full_unstemmed	Imputation methods for missing data for polygenic models
title_short	Imputation methods for missing data for polygenic models
title_sort	imputation methods for missing data for polygenic models
topic	Proceedings
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1866478/ https://www.ncbi.nlm.nih.gov/pubmed/14975110 http://dx.doi.org/10.1186/1471-2156-4-S1-S42
work_keys_str_mv	AT fridleybrooke imputationmethodsformissingdataforpolygenicmodels AT rabekari imputationmethodsformissingdataforpolygenicmodels AT andrademarizade imputationmethodsformissingdataforpolygenicmodels

Imputation methods for missing data for polygenic models

Ejemplares similares