Cargando…

Using Family-Based Imputation in Genome-Wide Association Studies with Large Complex Pedigrees: The Framingham Heart Study

Imputation has been widely used in genome-wide association studies (GWAS) to infer genotypes of un-genotyped variants based on the linkage disequilibrium in external reference panels such as the HapMap and 1000 Genomes. However, imputation has only rarely been performed based on family relationships...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Ming-Huei, Huang, Jie, Chen, Wei-Min, Larson, Martin G., Fox, Caroline S., Vasan, Ramachandran S., Seshadri, Sudha, O’Donnell, Christopher J., Yang, Qiong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3524237/
https://www.ncbi.nlm.nih.gov/pubmed/23284720
http://dx.doi.org/10.1371/journal.pone.0051589
_version_ 1782253297285988352
author Chen, Ming-Huei
Huang, Jie
Chen, Wei-Min
Larson, Martin G.
Fox, Caroline S.
Vasan, Ramachandran S.
Seshadri, Sudha
O’Donnell, Christopher J.
Yang, Qiong
author_facet Chen, Ming-Huei
Huang, Jie
Chen, Wei-Min
Larson, Martin G.
Fox, Caroline S.
Vasan, Ramachandran S.
Seshadri, Sudha
O’Donnell, Christopher J.
Yang, Qiong
author_sort Chen, Ming-Huei
collection PubMed
description Imputation has been widely used in genome-wide association studies (GWAS) to infer genotypes of un-genotyped variants based on the linkage disequilibrium in external reference panels such as the HapMap and 1000 Genomes. However, imputation has only rarely been performed based on family relationships to infer genotypes of un-genotyped individuals. Using 8998 Framingham Heart Study (FHS) participants genotyped with Affymetrix 550K SNPs, we imputed genotypes of same set of SNPs for additional 3121 participants, most of whom were never genotyped due to lack of DNA sample. Prior to imputation, 122 pedigrees were too large to be handled by the imputation software Merlin. Therefore, we developed a novel pedigree splitting algorithm that can maximize the number of genotyped relatives for imputing each un-genotyped individual, while keeping new sub-pedigrees under a pre-specified size. In GWAS of four phenotypes available in FHS (Alzheimer disease, circulating levels of fibrinogen, high-density lipoprotein cholesterol, and uric acid), we compared results using genotyped individuals only with results using both genotyped and imputed individuals. We studied the impact of applying different imputation quality filtering thresholds on the association results and did not found a universal threshold that always resulted in a more significant p-value for previously identified loci. However most of these loci had a lower p-value when we only included imputed genotypes with with ≥60% SNP- and ≥50% person-specific imputation certainty. In summary, we developed a novel algorithm for splitting large pedigrees for imputation and found a plausible imputation quality filtering threshold based on FHS. Further examination may be required to generalize this threshold to other studies.
format Online
Article
Text
id pubmed-3524237
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35242372013-01-02 Using Family-Based Imputation in Genome-Wide Association Studies with Large Complex Pedigrees: The Framingham Heart Study Chen, Ming-Huei Huang, Jie Chen, Wei-Min Larson, Martin G. Fox, Caroline S. Vasan, Ramachandran S. Seshadri, Sudha O’Donnell, Christopher J. Yang, Qiong PLoS One Research Article Imputation has been widely used in genome-wide association studies (GWAS) to infer genotypes of un-genotyped variants based on the linkage disequilibrium in external reference panels such as the HapMap and 1000 Genomes. However, imputation has only rarely been performed based on family relationships to infer genotypes of un-genotyped individuals. Using 8998 Framingham Heart Study (FHS) participants genotyped with Affymetrix 550K SNPs, we imputed genotypes of same set of SNPs for additional 3121 participants, most of whom were never genotyped due to lack of DNA sample. Prior to imputation, 122 pedigrees were too large to be handled by the imputation software Merlin. Therefore, we developed a novel pedigree splitting algorithm that can maximize the number of genotyped relatives for imputing each un-genotyped individual, while keeping new sub-pedigrees under a pre-specified size. In GWAS of four phenotypes available in FHS (Alzheimer disease, circulating levels of fibrinogen, high-density lipoprotein cholesterol, and uric acid), we compared results using genotyped individuals only with results using both genotyped and imputed individuals. We studied the impact of applying different imputation quality filtering thresholds on the association results and did not found a universal threshold that always resulted in a more significant p-value for previously identified loci. However most of these loci had a lower p-value when we only included imputed genotypes with with ≥60% SNP- and ≥50% person-specific imputation certainty. In summary, we developed a novel algorithm for splitting large pedigrees for imputation and found a plausible imputation quality filtering threshold based on FHS. Further examination may be required to generalize this threshold to other studies. Public Library of Science 2012-12-17 /pmc/articles/PMC3524237/ /pubmed/23284720 http://dx.doi.org/10.1371/journal.pone.0051589 Text en © 2012 Chen et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Chen, Ming-Huei
Huang, Jie
Chen, Wei-Min
Larson, Martin G.
Fox, Caroline S.
Vasan, Ramachandran S.
Seshadri, Sudha
O’Donnell, Christopher J.
Yang, Qiong
Using Family-Based Imputation in Genome-Wide Association Studies with Large Complex Pedigrees: The Framingham Heart Study
title Using Family-Based Imputation in Genome-Wide Association Studies with Large Complex Pedigrees: The Framingham Heart Study
title_full Using Family-Based Imputation in Genome-Wide Association Studies with Large Complex Pedigrees: The Framingham Heart Study
title_fullStr Using Family-Based Imputation in Genome-Wide Association Studies with Large Complex Pedigrees: The Framingham Heart Study
title_full_unstemmed Using Family-Based Imputation in Genome-Wide Association Studies with Large Complex Pedigrees: The Framingham Heart Study
title_short Using Family-Based Imputation in Genome-Wide Association Studies with Large Complex Pedigrees: The Framingham Heart Study
title_sort using family-based imputation in genome-wide association studies with large complex pedigrees: the framingham heart study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3524237/
https://www.ncbi.nlm.nih.gov/pubmed/23284720
http://dx.doi.org/10.1371/journal.pone.0051589
work_keys_str_mv AT chenminghuei usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy
AT huangjie usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy
AT chenweimin usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy
AT larsonmarting usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy
AT foxcarolines usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy
AT vasanramachandrans usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy
AT seshadrisudha usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy
AT odonnellchristopherj usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy
AT yangqiong usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy