Cargando…
Using Family-Based Imputation in Genome-Wide Association Studies with Large Complex Pedigrees: The Framingham Heart Study
Imputation has been widely used in genome-wide association studies (GWAS) to infer genotypes of un-genotyped variants based on the linkage disequilibrium in external reference panels such as the HapMap and 1000 Genomes. However, imputation has only rarely been performed based on family relationships...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3524237/ https://www.ncbi.nlm.nih.gov/pubmed/23284720 http://dx.doi.org/10.1371/journal.pone.0051589 |
_version_ | 1782253297285988352 |
---|---|
author | Chen, Ming-Huei Huang, Jie Chen, Wei-Min Larson, Martin G. Fox, Caroline S. Vasan, Ramachandran S. Seshadri, Sudha O’Donnell, Christopher J. Yang, Qiong |
author_facet | Chen, Ming-Huei Huang, Jie Chen, Wei-Min Larson, Martin G. Fox, Caroline S. Vasan, Ramachandran S. Seshadri, Sudha O’Donnell, Christopher J. Yang, Qiong |
author_sort | Chen, Ming-Huei |
collection | PubMed |
description | Imputation has been widely used in genome-wide association studies (GWAS) to infer genotypes of un-genotyped variants based on the linkage disequilibrium in external reference panels such as the HapMap and 1000 Genomes. However, imputation has only rarely been performed based on family relationships to infer genotypes of un-genotyped individuals. Using 8998 Framingham Heart Study (FHS) participants genotyped with Affymetrix 550K SNPs, we imputed genotypes of same set of SNPs for additional 3121 participants, most of whom were never genotyped due to lack of DNA sample. Prior to imputation, 122 pedigrees were too large to be handled by the imputation software Merlin. Therefore, we developed a novel pedigree splitting algorithm that can maximize the number of genotyped relatives for imputing each un-genotyped individual, while keeping new sub-pedigrees under a pre-specified size. In GWAS of four phenotypes available in FHS (Alzheimer disease, circulating levels of fibrinogen, high-density lipoprotein cholesterol, and uric acid), we compared results using genotyped individuals only with results using both genotyped and imputed individuals. We studied the impact of applying different imputation quality filtering thresholds on the association results and did not found a universal threshold that always resulted in a more significant p-value for previously identified loci. However most of these loci had a lower p-value when we only included imputed genotypes with with ≥60% SNP- and ≥50% person-specific imputation certainty. In summary, we developed a novel algorithm for splitting large pedigrees for imputation and found a plausible imputation quality filtering threshold based on FHS. Further examination may be required to generalize this threshold to other studies. |
format | Online Article Text |
id | pubmed-3524237 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-35242372013-01-02 Using Family-Based Imputation in Genome-Wide Association Studies with Large Complex Pedigrees: The Framingham Heart Study Chen, Ming-Huei Huang, Jie Chen, Wei-Min Larson, Martin G. Fox, Caroline S. Vasan, Ramachandran S. Seshadri, Sudha O’Donnell, Christopher J. Yang, Qiong PLoS One Research Article Imputation has been widely used in genome-wide association studies (GWAS) to infer genotypes of un-genotyped variants based on the linkage disequilibrium in external reference panels such as the HapMap and 1000 Genomes. However, imputation has only rarely been performed based on family relationships to infer genotypes of un-genotyped individuals. Using 8998 Framingham Heart Study (FHS) participants genotyped with Affymetrix 550K SNPs, we imputed genotypes of same set of SNPs for additional 3121 participants, most of whom were never genotyped due to lack of DNA sample. Prior to imputation, 122 pedigrees were too large to be handled by the imputation software Merlin. Therefore, we developed a novel pedigree splitting algorithm that can maximize the number of genotyped relatives for imputing each un-genotyped individual, while keeping new sub-pedigrees under a pre-specified size. In GWAS of four phenotypes available in FHS (Alzheimer disease, circulating levels of fibrinogen, high-density lipoprotein cholesterol, and uric acid), we compared results using genotyped individuals only with results using both genotyped and imputed individuals. We studied the impact of applying different imputation quality filtering thresholds on the association results and did not found a universal threshold that always resulted in a more significant p-value for previously identified loci. However most of these loci had a lower p-value when we only included imputed genotypes with with ≥60% SNP- and ≥50% person-specific imputation certainty. In summary, we developed a novel algorithm for splitting large pedigrees for imputation and found a plausible imputation quality filtering threshold based on FHS. Further examination may be required to generalize this threshold to other studies. Public Library of Science 2012-12-17 /pmc/articles/PMC3524237/ /pubmed/23284720 http://dx.doi.org/10.1371/journal.pone.0051589 Text en © 2012 Chen et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Chen, Ming-Huei Huang, Jie Chen, Wei-Min Larson, Martin G. Fox, Caroline S. Vasan, Ramachandran S. Seshadri, Sudha O’Donnell, Christopher J. Yang, Qiong Using Family-Based Imputation in Genome-Wide Association Studies with Large Complex Pedigrees: The Framingham Heart Study |
title | Using Family-Based Imputation in Genome-Wide Association Studies with Large Complex Pedigrees: The Framingham Heart Study |
title_full | Using Family-Based Imputation in Genome-Wide Association Studies with Large Complex Pedigrees: The Framingham Heart Study |
title_fullStr | Using Family-Based Imputation in Genome-Wide Association Studies with Large Complex Pedigrees: The Framingham Heart Study |
title_full_unstemmed | Using Family-Based Imputation in Genome-Wide Association Studies with Large Complex Pedigrees: The Framingham Heart Study |
title_short | Using Family-Based Imputation in Genome-Wide Association Studies with Large Complex Pedigrees: The Framingham Heart Study |
title_sort | using family-based imputation in genome-wide association studies with large complex pedigrees: the framingham heart study |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3524237/ https://www.ncbi.nlm.nih.gov/pubmed/23284720 http://dx.doi.org/10.1371/journal.pone.0051589 |
work_keys_str_mv | AT chenminghuei usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy AT huangjie usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy AT chenweimin usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy AT larsonmarting usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy AT foxcarolines usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy AT vasanramachandrans usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy AT seshadrisudha usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy AT odonnellchristopherj usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy AT yangqiong usingfamilybasedimputationingenomewideassociationstudieswithlargecomplexpedigreestheframinghamheartstudy |