Cryptic relatedness in epidemiologic collections accessed for genetic association studies: experiences from the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study and the National Health and Nutrition Examination Surveys (NHANES)

Epidemiologic collections have been a major resource for genotype–phenotype studies of complex disease given their large sample size, racial/ethnic diversity, and breadth and depth of phenotypes, traits, and exposures. A major disadvantage of these collections is they often survey households and com...

Bibliographic Details
Main Authors: Malinowski, Jennifer, Goodloe, Robert, Brown-Gentry, Kristin, Crawford, Dana C.
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2015
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4620157/
https://www.ncbi.nlm.nih.gov/pubmed/26579192
http://dx.doi.org/10.3389/fgene.2015.00317
_version_ 1782397243666464768
author Malinowski, Jennifer
Goodloe, Robert
Brown-Gentry, Kristin
Crawford, Dana C.
author_facet Malinowski, Jennifer
Goodloe, Robert
Brown-Gentry, Kristin
Crawford, Dana C.
author_sort Malinowski, Jennifer
collection PubMed
description Epidemiologic collections have been a major resource for genotype–phenotype studies of complex disease given their large sample size, racial/ethnic diversity, and breadth and depth of phenotypes, traits, and exposures. A major disadvantage of these collections is that they often survey households and communities without collecting extensive pedigree data. Failure to account for substantial relatedness can lead to inflated estimates and spurious associations. To examine the extent of cryptic relatedness in an epidemiologic collection, we, as the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study, accessed the National Health and Nutrition Examination Surveys (NHANES) linked to DNA samples (“Genetic NHANES”) from NHANES III and NHANES 1999–2002. NHANES are population-based cross-sectional surveys conducted by the National Center for Health Statistics at the Centers for Disease Control and Prevention. Genome-wide genetic data are not yet available in NHANES, and current data use agreements prohibit the generation of GWAS-level data in NHANES samples due to issues in maintaining confidentiality, among other ethical concerns. To date, only hundreds of single nucleotide polymorphisms (SNPs) genotyped in a variety of candidate genes are available for analysis in NHANES. We performed identity-by-descent (IBD) estimates in three self-identified subpopulations of Genetic NHANES (non-Hispanic white, non-Hispanic black, and Mexican American) using PLINK software to identify potential familial relationships among presumed unrelated subjects. We then compared the PLINK-identified relationships to those identified by an alternative method implemented in Kinship-based INference for Genome-wide association studies (KING).
Overall, both methods identified familial relationships in NHANES III and NHANES 1999–2002 for all three subpopulations, but little concordance was observed between the two methods, due in major part to the limited SNP data available in Genetic NHANES. Despite the lack of genome-wide data, our results suggest the presence of cryptic relatedness in this epidemiologic collection and highlight the limitations of restricted datasets such as NHANES in the context of modern-day genetic epidemiology studies.
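As an illustration of what "concordance" between two relatedness methods can mean, the following is a minimal Python sketch that compares two lists of sample pairs flagged as related. It is not the authors' procedure: the record does not say how agreement was measured, and the function names, the Jaccard summary, and the sample IDs are all hypothetical choices made for this example.

```python
from typing import Dict, FrozenSet, Iterable, Set, Tuple

Pair = FrozenSet[str]


def normalize_pairs(pairs: Iterable[Tuple[str, str]]) -> Set[Pair]:
    """Treat (a, b) and (b, a) as the same pair and drop self-pairs."""
    return {frozenset(p) for p in pairs if p[0] != p[1]}


def concordance(
    calls_a: Iterable[Tuple[str, str]],
    calls_b: Iterable[Tuple[str, str]],
) -> Dict[str, float]:
    """Summarize the overlap between two sets of related-pair calls.

    The Jaccard index (shared / union) is an assumed summary statistic,
    chosen only for illustration.
    """
    a = normalize_pairs(calls_a)
    b = normalize_pairs(calls_b)
    shared = a & b
    union = a | b
    return {
        "shared": len(shared),
        "only_a": len(a - b),
        "only_b": len(b - a),
        # Defined here as 1.0 when neither method flags any pair.
        "jaccard": len(shared) / len(union) if union else 1.0,
    }


# Hypothetical sample IDs; these are not NHANES data.
method_a_pairs = [("s1", "s2"), ("s3", "s4"), ("s5", "s6")]
method_b_pairs = [("s2", "s1"), ("s7", "s8")]
result = concordance(method_a_pairs, method_b_pairs)
print(result)
```

With only a few hundred SNPs, pairwise estimates are noisy, so two methods can each flag many pairs while sharing few of them; a low overlap of this kind is the pattern the abstract describes.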
format Online
Article
Text
id pubmed-4620157
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-4620157 2015-11-17 Cryptic relatedness in epidemiologic collections accessed for genetic association studies: experiences from the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study and the National Health and Nutrition Examination Surveys (NHANES) Malinowski, Jennifer Goodloe, Robert Brown-Gentry, Kristin Crawford, Dana C. Front Genet Genetics Epidemiologic collections have been a major resource for genotype–phenotype studies of complex disease given their large sample size, racial/ethnic diversity, and breadth and depth of phenotypes, traits, and exposures. A major disadvantage of these collections is that they often survey households and communities without collecting extensive pedigree data. Failure to account for substantial relatedness can lead to inflated estimates and spurious associations. To examine the extent of cryptic relatedness in an epidemiologic collection, we, as the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study, accessed the National Health and Nutrition Examination Surveys (NHANES) linked to DNA samples (“Genetic NHANES”) from NHANES III and NHANES 1999–2002. NHANES are population-based cross-sectional surveys conducted by the National Center for Health Statistics at the Centers for Disease Control and Prevention. Genome-wide genetic data are not yet available in NHANES, and current data use agreements prohibit the generation of GWAS-level data in NHANES samples due to issues in maintaining confidentiality, among other ethical concerns. To date, only hundreds of single nucleotide polymorphisms (SNPs) genotyped in a variety of candidate genes are available for analysis in NHANES. We performed identity-by-descent (IBD) estimates in three self-identified subpopulations of Genetic NHANES (non-Hispanic white, non-Hispanic black, and Mexican American) using PLINK software to identify potential familial relationships among presumed unrelated subjects.
We then compared the PLINK-identified relationships to those identified by an alternative method implemented in Kinship-based INference for Genome-wide association studies (KING). Overall, both methods identified familial relationships in NHANES III and NHANES 1999–2002 for all three subpopulations, but little concordance was observed between the two methods, due in major part to the limited SNP data available in Genetic NHANES. Despite the lack of genome-wide data, our results suggest the presence of cryptic relatedness in this epidemiologic collection and highlight the limitations of restricted datasets such as NHANES in the context of modern-day genetic epidemiology studies. Frontiers Media S.A. 2015-10-26 /pmc/articles/PMC4620157/ /pubmed/26579192 http://dx.doi.org/10.3389/fgene.2015.00317 Text en Copyright © 2015 Malinowski, Goodloe, Brown-Gentry and Crawford. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Malinowski, Jennifer
Goodloe, Robert
Brown-Gentry, Kristin
Crawford, Dana C.
Cryptic relatedness in epidemiologic collections accessed for genetic association studies: experiences from the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study and the National Health and Nutrition Examination Surveys (NHANES)
title Cryptic relatedness in epidemiologic collections accessed for genetic association studies: experiences from the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study and the National Health and Nutrition Examination Surveys (NHANES)
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4620157/
https://www.ncbi.nlm.nih.gov/pubmed/26579192
http://dx.doi.org/10.3389/fgene.2015.00317