Cargando…
Cryptic relatedness in epidemiologic collections accessed for genetic association studies: experiences from the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study and the National Health and Nutrition Examination Surveys (NHANES)
Epidemiologic collections have been a major resource for genotype–phenotype studies of complex disease given their large sample size, racial/ethnic diversity, and breadth and depth of phenotypes, traits, and exposures. A major disadvantage of these collections is they often survey households and com...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4620157/ https://www.ncbi.nlm.nih.gov/pubmed/26579192 http://dx.doi.org/10.3389/fgene.2015.00317 |
_version_ | 1782397243666464768 |
---|---|
author | Malinowski, Jennifer Goodloe, Robert Brown-Gentry, Kristin Crawford, Dana C. |
author_facet | Malinowski, Jennifer Goodloe, Robert Brown-Gentry, Kristin Crawford, Dana C. |
author_sort | Malinowski, Jennifer |
collection | PubMed |
description | Epidemiologic collections have been a major resource for genotype–phenotype studies of complex disease given their large sample size, racial/ethnic diversity, and breadth and depth of phenotypes, traits, and exposures. A major disadvantage of these collections is they often survey households and communities without collecting extensive pedigree data. Failure to account for substantial relatedness can lead to inflated estimates and spurious associations. To examine the extent of cryptic relatedness in an epidemiologic collection, we as the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study accessed the National Health and Nutrition Examination Surveys (NHANES) linked to DNA samples (“Genetic NHANES”) from NHANES III and NHANES 1999–2002. NHANES are population-based cross-sectional surveys conducted by the National Center for Health Statistics at the Centers for Disease Control and Prevention. Genome-wide genetic data is not yet available in NHANES, and current data use agreements prohibit the generation of GWAS-level data in NHANES samples due issues in maintaining confidentiality among other ethical concerns. To date, only hundreds of single nucleotide polymorphisms (SNPs) genotyped in a variety of candidate genes are available for analysis in NHANES. We performed identity-by-descent (IBD) estimates in three self-identified subpopulations of Genetic NHANES (non-Hispanic white, non- Hispanic black, and Mexican American) using PLINK software to identify potential familial relationships from presumed unrelated subjects. We then compared the PLINKidentified relationships to those identified by an alternative method implemented in Kinship-based INference for Genome-wide association studies (KING). Overall, both methods identified familial relationships in NHANES III and NHANES 1999–2002 for all three subpopulations, but little concordance was observed between the two methods due in major part to the limited SNP data available in Genetic NHANES. Despite the lack of genome-wide data, our results suggest the presence of cryptic relatedness in this epidemiologic collection and highlight the limitations of restricted datasets such as NHANES in the context of modern day genetic epidemiology studies. |
format | Online Article Text |
id | pubmed-4620157 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-46201572015-11-17 Cryptic relatedness in epidemiologic collections accessed for genetic association studies: experiences from the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study and the National Health and Nutrition Examination Surveys (NHANES) Malinowski, Jennifer Goodloe, Robert Brown-Gentry, Kristin Crawford, Dana C. Front Genet Genetics Epidemiologic collections have been a major resource for genotype–phenotype studies of complex disease given their large sample size, racial/ethnic diversity, and breadth and depth of phenotypes, traits, and exposures. A major disadvantage of these collections is they often survey households and communities without collecting extensive pedigree data. Failure to account for substantial relatedness can lead to inflated estimates and spurious associations. To examine the extent of cryptic relatedness in an epidemiologic collection, we as the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study accessed the National Health and Nutrition Examination Surveys (NHANES) linked to DNA samples (“Genetic NHANES”) from NHANES III and NHANES 1999–2002. NHANES are population-based cross-sectional surveys conducted by the National Center for Health Statistics at the Centers for Disease Control and Prevention. Genome-wide genetic data is not yet available in NHANES, and current data use agreements prohibit the generation of GWAS-level data in NHANES samples due issues in maintaining confidentiality among other ethical concerns. To date, only hundreds of single nucleotide polymorphisms (SNPs) genotyped in a variety of candidate genes are available for analysis in NHANES. We performed identity-by-descent (IBD) estimates in three self-identified subpopulations of Genetic NHANES (non-Hispanic white, non- Hispanic black, and Mexican American) using PLINK software to identify potential familial relationships from presumed unrelated subjects. We then compared the PLINKidentified relationships to those identified by an alternative method implemented in Kinship-based INference for Genome-wide association studies (KING). Overall, both methods identified familial relationships in NHANES III and NHANES 1999–2002 for all three subpopulations, but little concordance was observed between the two methods due in major part to the limited SNP data available in Genetic NHANES. Despite the lack of genome-wide data, our results suggest the presence of cryptic relatedness in this epidemiologic collection and highlight the limitations of restricted datasets such as NHANES in the context of modern day genetic epidemiology studies. Frontiers Media S.A. 2015-10-26 /pmc/articles/PMC4620157/ /pubmed/26579192 http://dx.doi.org/10.3389/fgene.2015.00317 Text en Copyright © 2015 Malinowski, Goodloe, Brown-Gentry and Crawford. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Malinowski, Jennifer Goodloe, Robert Brown-Gentry, Kristin Crawford, Dana C. Cryptic relatedness in epidemiologic collections accessed for genetic association studies: experiences from the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study and the National Health and Nutrition Examination Surveys (NHANES) |
title | Cryptic relatedness in epidemiologic collections accessed for genetic association studies: experiences from the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study and the National Health and Nutrition Examination Surveys (NHANES) |
title_full | Cryptic relatedness in epidemiologic collections accessed for genetic association studies: experiences from the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study and the National Health and Nutrition Examination Surveys (NHANES) |
title_fullStr | Cryptic relatedness in epidemiologic collections accessed for genetic association studies: experiences from the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study and the National Health and Nutrition Examination Surveys (NHANES) |
title_full_unstemmed | Cryptic relatedness in epidemiologic collections accessed for genetic association studies: experiences from the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study and the National Health and Nutrition Examination Surveys (NHANES) |
title_short | Cryptic relatedness in epidemiologic collections accessed for genetic association studies: experiences from the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) study and the National Health and Nutrition Examination Surveys (NHANES) |
title_sort | cryptic relatedness in epidemiologic collections accessed for genetic association studies: experiences from the epidemiologic architecture for genes linked to environment (eagle) study and the national health and nutrition examination surveys (nhanes) |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4620157/ https://www.ncbi.nlm.nih.gov/pubmed/26579192 http://dx.doi.org/10.3389/fgene.2015.00317 |
work_keys_str_mv | AT malinowskijennifer crypticrelatednessinepidemiologiccollectionsaccessedforgeneticassociationstudiesexperiencesfromtheepidemiologicarchitectureforgeneslinkedtoenvironmenteaglestudyandthenationalhealthandnutritionexaminationsurveysnhanes AT goodloerobert crypticrelatednessinepidemiologiccollectionsaccessedforgeneticassociationstudiesexperiencesfromtheepidemiologicarchitectureforgeneslinkedtoenvironmenteaglestudyandthenationalhealthandnutritionexaminationsurveysnhanes AT browngentrykristin crypticrelatednessinepidemiologiccollectionsaccessedforgeneticassociationstudiesexperiencesfromtheepidemiologicarchitectureforgeneslinkedtoenvironmenteaglestudyandthenationalhealthandnutritionexaminationsurveysnhanes AT crawforddanac crypticrelatednessinepidemiologiccollectionsaccessedforgeneticassociationstudiesexperiencesfromtheepidemiologicarchitectureforgeneslinkedtoenvironmenteaglestudyandthenationalhealthandnutritionexaminationsurveysnhanes |