Cargando…

Efficient Replication of over 180 Genetic Associations with Self-Reported Medical Data

While the cost and speed of generating genomic data have come down dramatically in recent years, the slow pace of collecting medical data for large cohorts continues to hamper genetic research. Here we evaluate a novel online framework for obtaining large amounts of medical information from a recont...

Descripción completa

Detalles Bibliográficos
Autores principales: Tung, Joyce Y., Do, Chuong B., Hinds, David A., Kiefer, Amy K., Macpherson, J. Michael, Chowdry, Arnab B., Francke, Uta, Naughton, Brian T., Mountain, Joanna L., Wojcicki, Anne, Eriksson, Nicholas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3157390/
https://www.ncbi.nlm.nih.gov/pubmed/21858135
http://dx.doi.org/10.1371/journal.pone.0023473
_version_ 1782210294973464576
author Tung, Joyce Y.
Do, Chuong B.
Hinds, David A.
Kiefer, Amy K.
Macpherson, J. Michael
Chowdry, Arnab B.
Francke, Uta
Naughton, Brian T.
Mountain, Joanna L.
Wojcicki, Anne
Eriksson, Nicholas
author_facet Tung, Joyce Y.
Do, Chuong B.
Hinds, David A.
Kiefer, Amy K.
Macpherson, J. Michael
Chowdry, Arnab B.
Francke, Uta
Naughton, Brian T.
Mountain, Joanna L.
Wojcicki, Anne
Eriksson, Nicholas
author_sort Tung, Joyce Y.
collection PubMed
description While the cost and speed of generating genomic data have come down dramatically in recent years, the slow pace of collecting medical data for large cohorts continues to hamper genetic research. Here we evaluate a novel online framework for obtaining large amounts of medical information from a recontactable cohort by assessing our ability to replicate genetic associations using these data. Using web-based questionnaires, we gathered self-reported data on 50 medical phenotypes from a generally unselected cohort of over 20,000 genotyped individuals. Of a list of genetic associations curated by NHGRI, we successfully replicated about 75% of the associations that we expected to (based on the number of cases in our cohort and reported odds ratios, and excluding a set of associations with contradictory published evidence). Altogether we replicated over 180 previously reported associations, including many for type 2 diabetes, prostate cancer, cholesterol levels, and multiple sclerosis. We found significant variation across categories of conditions in the percentage of expected associations that we were able to replicate, which may reflect systematic inflation of the effects in some initial reports, or differences across diseases in the likelihood of misdiagnosis or misreport. We also demonstrated that we could improve replication success by taking advantage of our recontactable cohort, offering more in-depth questions to refine self-reported diagnoses. Our data suggest that online collection of self-reported data from a recontactable cohort may be a viable method for both broad and deep phenotyping in large populations.
format Online
Article
Text
id pubmed-3157390
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31573902011-08-19 Efficient Replication of over 180 Genetic Associations with Self-Reported Medical Data Tung, Joyce Y. Do, Chuong B. Hinds, David A. Kiefer, Amy K. Macpherson, J. Michael Chowdry, Arnab B. Francke, Uta Naughton, Brian T. Mountain, Joanna L. Wojcicki, Anne Eriksson, Nicholas PLoS One Research Article While the cost and speed of generating genomic data have come down dramatically in recent years, the slow pace of collecting medical data for large cohorts continues to hamper genetic research. Here we evaluate a novel online framework for obtaining large amounts of medical information from a recontactable cohort by assessing our ability to replicate genetic associations using these data. Using web-based questionnaires, we gathered self-reported data on 50 medical phenotypes from a generally unselected cohort of over 20,000 genotyped individuals. Of a list of genetic associations curated by NHGRI, we successfully replicated about 75% of the associations that we expected to (based on the number of cases in our cohort and reported odds ratios, and excluding a set of associations with contradictory published evidence). Altogether we replicated over 180 previously reported associations, including many for type 2 diabetes, prostate cancer, cholesterol levels, and multiple sclerosis. We found significant variation across categories of conditions in the percentage of expected associations that we were able to replicate, which may reflect systematic inflation of the effects in some initial reports, or differences across diseases in the likelihood of misdiagnosis or misreport. We also demonstrated that we could improve replication success by taking advantage of our recontactable cohort, offering more in-depth questions to refine self-reported diagnoses. Our data suggest that online collection of self-reported data from a recontactable cohort may be a viable method for both broad and deep phenotyping in large populations. Public Library of Science 2011-08-17 /pmc/articles/PMC3157390/ /pubmed/21858135 http://dx.doi.org/10.1371/journal.pone.0023473 Text en Tung et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Tung, Joyce Y.
Do, Chuong B.
Hinds, David A.
Kiefer, Amy K.
Macpherson, J. Michael
Chowdry, Arnab B.
Francke, Uta
Naughton, Brian T.
Mountain, Joanna L.
Wojcicki, Anne
Eriksson, Nicholas
Efficient Replication of over 180 Genetic Associations with Self-Reported Medical Data
title Efficient Replication of over 180 Genetic Associations with Self-Reported Medical Data
title_full Efficient Replication of over 180 Genetic Associations with Self-Reported Medical Data
title_fullStr Efficient Replication of over 180 Genetic Associations with Self-Reported Medical Data
title_full_unstemmed Efficient Replication of over 180 Genetic Associations with Self-Reported Medical Data
title_short Efficient Replication of over 180 Genetic Associations with Self-Reported Medical Data
title_sort efficient replication of over 180 genetic associations with self-reported medical data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3157390/
https://www.ncbi.nlm.nih.gov/pubmed/21858135
http://dx.doi.org/10.1371/journal.pone.0023473
work_keys_str_mv AT tungjoycey efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT dochuongb efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT hindsdavida efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT kieferamyk efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT macphersonjmichael efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT chowdryarnabb efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT franckeuta efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT naughtonbriant efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT mountainjoannal efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT wojcickianne efficientreplicationofover180geneticassociationswithselfreportedmedicaldata
AT erikssonnicholas efficientreplicationofover180geneticassociationswithselfreportedmedicaldata