Cargando…

Clustered Environments and Randomized Genes: A Fundamental Distinction between Conventional and Genetic Epidemiology

BACKGROUND: In conventional epidemiology confounding of the exposure of interest with lifestyle or socioeconomic factors, and reverse causation whereby disease status influences exposure rather than vice versa, may invalidate causal interpretations of observed associations. Conversely, genetic varia...

Descripción completa

Detalles Bibliográficos
Autores principales: Smith, George Davey, Lawlor, Debbie A, Harbord, Roger, Timpson, Nic, Day, Ian, Ebrahim, Shah
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2121108/
https://www.ncbi.nlm.nih.gov/pubmed/18076282
http://dx.doi.org/10.1371/journal.pmed.0040352
_version_ 1782141651500662784
author Smith, George Davey
Lawlor, Debbie A
Harbord, Roger
Timpson, Nic
Day, Ian
Ebrahim, Shah
author_facet Smith, George Davey
Lawlor, Debbie A
Harbord, Roger
Timpson, Nic
Day, Ian
Ebrahim, Shah
author_sort Smith, George Davey
collection PubMed
description BACKGROUND: In conventional epidemiology confounding of the exposure of interest with lifestyle or socioeconomic factors, and reverse causation whereby disease status influences exposure rather than vice versa, may invalidate causal interpretations of observed associations. Conversely, genetic variants should not be related to the confounding factors that distort associations in conventional observational epidemiological studies. Furthermore, disease onset will not influence genotype. Therefore, it has been suggested that genetic variants that are known to be associated with a modifiable (nongenetic) risk factor can be used to help determine the causal effect of this modifiable risk factor on disease outcomes. This approach, mendelian randomization, is increasingly being applied within epidemiological studies. However, there is debate about the underlying premise that associations between genotypes and disease outcomes are not confounded by other risk factors. We examined the extent to which genetic variants, on the one hand, and nongenetic environmental exposures or phenotypic characteristics on the other, tend to be associated with each other, to assess the degree of confounding that would exist in conventional epidemiological studies compared with mendelian randomization studies. METHODS AND FINDINGS: We estimated pairwise correlations between nongenetic baseline variables and genetic variables in a cross-sectional study comparing the number of correlations that were statistically significant at the 5%, 1%, and 0.01% level (α = 0.05, 0.01, and 0.0001, respectively) with the number expected by chance if all variables were in fact uncorrelated, using a two-sided binomial exact test. We demonstrate that behavioural, socioeconomic, and physiological factors are strongly interrelated, with 45% of all possible pairwise associations between 96 nongenetic characteristics (n = 4,560 correlations) being significant at the p < 0.01 level (the ratio of observed to expected significant associations was 45; p-value for difference between observed and expected < 0.000001). Similar findings were observed for other levels of significance. In contrast, genetic variants showed no greater association with each other, or with the 96 behavioural, socioeconomic, and physiological factors, than would be expected by chance. CONCLUSIONS: These data illustrate why observational studies have produced misleading claims regarding potentially causal factors for disease. The findings demonstrate the potential power of a methodology that utilizes genetic variants as indicators of exposure level when studying environmentally modifiable risk factors.
format Text
id pubmed-2121108
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-21211082007-12-11 Clustered Environments and Randomized Genes: A Fundamental Distinction between Conventional and Genetic Epidemiology Smith, George Davey Lawlor, Debbie A Harbord, Roger Timpson, Nic Day, Ian Ebrahim, Shah PLoS Med Research Article BACKGROUND: In conventional epidemiology confounding of the exposure of interest with lifestyle or socioeconomic factors, and reverse causation whereby disease status influences exposure rather than vice versa, may invalidate causal interpretations of observed associations. Conversely, genetic variants should not be related to the confounding factors that distort associations in conventional observational epidemiological studies. Furthermore, disease onset will not influence genotype. Therefore, it has been suggested that genetic variants that are known to be associated with a modifiable (nongenetic) risk factor can be used to help determine the causal effect of this modifiable risk factor on disease outcomes. This approach, mendelian randomization, is increasingly being applied within epidemiological studies. However, there is debate about the underlying premise that associations between genotypes and disease outcomes are not confounded by other risk factors. We examined the extent to which genetic variants, on the one hand, and nongenetic environmental exposures or phenotypic characteristics on the other, tend to be associated with each other, to assess the degree of confounding that would exist in conventional epidemiological studies compared with mendelian randomization studies. METHODS AND FINDINGS: We estimated pairwise correlations between nongenetic baseline variables and genetic variables in a cross-sectional study comparing the number of correlations that were statistically significant at the 5%, 1%, and 0.01% level (α = 0.05, 0.01, and 0.0001, respectively) with the number expected by chance if all variables were in fact uncorrelated, using a two-sided binomial exact test. We demonstrate that behavioural, socioeconomic, and physiological factors are strongly interrelated, with 45% of all possible pairwise associations between 96 nongenetic characteristics (n = 4,560 correlations) being significant at the p < 0.01 level (the ratio of observed to expected significant associations was 45; p-value for difference between observed and expected < 0.000001). Similar findings were observed for other levels of significance. In contrast, genetic variants showed no greater association with each other, or with the 96 behavioural, socioeconomic, and physiological factors, than would be expected by chance. CONCLUSIONS: These data illustrate why observational studies have produced misleading claims regarding potentially causal factors for disease. The findings demonstrate the potential power of a methodology that utilizes genetic variants as indicators of exposure level when studying environmentally modifiable risk factors. Public Library of Science 2007-12 2007-12-11 /pmc/articles/PMC2121108/ /pubmed/18076282 http://dx.doi.org/10.1371/journal.pmed.0040352 Text en : © 2007 Davey Smith et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Smith, George Davey
Lawlor, Debbie A
Harbord, Roger
Timpson, Nic
Day, Ian
Ebrahim, Shah
Clustered Environments and Randomized Genes: A Fundamental Distinction between Conventional and Genetic Epidemiology
title Clustered Environments and Randomized Genes: A Fundamental Distinction between Conventional and Genetic Epidemiology
title_full Clustered Environments and Randomized Genes: A Fundamental Distinction between Conventional and Genetic Epidemiology
title_fullStr Clustered Environments and Randomized Genes: A Fundamental Distinction between Conventional and Genetic Epidemiology
title_full_unstemmed Clustered Environments and Randomized Genes: A Fundamental Distinction between Conventional and Genetic Epidemiology
title_short Clustered Environments and Randomized Genes: A Fundamental Distinction between Conventional and Genetic Epidemiology
title_sort clustered environments and randomized genes: a fundamental distinction between conventional and genetic epidemiology
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2121108/
https://www.ncbi.nlm.nih.gov/pubmed/18076282
http://dx.doi.org/10.1371/journal.pmed.0040352
work_keys_str_mv AT smithgeorgedavey clusteredenvironmentsandrandomizedgenesafundamentaldistinctionbetweenconventionalandgeneticepidemiology
AT lawlordebbiea clusteredenvironmentsandrandomizedgenesafundamentaldistinctionbetweenconventionalandgeneticepidemiology
AT harbordroger clusteredenvironmentsandrandomizedgenesafundamentaldistinctionbetweenconventionalandgeneticepidemiology
AT timpsonnic clusteredenvironmentsandrandomizedgenesafundamentaldistinctionbetweenconventionalandgeneticepidemiology
AT dayian clusteredenvironmentsandrandomizedgenesafundamentaldistinctionbetweenconventionalandgeneticepidemiology
AT ebrahimshah clusteredenvironmentsandrandomizedgenesafundamentaldistinctionbetweenconventionalandgeneticepidemiology