Cargando…
A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design
Logistic regression is often used instead of Cox regression to analyse genome-wide association studies (GWAS) of single-nucleotide polymorphisms (SNPs) and disease outcomes with cohort and case-cohort designs, as it is less computationally expensive. Although Cox and logistic regression models have...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5520083/ https://www.ncbi.nlm.nih.gov/pubmed/28594416 http://dx.doi.org/10.1038/ejhg.2017.78 |
_version_ | 1783251750309855232 |
---|---|
author | Staley, James R Jones, Edmund Kaptoge, Stephen Butterworth, Adam S Sweeting, Michael J Wood, Angela M Howson, Joanna M M |
author_facet | Staley, James R Jones, Edmund Kaptoge, Stephen Butterworth, Adam S Sweeting, Michael J Wood, Angela M Howson, Joanna M M |
author_sort | Staley, James R |
collection | PubMed |
description | Logistic regression is often used instead of Cox regression to analyse genome-wide association studies (GWAS) of single-nucleotide polymorphisms (SNPs) and disease outcomes with cohort and case-cohort designs, as it is less computationally expensive. Although Cox and logistic regression models have been compared previously in cohort studies, this work does not completely cover the GWAS setting nor extend to the case-cohort study design. Here, we evaluated Cox and logistic regression applied to cohort and case-cohort genetic association studies using simulated data and genetic data from the EPIC-CVD study. In the cohort setting, there was a modest improvement in power to detect SNP–disease associations using Cox regression compared with logistic regression, which increased as the disease incidence increased. In contrast, logistic regression had more power than (Prentice weighted) Cox regression in the case-cohort setting. Logistic regression yielded inflated effect estimates (assuming the hazard ratio is the underlying measure of association) for both study designs, especially for SNPs with greater effect on disease. Given logistic regression is substantially more computationally efficient than Cox regression in both settings, we propose a two-step approach to GWAS in cohort and case-cohort studies. First to analyse all SNPs with logistic regression to identify associated variants below a pre-defined P-value threshold, and second to fit Cox regression (appropriately weighted in case-cohort studies) to those identified SNPs to ensure accurate estimation of association with disease. |
format | Online Article Text |
id | pubmed-5520083 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-55200832017-08-23 A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design Staley, James R Jones, Edmund Kaptoge, Stephen Butterworth, Adam S Sweeting, Michael J Wood, Angela M Howson, Joanna M M Eur J Hum Genet Article Logistic regression is often used instead of Cox regression to analyse genome-wide association studies (GWAS) of single-nucleotide polymorphisms (SNPs) and disease outcomes with cohort and case-cohort designs, as it is less computationally expensive. Although Cox and logistic regression models have been compared previously in cohort studies, this work does not completely cover the GWAS setting nor extend to the case-cohort study design. Here, we evaluated Cox and logistic regression applied to cohort and case-cohort genetic association studies using simulated data and genetic data from the EPIC-CVD study. In the cohort setting, there was a modest improvement in power to detect SNP–disease associations using Cox regression compared with logistic regression, which increased as the disease incidence increased. In contrast, logistic regression had more power than (Prentice weighted) Cox regression in the case-cohort setting. Logistic regression yielded inflated effect estimates (assuming the hazard ratio is the underlying measure of association) for both study designs, especially for SNPs with greater effect on disease. Given logistic regression is substantially more computationally efficient than Cox regression in both settings, we propose a two-step approach to GWAS in cohort and case-cohort studies. First to analyse all SNPs with logistic regression to identify associated variants below a pre-defined P-value threshold, and second to fit Cox regression (appropriately weighted in case-cohort studies) to those identified SNPs to ensure accurate estimation of association with disease. Nature Publishing Group 2017-07 2017-05-03 /pmc/articles/PMC5520083/ /pubmed/28594416 http://dx.doi.org/10.1038/ejhg.2017.78 Text en Copyright © 2017 The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Staley, James R Jones, Edmund Kaptoge, Stephen Butterworth, Adam S Sweeting, Michael J Wood, Angela M Howson, Joanna M M A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design |
title | A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design |
title_full | A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design |
title_fullStr | A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design |
title_full_unstemmed | A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design |
title_short | A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design |
title_sort | comparison of cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5520083/ https://www.ncbi.nlm.nih.gov/pubmed/28594416 http://dx.doi.org/10.1038/ejhg.2017.78 |
work_keys_str_mv | AT staleyjamesr acomparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign AT jonesedmund acomparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign AT kaptogestephen acomparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign AT butterworthadams acomparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign AT sweetingmichaelj acomparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign AT woodangelam acomparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign AT howsonjoannamm acomparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign AT staleyjamesr comparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign AT jonesedmund comparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign AT kaptogestephen comparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign AT butterworthadams comparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign AT sweetingmichaelj comparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign AT woodangelam comparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign AT howsonjoannamm comparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign |