Cargando…

A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design

Logistic regression is often used instead of Cox regression to analyse genome-wide association studies (GWAS) of single-nucleotide polymorphisms (SNPs) and disease outcomes with cohort and case-cohort designs, as it is less computationally expensive. Although Cox and logistic regression models have...

Descripción completa

Detalles Bibliográficos
Autores principales: Staley, James R, Jones, Edmund, Kaptoge, Stephen, Butterworth, Adam S, Sweeting, Michael J, Wood, Angela M, Howson, Joanna M M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5520083/
https://www.ncbi.nlm.nih.gov/pubmed/28594416
http://dx.doi.org/10.1038/ejhg.2017.78
_version_ 1783251750309855232
author Staley, James R
Jones, Edmund
Kaptoge, Stephen
Butterworth, Adam S
Sweeting, Michael J
Wood, Angela M
Howson, Joanna M M
author_facet Staley, James R
Jones, Edmund
Kaptoge, Stephen
Butterworth, Adam S
Sweeting, Michael J
Wood, Angela M
Howson, Joanna M M
author_sort Staley, James R
collection PubMed
description Logistic regression is often used instead of Cox regression to analyse genome-wide association studies (GWAS) of single-nucleotide polymorphisms (SNPs) and disease outcomes with cohort and case-cohort designs, as it is less computationally expensive. Although Cox and logistic regression models have been compared previously in cohort studies, this work does not completely cover the GWAS setting nor extend to the case-cohort study design. Here, we evaluated Cox and logistic regression applied to cohort and case-cohort genetic association studies using simulated data and genetic data from the EPIC-CVD study. In the cohort setting, there was a modest improvement in power to detect SNP–disease associations using Cox regression compared with logistic regression, which increased as the disease incidence increased. In contrast, logistic regression had more power than (Prentice weighted) Cox regression in the case-cohort setting. Logistic regression yielded inflated effect estimates (assuming the hazard ratio is the underlying measure of association) for both study designs, especially for SNPs with greater effect on disease. Given logistic regression is substantially more computationally efficient than Cox regression in both settings, we propose a two-step approach to GWAS in cohort and case-cohort studies. First to analyse all SNPs with logistic regression to identify associated variants below a pre-defined P-value threshold, and second to fit Cox regression (appropriately weighted in case-cohort studies) to those identified SNPs to ensure accurate estimation of association with disease.
format Online
Article
Text
id pubmed-5520083
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-55200832017-08-23 A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design Staley, James R Jones, Edmund Kaptoge, Stephen Butterworth, Adam S Sweeting, Michael J Wood, Angela M Howson, Joanna M M Eur J Hum Genet Article Logistic regression is often used instead of Cox regression to analyse genome-wide association studies (GWAS) of single-nucleotide polymorphisms (SNPs) and disease outcomes with cohort and case-cohort designs, as it is less computationally expensive. Although Cox and logistic regression models have been compared previously in cohort studies, this work does not completely cover the GWAS setting nor extend to the case-cohort study design. Here, we evaluated Cox and logistic regression applied to cohort and case-cohort genetic association studies using simulated data and genetic data from the EPIC-CVD study. In the cohort setting, there was a modest improvement in power to detect SNP–disease associations using Cox regression compared with logistic regression, which increased as the disease incidence increased. In contrast, logistic regression had more power than (Prentice weighted) Cox regression in the case-cohort setting. Logistic regression yielded inflated effect estimates (assuming the hazard ratio is the underlying measure of association) for both study designs, especially for SNPs with greater effect on disease. Given logistic regression is substantially more computationally efficient than Cox regression in both settings, we propose a two-step approach to GWAS in cohort and case-cohort studies. First to analyse all SNPs with logistic regression to identify associated variants below a pre-defined P-value threshold, and second to fit Cox regression (appropriately weighted in case-cohort studies) to those identified SNPs to ensure accurate estimation of association with disease. Nature Publishing Group 2017-07 2017-05-03 /pmc/articles/PMC5520083/ /pubmed/28594416 http://dx.doi.org/10.1038/ejhg.2017.78 Text en Copyright © 2017 The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Staley, James R
Jones, Edmund
Kaptoge, Stephen
Butterworth, Adam S
Sweeting, Michael J
Wood, Angela M
Howson, Joanna M M
A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design
title A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design
title_full A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design
title_fullStr A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design
title_full_unstemmed A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design
title_short A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design
title_sort comparison of cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5520083/
https://www.ncbi.nlm.nih.gov/pubmed/28594416
http://dx.doi.org/10.1038/ejhg.2017.78
work_keys_str_mv AT staleyjamesr acomparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign
AT jonesedmund acomparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign
AT kaptogestephen acomparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign
AT butterworthadams acomparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign
AT sweetingmichaelj acomparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign
AT woodangelam acomparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign
AT howsonjoannamm acomparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign
AT staleyjamesr comparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign
AT jonesedmund comparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign
AT kaptogestephen comparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign
AT butterworthadams comparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign
AT sweetingmichaelj comparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign
AT woodangelam comparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign
AT howsonjoannamm comparisonofcoxandlogisticregressionforuseingenomewideassociationstudiesofcohortandcasecohortdesign