Cargando…

Cox regression increases power to detect genotype-phenotype associations in genomic studies using the electronic health record

BACKGROUND: The growth of DNA biobanks linked to data from electronic health records (EHRs) has enabled the discovery of numerous associations between genomic variants and clinical phenotypes. Nonetheless, although clinical data are generally longitudinal, standard approaches for detecting genotype-...

Descripción completa

Detalles Bibliográficos
Autores principales: Hughey, Jacob J., Rhoades, Seth D., Fu, Darwin Y., Bastarache, Lisa, Denny, Joshua C., Chen, Qingxia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6829851/
https://www.ncbi.nlm.nih.gov/pubmed/31684865
http://dx.doi.org/10.1186/s12864-019-6192-1
_version_ 1783465655121477632
author Hughey, Jacob J.
Rhoades, Seth D.
Fu, Darwin Y.
Bastarache, Lisa
Denny, Joshua C.
Chen, Qingxia
author_facet Hughey, Jacob J.
Rhoades, Seth D.
Fu, Darwin Y.
Bastarache, Lisa
Denny, Joshua C.
Chen, Qingxia
author_sort Hughey, Jacob J.
collection PubMed
description BACKGROUND: The growth of DNA biobanks linked to data from electronic health records (EHRs) has enabled the discovery of numerous associations between genomic variants and clinical phenotypes. Nonetheless, although clinical data are generally longitudinal, standard approaches for detecting genotype-phenotype associations in such linked data, notably logistic regression, do not naturally account for variation in the period of follow-up or the time at which an event occurs. Here we explored the advantages of quantifying associations using Cox proportional hazards regression, which can account for the age at which a patient first visited the healthcare system (left truncation) and the age at which a patient either last visited the healthcare system or acquired a particular phenotype (right censoring). RESULTS: In comprehensive simulations, we found that, compared to logistic regression, Cox regression had greater power at equivalent Type I error. We then scanned for genotype-phenotype associations using logistic regression and Cox regression on 50 phenotypes derived from the EHRs of 49,792 genotyped individuals. Consistent with the findings from our simulations, Cox regression had approximately 10% greater relative sensitivity for detecting known associations from the NHGRI-EBI GWAS Catalog. In terms of effect sizes, the hazard ratios estimated by Cox regression were strongly correlated with the odds ratios estimated by logistic regression. CONCLUSIONS: As longitudinal health-related data continue to grow, Cox regression may improve our ability to identify the genetic basis for a wide range of human phenotypes.
format Online
Article
Text
id pubmed-6829851
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68298512019-11-07 Cox regression increases power to detect genotype-phenotype associations in genomic studies using the electronic health record Hughey, Jacob J. Rhoades, Seth D. Fu, Darwin Y. Bastarache, Lisa Denny, Joshua C. Chen, Qingxia BMC Genomics Research Article BACKGROUND: The growth of DNA biobanks linked to data from electronic health records (EHRs) has enabled the discovery of numerous associations between genomic variants and clinical phenotypes. Nonetheless, although clinical data are generally longitudinal, standard approaches for detecting genotype-phenotype associations in such linked data, notably logistic regression, do not naturally account for variation in the period of follow-up or the time at which an event occurs. Here we explored the advantages of quantifying associations using Cox proportional hazards regression, which can account for the age at which a patient first visited the healthcare system (left truncation) and the age at which a patient either last visited the healthcare system or acquired a particular phenotype (right censoring). RESULTS: In comprehensive simulations, we found that, compared to logistic regression, Cox regression had greater power at equivalent Type I error. We then scanned for genotype-phenotype associations using logistic regression and Cox regression on 50 phenotypes derived from the EHRs of 49,792 genotyped individuals. Consistent with the findings from our simulations, Cox regression had approximately 10% greater relative sensitivity for detecting known associations from the NHGRI-EBI GWAS Catalog. In terms of effect sizes, the hazard ratios estimated by Cox regression were strongly correlated with the odds ratios estimated by logistic regression. CONCLUSIONS: As longitudinal health-related data continue to grow, Cox regression may improve our ability to identify the genetic basis for a wide range of human phenotypes. BioMed Central 2019-11-04 /pmc/articles/PMC6829851/ /pubmed/31684865 http://dx.doi.org/10.1186/s12864-019-6192-1 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Hughey, Jacob J.
Rhoades, Seth D.
Fu, Darwin Y.
Bastarache, Lisa
Denny, Joshua C.
Chen, Qingxia
Cox regression increases power to detect genotype-phenotype associations in genomic studies using the electronic health record
title Cox regression increases power to detect genotype-phenotype associations in genomic studies using the electronic health record
title_full Cox regression increases power to detect genotype-phenotype associations in genomic studies using the electronic health record
title_fullStr Cox regression increases power to detect genotype-phenotype associations in genomic studies using the electronic health record
title_full_unstemmed Cox regression increases power to detect genotype-phenotype associations in genomic studies using the electronic health record
title_short Cox regression increases power to detect genotype-phenotype associations in genomic studies using the electronic health record
title_sort cox regression increases power to detect genotype-phenotype associations in genomic studies using the electronic health record
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6829851/
https://www.ncbi.nlm.nih.gov/pubmed/31684865
http://dx.doi.org/10.1186/s12864-019-6192-1
work_keys_str_mv AT hugheyjacobj coxregressionincreasespowertodetectgenotypephenotypeassociationsingenomicstudiesusingtheelectronichealthrecord
AT rhoadessethd coxregressionincreasespowertodetectgenotypephenotypeassociationsingenomicstudiesusingtheelectronichealthrecord
AT fudarwiny coxregressionincreasespowertodetectgenotypephenotypeassociationsingenomicstudiesusingtheelectronichealthrecord
AT bastarachelisa coxregressionincreasespowertodetectgenotypephenotypeassociationsingenomicstudiesusingtheelectronichealthrecord
AT dennyjoshuac coxregressionincreasespowertodetectgenotypephenotypeassociationsingenomicstudiesusingtheelectronichealthrecord
AT chenqingxia coxregressionincreasespowertodetectgenotypephenotypeassociationsingenomicstudiesusingtheelectronichealthrecord