Cargando…

PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations

Motivation: Emergence of genetic data coupled to longitudinal electronic medical records (EMRs) offers the possibility of phenome-wide association scans (PheWAS) for disease–gene associations. We propose a novel method to scan phenomic data for genetic associations using International Classification...

Descripción completa

Detalles Bibliográficos
Autores principales: Denny, Joshua C., Ritchie, Marylyn D., Basford, Melissa A., Pulley, Jill M., Bastarache, Lisa, Brown-Gentry, Kristin, Wang, Deede, Masys, Dan R., Roden, Dan M., Crawford, Dana C.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2859132/
https://www.ncbi.nlm.nih.gov/pubmed/20335276
http://dx.doi.org/10.1093/bioinformatics/btq126
_version_ 1782180483436642304
author Denny, Joshua C.
Ritchie, Marylyn D.
Basford, Melissa A.
Pulley, Jill M.
Bastarache, Lisa
Brown-Gentry, Kristin
Wang, Deede
Masys, Dan R.
Roden, Dan M.
Crawford, Dana C.
author_facet Denny, Joshua C.
Ritchie, Marylyn D.
Basford, Melissa A.
Pulley, Jill M.
Bastarache, Lisa
Brown-Gentry, Kristin
Wang, Deede
Masys, Dan R.
Roden, Dan M.
Crawford, Dana C.
author_sort Denny, Joshua C.
collection PubMed
description Motivation: Emergence of genetic data coupled to longitudinal electronic medical records (EMRs) offers the possibility of phenome-wide association scans (PheWAS) for disease–gene associations. We propose a novel method to scan phenomic data for genetic associations using International Classification of Disease (ICD9) billing codes, which are available in most EMR systems. We have developed a code translation table to automatically define 776 different disease populations and their controls using prevalent ICD9 codes derived from EMR data. As a proof of concept of this algorithm, we genotyped the first 6005 European–Americans accrued into BioVU, Vanderbilt's DNA biobank, at five single nucleotide polymorphisms (SNPs) with previously reported disease associations: atrial fibrillation, Crohn's disease, carotid artery stenosis, coronary artery disease, multiple sclerosis, systemic lupus erythematosus and rheumatoid arthritis. The PheWAS software generated cases and control populations across all ICD9 code groups for each of these five SNPs, and disease-SNP associations were analyzed. The primary outcome of this study was replication of seven previously known SNP–disease associations for these SNPs. Results: Four of seven known SNP–disease associations using the PheWAS algorithm were replicated with P-values between 2.8 × 10(−6) and 0.011. The PheWAS algorithm also identified 19 previously unknown statistical associations between these SNPs and diseases at P < 0.01. This study indicates that PheWAS analysis is a feasible method to investigate SNP–disease associations. Further evaluation is needed to determine the validity of these associations and the appropriate statistical thresholds for clinical significance. Availability:The PheWAS software and code translation table are freely available at http://knowledgemap.mc.vanderbilt.edu/research. Contact: josh.denny@vanderbilt.edu
format Text
id pubmed-2859132
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28591322010-04-26 PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations Denny, Joshua C. Ritchie, Marylyn D. Basford, Melissa A. Pulley, Jill M. Bastarache, Lisa Brown-Gentry, Kristin Wang, Deede Masys, Dan R. Roden, Dan M. Crawford, Dana C. Bioinformatics Original Papers Motivation: Emergence of genetic data coupled to longitudinal electronic medical records (EMRs) offers the possibility of phenome-wide association scans (PheWAS) for disease–gene associations. We propose a novel method to scan phenomic data for genetic associations using International Classification of Disease (ICD9) billing codes, which are available in most EMR systems. We have developed a code translation table to automatically define 776 different disease populations and their controls using prevalent ICD9 codes derived from EMR data. As a proof of concept of this algorithm, we genotyped the first 6005 European–Americans accrued into BioVU, Vanderbilt's DNA biobank, at five single nucleotide polymorphisms (SNPs) with previously reported disease associations: atrial fibrillation, Crohn's disease, carotid artery stenosis, coronary artery disease, multiple sclerosis, systemic lupus erythematosus and rheumatoid arthritis. The PheWAS software generated cases and control populations across all ICD9 code groups for each of these five SNPs, and disease-SNP associations were analyzed. The primary outcome of this study was replication of seven previously known SNP–disease associations for these SNPs. Results: Four of seven known SNP–disease associations using the PheWAS algorithm were replicated with P-values between 2.8 × 10(−6) and 0.011. The PheWAS algorithm also identified 19 previously unknown statistical associations between these SNPs and diseases at P < 0.01. This study indicates that PheWAS analysis is a feasible method to investigate SNP–disease associations. Further evaluation is needed to determine the validity of these associations and the appropriate statistical thresholds for clinical significance. Availability:The PheWAS software and code translation table are freely available at http://knowledgemap.mc.vanderbilt.edu/research. Contact: josh.denny@vanderbilt.edu Oxford University Press 2010-05-01 2010-03-24 /pmc/articles/PMC2859132/ /pubmed/20335276 http://dx.doi.org/10.1093/bioinformatics/btq126 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Denny, Joshua C.
Ritchie, Marylyn D.
Basford, Melissa A.
Pulley, Jill M.
Bastarache, Lisa
Brown-Gentry, Kristin
Wang, Deede
Masys, Dan R.
Roden, Dan M.
Crawford, Dana C.
PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations
title PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations
title_full PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations
title_fullStr PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations
title_full_unstemmed PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations
title_short PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations
title_sort phewas: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2859132/
https://www.ncbi.nlm.nih.gov/pubmed/20335276
http://dx.doi.org/10.1093/bioinformatics/btq126
work_keys_str_mv AT dennyjoshuac phewasdemonstratingthefeasibilityofaphenomewidescantodiscovergenediseaseassociations
AT ritchiemarylynd phewasdemonstratingthefeasibilityofaphenomewidescantodiscovergenediseaseassociations
AT basfordmelissaa phewasdemonstratingthefeasibilityofaphenomewidescantodiscovergenediseaseassociations
AT pulleyjillm phewasdemonstratingthefeasibilityofaphenomewidescantodiscovergenediseaseassociations
AT bastarachelisa phewasdemonstratingthefeasibilityofaphenomewidescantodiscovergenediseaseassociations
AT browngentrykristin phewasdemonstratingthefeasibilityofaphenomewidescantodiscovergenediseaseassociations
AT wangdeede phewasdemonstratingthefeasibilityofaphenomewidescantodiscovergenediseaseassociations
AT masysdanr phewasdemonstratingthefeasibilityofaphenomewidescantodiscovergenediseaseassociations
AT rodendanm phewasdemonstratingthefeasibilityofaphenomewidescantodiscovergenediseaseassociations
AT crawforddanac phewasdemonstratingthefeasibilityofaphenomewidescantodiscovergenediseaseassociations