Cargando…
STAMS: STRING-assisted module search for genome wide association studies and application to autism
Motivation: Analyzing genome wide association data in the context of biological pathways helps us understand how genetic variation influences phenotype and increases power to find associations. However, the utility of pathway-based analysis tools is hampered by undercuration and reliance on a distri...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5167061/ https://www.ncbi.nlm.nih.gov/pubmed/27542772 http://dx.doi.org/10.1093/bioinformatics/btw530 |
_version_ | 1782483129104072704 |
---|---|
author | Hillenmeyer, Sara Davis, Lea K. Gamazon, Eric R. Cook, Edwin H. Cox, Nancy J. Altman, Russ B. |
author_facet | Hillenmeyer, Sara Davis, Lea K. Gamazon, Eric R. Cook, Edwin H. Cox, Nancy J. Altman, Russ B. |
author_sort | Hillenmeyer, Sara |
collection | PubMed |
description | Motivation: Analyzing genome wide association data in the context of biological pathways helps us understand how genetic variation influences phenotype and increases power to find associations. However, the utility of pathway-based analysis tools is hampered by undercuration and reliance on a distribution of signal across all of the genes in a pathway. Methods that combine genome wide association results with genetic networks to infer the key phenotype-modulating subnetworks combat these issues, but have primarily been limited to network definitions with yes/no labels for gene-gene interactions. A recent method (EW_dmGWAS) incorporates a biological network with weighted edge probability by requiring a secondary phenotype-specific expression dataset. In this article, we combine an algorithm for weighted-edge module searching and a probabilistic interaction network in order to develop a method, STAMS, for recovering modules of genes with strong associations to the phenotype and probable biologic coherence. Our method builds on EW_dmGWAS but does not require a secondary expression dataset and performs better in six test cases. Results: We show that our algorithm improves over EW_dmGWAS and standard gene-based analysis by measuring precision and recall of each method on separately identified associations. In the Wellcome Trust Rheumatoid Arthritis study, STAMS-identified modules were more enriched for separately identified associations than EW_dmGWAS (STAMS P-value 3.0 × 10(−4); EW_dmGWAS- P-value = 0.8). We demonstrate that the area under the Precision-Recall curve is 5.9 times higher with STAMS than EW_dmGWAS run on the Wellcome Trust Type 1 Diabetes data. Availability and Implementation: STAMS is implemented as an R package and is freely available at https://simtk.org/projects/stams. Contact: rbaltman@stanford.edu Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-5167061 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-51670612016-12-20 STAMS: STRING-assisted module search for genome wide association studies and application to autism Hillenmeyer, Sara Davis, Lea K. Gamazon, Eric R. Cook, Edwin H. Cox, Nancy J. Altman, Russ B. Bioinformatics Original Papers Motivation: Analyzing genome wide association data in the context of biological pathways helps us understand how genetic variation influences phenotype and increases power to find associations. However, the utility of pathway-based analysis tools is hampered by undercuration and reliance on a distribution of signal across all of the genes in a pathway. Methods that combine genome wide association results with genetic networks to infer the key phenotype-modulating subnetworks combat these issues, but have primarily been limited to network definitions with yes/no labels for gene-gene interactions. A recent method (EW_dmGWAS) incorporates a biological network with weighted edge probability by requiring a secondary phenotype-specific expression dataset. In this article, we combine an algorithm for weighted-edge module searching and a probabilistic interaction network in order to develop a method, STAMS, for recovering modules of genes with strong associations to the phenotype and probable biologic coherence. Our method builds on EW_dmGWAS but does not require a secondary expression dataset and performs better in six test cases. Results: We show that our algorithm improves over EW_dmGWAS and standard gene-based analysis by measuring precision and recall of each method on separately identified associations. In the Wellcome Trust Rheumatoid Arthritis study, STAMS-identified modules were more enriched for separately identified associations than EW_dmGWAS (STAMS P-value 3.0 × 10(−4); EW_dmGWAS- P-value = 0.8). We demonstrate that the area under the Precision-Recall curve is 5.9 times higher with STAMS than EW_dmGWAS run on the Wellcome Trust Type 1 Diabetes data. Availability and Implementation: STAMS is implemented as an R package and is freely available at https://simtk.org/projects/stams. Contact: rbaltman@stanford.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-12-15 2016-08-19 /pmc/articles/PMC5167061/ /pubmed/27542772 http://dx.doi.org/10.1093/bioinformatics/btw530 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Hillenmeyer, Sara Davis, Lea K. Gamazon, Eric R. Cook, Edwin H. Cox, Nancy J. Altman, Russ B. STAMS: STRING-assisted module search for genome wide association studies and application to autism |
title | STAMS: STRING-assisted module search for genome wide association studies and application to autism |
title_full | STAMS: STRING-assisted module search for genome wide association studies and application to autism |
title_fullStr | STAMS: STRING-assisted module search for genome wide association studies and application to autism |
title_full_unstemmed | STAMS: STRING-assisted module search for genome wide association studies and application to autism |
title_short | STAMS: STRING-assisted module search for genome wide association studies and application to autism |
title_sort | stams: string-assisted module search for genome wide association studies and application to autism |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5167061/ https://www.ncbi.nlm.nih.gov/pubmed/27542772 http://dx.doi.org/10.1093/bioinformatics/btw530 |
work_keys_str_mv | AT hillenmeyersara stamsstringassistedmodulesearchforgenomewideassociationstudiesandapplicationtoautism AT davisleak stamsstringassistedmodulesearchforgenomewideassociationstudiesandapplicationtoautism AT gamazonericr stamsstringassistedmodulesearchforgenomewideassociationstudiesandapplicationtoautism AT cookedwinh stamsstringassistedmodulesearchforgenomewideassociationstudiesandapplicationtoautism AT coxnancyj stamsstringassistedmodulesearchforgenomewideassociationstudiesandapplicationtoautism AT altmanrussb stamsstringassistedmodulesearchforgenomewideassociationstudiesandapplicationtoautism |