Cargando…

STAMS: STRING-assisted module search for genome wide association studies and application to autism

Motivation: Analyzing genome wide association data in the context of biological pathways helps us understand how genetic variation influences phenotype and increases power to find associations. However, the utility of pathway-based analysis tools is hampered by undercuration and reliance on a distri...

Descripción completa

Detalles Bibliográficos
Autores principales: Hillenmeyer, Sara, Davis, Lea K., Gamazon, Eric R., Cook, Edwin H., Cox, Nancy J., Altman, Russ B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5167061/
https://www.ncbi.nlm.nih.gov/pubmed/27542772
http://dx.doi.org/10.1093/bioinformatics/btw530
_version_ 1782483129104072704
author Hillenmeyer, Sara
Davis, Lea K.
Gamazon, Eric R.
Cook, Edwin H.
Cox, Nancy J.
Altman, Russ B.
author_facet Hillenmeyer, Sara
Davis, Lea K.
Gamazon, Eric R.
Cook, Edwin H.
Cox, Nancy J.
Altman, Russ B.
author_sort Hillenmeyer, Sara
collection PubMed
description Motivation: Analyzing genome wide association data in the context of biological pathways helps us understand how genetic variation influences phenotype and increases power to find associations. However, the utility of pathway-based analysis tools is hampered by undercuration and reliance on a distribution of signal across all of the genes in a pathway. Methods that combine genome wide association results with genetic networks to infer the key phenotype-modulating subnetworks combat these issues, but have primarily been limited to network definitions with yes/no labels for gene-gene interactions. A recent method (EW_dmGWAS) incorporates a biological network with weighted edge probability by requiring a secondary phenotype-specific expression dataset. In this article, we combine an algorithm for weighted-edge module searching and a probabilistic interaction network in order to develop a method, STAMS, for recovering modules of genes with strong associations to the phenotype and probable biologic coherence. Our method builds on EW_dmGWAS but does not require a secondary expression dataset and performs better in six test cases. Results: We show that our algorithm improves over EW_dmGWAS and standard gene-based analysis by measuring precision and recall of each method on separately identified associations. In the Wellcome Trust Rheumatoid Arthritis study, STAMS-identified modules were more enriched for separately identified associations than EW_dmGWAS (STAMS P-value 3.0 × 10(−4); EW_dmGWAS- P-value = 0.8). We demonstrate that the area under the Precision-Recall curve is 5.9 times higher with STAMS than EW_dmGWAS run on the Wellcome Trust Type 1 Diabetes data. Availability and Implementation: STAMS is implemented as an R package and is freely available at https://simtk.org/projects/stams. Contact: rbaltman@stanford.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5167061
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-51670612016-12-20 STAMS: STRING-assisted module search for genome wide association studies and application to autism Hillenmeyer, Sara Davis, Lea K. Gamazon, Eric R. Cook, Edwin H. Cox, Nancy J. Altman, Russ B. Bioinformatics Original Papers Motivation: Analyzing genome wide association data in the context of biological pathways helps us understand how genetic variation influences phenotype and increases power to find associations. However, the utility of pathway-based analysis tools is hampered by undercuration and reliance on a distribution of signal across all of the genes in a pathway. Methods that combine genome wide association results with genetic networks to infer the key phenotype-modulating subnetworks combat these issues, but have primarily been limited to network definitions with yes/no labels for gene-gene interactions. A recent method (EW_dmGWAS) incorporates a biological network with weighted edge probability by requiring a secondary phenotype-specific expression dataset. In this article, we combine an algorithm for weighted-edge module searching and a probabilistic interaction network in order to develop a method, STAMS, for recovering modules of genes with strong associations to the phenotype and probable biologic coherence. Our method builds on EW_dmGWAS but does not require a secondary expression dataset and performs better in six test cases. Results: We show that our algorithm improves over EW_dmGWAS and standard gene-based analysis by measuring precision and recall of each method on separately identified associations. In the Wellcome Trust Rheumatoid Arthritis study, STAMS-identified modules were more enriched for separately identified associations than EW_dmGWAS (STAMS P-value 3.0 × 10(−4); EW_dmGWAS- P-value = 0.8). We demonstrate that the area under the Precision-Recall curve is 5.9 times higher with STAMS than EW_dmGWAS run on the Wellcome Trust Type 1 Diabetes data. Availability and Implementation: STAMS is implemented as an R package and is freely available at https://simtk.org/projects/stams. Contact: rbaltman@stanford.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-12-15 2016-08-19 /pmc/articles/PMC5167061/ /pubmed/27542772 http://dx.doi.org/10.1093/bioinformatics/btw530 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Hillenmeyer, Sara
Davis, Lea K.
Gamazon, Eric R.
Cook, Edwin H.
Cox, Nancy J.
Altman, Russ B.
STAMS: STRING-assisted module search for genome wide association studies and application to autism
title STAMS: STRING-assisted module search for genome wide association studies and application to autism
title_full STAMS: STRING-assisted module search for genome wide association studies and application to autism
title_fullStr STAMS: STRING-assisted module search for genome wide association studies and application to autism
title_full_unstemmed STAMS: STRING-assisted module search for genome wide association studies and application to autism
title_short STAMS: STRING-assisted module search for genome wide association studies and application to autism
title_sort stams: string-assisted module search for genome wide association studies and application to autism
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5167061/
https://www.ncbi.nlm.nih.gov/pubmed/27542772
http://dx.doi.org/10.1093/bioinformatics/btw530
work_keys_str_mv AT hillenmeyersara stamsstringassistedmodulesearchforgenomewideassociationstudiesandapplicationtoautism
AT davisleak stamsstringassistedmodulesearchforgenomewideassociationstudiesandapplicationtoautism
AT gamazonericr stamsstringassistedmodulesearchforgenomewideassociationstudiesandapplicationtoautism
AT cookedwinh stamsstringassistedmodulesearchforgenomewideassociationstudiesandapplicationtoautism
AT coxnancyj stamsstringassistedmodulesearchforgenomewideassociationstudiesandapplicationtoautism
AT altmanrussb stamsstringassistedmodulesearchforgenomewideassociationstudiesandapplicationtoautism