Cargando…

PoCos: Population Covering Locus Sets for Risk Assessment in Complex Diseases

Susceptibility loci identified by GWAS generally account for a limited fraction of heritability. Predictive models based on identified loci also have modest success in risk assessment and therefore are of limited practical use. Many methods have been developed to overcome these limitations by incorp...

Descripción completa

Detalles Bibliográficos
Autores principales: Ayati, Marzieh, Koyutürk, Mehmet
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5105987/
https://www.ncbi.nlm.nih.gov/pubmed/27835645
http://dx.doi.org/10.1371/journal.pcbi.1005195
_version_ 1782466970597195776
author Ayati, Marzieh
Koyutürk, Mehmet
author_facet Ayati, Marzieh
Koyutürk, Mehmet
author_sort Ayati, Marzieh
collection PubMed
description Susceptibility loci identified by GWAS generally account for a limited fraction of heritability. Predictive models based on identified loci also have modest success in risk assessment and therefore are of limited practical use. Many methods have been developed to overcome these limitations by incorporating prior biological knowledge. However, most of the information utilized by these methods is at the level of genes, limiting analyses to variants that are in or proximate to coding regions. We propose a new method that integrates protein protein interaction (PPI) as well as expression quantitative trait loci (eQTL) data to identify sets of functionally related loci that are collectively associated with a trait of interest. We call such sets of loci “population covering locus sets” (PoCos). The contributions of the proposed approach are three-fold: 1) We consider all possible genotype models for each locus, thereby enabling identification of combinatorial relationships between multiple loci. 2) We develop a framework for the integration of PPI and eQTL into a heterogenous network model, enabling efficient identification of functionally related variants that are associated with the disease. 3) We develop a novel method to integrate the genotypes of multiple loci in a PoCo into a representative genotype to be used in risk assessment. We test the proposed framework in the context of risk assessment for seven complex diseases, type 1 diabetes (T1D), type 2 diabetes (T2D), psoriasis (PS), bipolar disorder (BD), coronary artery disease (CAD), hypertension (HT), and multiple sclerosis (MS). Our results show that the proposed method significantly outperforms individual variant based risk assessment models as well as the state-of-the-art polygenic score. We also show that incorporation of eQTL data improves the performance of identified POCOs in risk assessment. We also assess the biological relevance of PoCos for three diseases that have similar biological mechanisms and identify novel candidate genes. The resulting software is publicly available at http://compbio.case.edu/pocos/.
format Online
Article
Text
id pubmed-5105987
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-51059872016-12-08 PoCos: Population Covering Locus Sets for Risk Assessment in Complex Diseases Ayati, Marzieh Koyutürk, Mehmet PLoS Comput Biol Research Article Susceptibility loci identified by GWAS generally account for a limited fraction of heritability. Predictive models based on identified loci also have modest success in risk assessment and therefore are of limited practical use. Many methods have been developed to overcome these limitations by incorporating prior biological knowledge. However, most of the information utilized by these methods is at the level of genes, limiting analyses to variants that are in or proximate to coding regions. We propose a new method that integrates protein protein interaction (PPI) as well as expression quantitative trait loci (eQTL) data to identify sets of functionally related loci that are collectively associated with a trait of interest. We call such sets of loci “population covering locus sets” (PoCos). The contributions of the proposed approach are three-fold: 1) We consider all possible genotype models for each locus, thereby enabling identification of combinatorial relationships between multiple loci. 2) We develop a framework for the integration of PPI and eQTL into a heterogenous network model, enabling efficient identification of functionally related variants that are associated with the disease. 3) We develop a novel method to integrate the genotypes of multiple loci in a PoCo into a representative genotype to be used in risk assessment. We test the proposed framework in the context of risk assessment for seven complex diseases, type 1 diabetes (T1D), type 2 diabetes (T2D), psoriasis (PS), bipolar disorder (BD), coronary artery disease (CAD), hypertension (HT), and multiple sclerosis (MS). Our results show that the proposed method significantly outperforms individual variant based risk assessment models as well as the state-of-the-art polygenic score. We also show that incorporation of eQTL data improves the performance of identified POCOs in risk assessment. We also assess the biological relevance of PoCos for three diseases that have similar biological mechanisms and identify novel candidate genes. The resulting software is publicly available at http://compbio.case.edu/pocos/. Public Library of Science 2016-11-11 /pmc/articles/PMC5105987/ /pubmed/27835645 http://dx.doi.org/10.1371/journal.pcbi.1005195 Text en © 2016 Ayati, Koyutürk http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Ayati, Marzieh
Koyutürk, Mehmet
PoCos: Population Covering Locus Sets for Risk Assessment in Complex Diseases
title PoCos: Population Covering Locus Sets for Risk Assessment in Complex Diseases
title_full PoCos: Population Covering Locus Sets for Risk Assessment in Complex Diseases
title_fullStr PoCos: Population Covering Locus Sets for Risk Assessment in Complex Diseases
title_full_unstemmed PoCos: Population Covering Locus Sets for Risk Assessment in Complex Diseases
title_short PoCos: Population Covering Locus Sets for Risk Assessment in Complex Diseases
title_sort pocos: population covering locus sets for risk assessment in complex diseases
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5105987/
https://www.ncbi.nlm.nih.gov/pubmed/27835645
http://dx.doi.org/10.1371/journal.pcbi.1005195
work_keys_str_mv AT ayatimarzieh pocospopulationcoveringlocussetsforriskassessmentincomplexdiseases
AT koyuturkmehmet pocospopulationcoveringlocussetsforriskassessmentincomplexdiseases