Cargando…

Complex-disease networks of trait-associated single-nucleotide polymorphisms (SNPs) unveiled by information theory

OBJECTIVE: Thousands of complex-disease single-nucleotide polymorphisms (SNPs) have been discovered in genome-wide association studies (GWAS). However, these intragenic SNPs have not been collectively mined to unveil the genetic architecture between complex clinical traits. The authors hypothesize t...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Haiquan, Lee, Younghee, Chen, James L, Rebman, Ellen, Li, Jianrong, Lussier, Yves A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Group 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3277620/
https://www.ncbi.nlm.nih.gov/pubmed/22278381
http://dx.doi.org/10.1136/amiajnl-2011-000482
_version_ 1782223508292501504
author Li, Haiquan
Lee, Younghee
Chen, James L
Rebman, Ellen
Li, Jianrong
Lussier, Yves A
author_facet Li, Haiquan
Lee, Younghee
Chen, James L
Rebman, Ellen
Li, Jianrong
Lussier, Yves A
author_sort Li, Haiquan
collection PubMed
description OBJECTIVE: Thousands of complex-disease single-nucleotide polymorphisms (SNPs) have been discovered in genome-wide association studies (GWAS). However, these intragenic SNPs have not been collectively mined to unveil the genetic architecture between complex clinical traits. The authors hypothesize that biological annotations of host genes of trait-associated SNPs may reveal the biomolecular modularity across complex-disease traits and offer insights for drug repositioning. METHODS: Trait-to-polymorphism (SNPs) associations confirmed in GWAS were used. A novel method to quantify trait–trait similarity anchored in Gene Ontology annotations of human proteins and information theory was developed. The results were then validated with the shortest paths of physical protein interactions between biologically similar traits. RESULTS: A network was constructed consisting of 280 significant intertrait similarities among 177 disease traits, which covered 1438 well-validated disease-associated SNPs. Thirty-nine percent of intertrait connections were confirmed by curators, and the following additional studies demonstrated the validity of a proportion of the remainder. On a phenotypic trait level, higher Gene Ontology similarity between proteins correlated with smaller ‘shortest distance’ in protein interaction networks of complexly inherited diseases (Spearman p<2.2×10(−16)). Further, ‘cancer traits’ were similar to one another, as were ‘metabolic syndrome traits’ (Fisher's exact test p=0.001 and 3.5×10(−7), respectively). CONCLUSION: An imputed disease network by information-anchored functional similarity from GWAS trait-associated SNPs is reported. It is also demonstrated that small shortest paths of protein interactions correlate with complex-disease function. Taken together, these findings provide the framework for investigating drug targets with unbiased functional biomolecular networks rather than worn-out single-gene and subjective canonical pathway approaches.
format Online
Article
Text
id pubmed-3277620
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BMJ Group
record_format MEDLINE/PubMed
spelling pubmed-32776202012-02-13 Complex-disease networks of trait-associated single-nucleotide polymorphisms (SNPs) unveiled by information theory Li, Haiquan Lee, Younghee Chen, James L Rebman, Ellen Li, Jianrong Lussier, Yves A J Am Med Inform Assoc Research and Applications OBJECTIVE: Thousands of complex-disease single-nucleotide polymorphisms (SNPs) have been discovered in genome-wide association studies (GWAS). However, these intragenic SNPs have not been collectively mined to unveil the genetic architecture between complex clinical traits. The authors hypothesize that biological annotations of host genes of trait-associated SNPs may reveal the biomolecular modularity across complex-disease traits and offer insights for drug repositioning. METHODS: Trait-to-polymorphism (SNPs) associations confirmed in GWAS were used. A novel method to quantify trait–trait similarity anchored in Gene Ontology annotations of human proteins and information theory was developed. The results were then validated with the shortest paths of physical protein interactions between biologically similar traits. RESULTS: A network was constructed consisting of 280 significant intertrait similarities among 177 disease traits, which covered 1438 well-validated disease-associated SNPs. Thirty-nine percent of intertrait connections were confirmed by curators, and the following additional studies demonstrated the validity of a proportion of the remainder. On a phenotypic trait level, higher Gene Ontology similarity between proteins correlated with smaller ‘shortest distance’ in protein interaction networks of complexly inherited diseases (Spearman p<2.2×10(−16)). Further, ‘cancer traits’ were similar to one another, as were ‘metabolic syndrome traits’ (Fisher's exact test p=0.001 and 3.5×10(−7), respectively). CONCLUSION: An imputed disease network by information-anchored functional similarity from GWAS trait-associated SNPs is reported. It is also demonstrated that small shortest paths of protein interactions correlate with complex-disease function. Taken together, these findings provide the framework for investigating drug targets with unbiased functional biomolecular networks rather than worn-out single-gene and subjective canonical pathway approaches. BMJ Group 2012-01-25 2012 /pmc/articles/PMC3277620/ /pubmed/22278381 http://dx.doi.org/10.1136/amiajnl-2011-000482 Text en © 2012, Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions. This is an open-access article distributed under the terms of the Creative Commons Attribution Non-commercial License, which permits use, distribution, and reproduction in any medium, provided the original work is properly cited, the use is non commercial and is otherwise in compliance with the license. See: http://creativecommons.org/licenses/by-nc/2.0/ and http://creativecommons.org/licenses/by-nc/2.0/legalcode.
spellingShingle Research and Applications
Li, Haiquan
Lee, Younghee
Chen, James L
Rebman, Ellen
Li, Jianrong
Lussier, Yves A
Complex-disease networks of trait-associated single-nucleotide polymorphisms (SNPs) unveiled by information theory
title Complex-disease networks of trait-associated single-nucleotide polymorphisms (SNPs) unveiled by information theory
title_full Complex-disease networks of trait-associated single-nucleotide polymorphisms (SNPs) unveiled by information theory
title_fullStr Complex-disease networks of trait-associated single-nucleotide polymorphisms (SNPs) unveiled by information theory
title_full_unstemmed Complex-disease networks of trait-associated single-nucleotide polymorphisms (SNPs) unveiled by information theory
title_short Complex-disease networks of trait-associated single-nucleotide polymorphisms (SNPs) unveiled by information theory
title_sort complex-disease networks of trait-associated single-nucleotide polymorphisms (snps) unveiled by information theory
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3277620/
https://www.ncbi.nlm.nih.gov/pubmed/22278381
http://dx.doi.org/10.1136/amiajnl-2011-000482
work_keys_str_mv AT lihaiquan complexdiseasenetworksoftraitassociatedsinglenucleotidepolymorphismssnpsunveiledbyinformationtheory
AT leeyounghee complexdiseasenetworksoftraitassociatedsinglenucleotidepolymorphismssnpsunveiledbyinformationtheory
AT chenjamesl complexdiseasenetworksoftraitassociatedsinglenucleotidepolymorphismssnpsunveiledbyinformationtheory
AT rebmanellen complexdiseasenetworksoftraitassociatedsinglenucleotidepolymorphismssnpsunveiledbyinformationtheory
AT lijianrong complexdiseasenetworksoftraitassociatedsinglenucleotidepolymorphismssnpsunveiledbyinformationtheory
AT lussieryvesa complexdiseasenetworksoftraitassociatedsinglenucleotidepolymorphismssnpsunveiledbyinformationtheory