Cargando…

Ligand Similarity Complements Sequence, Physical Interaction, and Co-Expression for Gene Function Prediction

The expansion of protein-ligand annotation databases has enabled large-scale networking of proteins by ligand similarity. These ligand-based protein networks, which implicitly predict the ability of neighboring proteins to bind related ligands, may complement biologically-oriented gene networks, whi...

Descripción completa

Detalles Bibliográficos
Autores principales: O’Meara, Matthew J., Ballouz, Sara, Shoichet, Brian K., Gillis, Jesse
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4965129/
https://www.ncbi.nlm.nih.gov/pubmed/27467773
http://dx.doi.org/10.1371/journal.pone.0160098
_version_ 1782445219726229504
author O’Meara, Matthew J.
Ballouz, Sara
Shoichet, Brian K.
Gillis, Jesse
author_facet O’Meara, Matthew J.
Ballouz, Sara
Shoichet, Brian K.
Gillis, Jesse
author_sort O’Meara, Matthew J.
collection PubMed
description The expansion of protein-ligand annotation databases has enabled large-scale networking of proteins by ligand similarity. These ligand-based protein networks, which implicitly predict the ability of neighboring proteins to bind related ligands, may complement biologically-oriented gene networks, which are used to predict functional or disease relevance. To quantify the degree to which such ligand-based protein associations might complement functional genomic associations, including sequence similarity, physical protein-protein interactions, co-expression, and disease gene annotations, we calculated a network based on the Similarity Ensemble Approach (SEA: sea.docking.org), where protein neighbors reflect the similarity of their ligands. We also measured the similarity with functional genomic networks over a common set of 1,131 genes, and found that the networks had only small overlaps, which were significant only due to the large scale of the data. Consistent with the view that the networks contain different information, combining them substantially improved Molecular Function prediction within GO (from AUROC~0.63–0.75 for the individual data modalities to AUROC~0.8 in the aggregate). We investigated the boost in guilt-by-association gene function prediction when the networks are combined and describe underlying properties that can be further exploited.
format Online
Article
Text
id pubmed-4965129
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-49651292016-08-18 Ligand Similarity Complements Sequence, Physical Interaction, and Co-Expression for Gene Function Prediction O’Meara, Matthew J. Ballouz, Sara Shoichet, Brian K. Gillis, Jesse PLoS One Research Article The expansion of protein-ligand annotation databases has enabled large-scale networking of proteins by ligand similarity. These ligand-based protein networks, which implicitly predict the ability of neighboring proteins to bind related ligands, may complement biologically-oriented gene networks, which are used to predict functional or disease relevance. To quantify the degree to which such ligand-based protein associations might complement functional genomic associations, including sequence similarity, physical protein-protein interactions, co-expression, and disease gene annotations, we calculated a network based on the Similarity Ensemble Approach (SEA: sea.docking.org), where protein neighbors reflect the similarity of their ligands. We also measured the similarity with functional genomic networks over a common set of 1,131 genes, and found that the networks had only small overlaps, which were significant only due to the large scale of the data. Consistent with the view that the networks contain different information, combining them substantially improved Molecular Function prediction within GO (from AUROC~0.63–0.75 for the individual data modalities to AUROC~0.8 in the aggregate). We investigated the boost in guilt-by-association gene function prediction when the networks are combined and describe underlying properties that can be further exploited. Public Library of Science 2016-07-28 /pmc/articles/PMC4965129/ /pubmed/27467773 http://dx.doi.org/10.1371/journal.pone.0160098 Text en © 2016 O’Meara et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
O’Meara, Matthew J.
Ballouz, Sara
Shoichet, Brian K.
Gillis, Jesse
Ligand Similarity Complements Sequence, Physical Interaction, and Co-Expression for Gene Function Prediction
title Ligand Similarity Complements Sequence, Physical Interaction, and Co-Expression for Gene Function Prediction
title_full Ligand Similarity Complements Sequence, Physical Interaction, and Co-Expression for Gene Function Prediction
title_fullStr Ligand Similarity Complements Sequence, Physical Interaction, and Co-Expression for Gene Function Prediction
title_full_unstemmed Ligand Similarity Complements Sequence, Physical Interaction, and Co-Expression for Gene Function Prediction
title_short Ligand Similarity Complements Sequence, Physical Interaction, and Co-Expression for Gene Function Prediction
title_sort ligand similarity complements sequence, physical interaction, and co-expression for gene function prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4965129/
https://www.ncbi.nlm.nih.gov/pubmed/27467773
http://dx.doi.org/10.1371/journal.pone.0160098
work_keys_str_mv AT omearamatthewj ligandsimilaritycomplementssequencephysicalinteractionandcoexpressionforgenefunctionprediction
AT ballouzsara ligandsimilaritycomplementssequencephysicalinteractionandcoexpressionforgenefunctionprediction
AT shoichetbriank ligandsimilaritycomplementssequencephysicalinteractionandcoexpressionforgenefunctionprediction
AT gillisjesse ligandsimilaritycomplementssequencephysicalinteractionandcoexpressionforgenefunctionprediction