Cargando…

Probabilistic Inference of Biological Networks via Data Integration

There is significant interest in inferring the structure of subcellular networks of interaction. Here we consider supervised interactive network inference in which a reference set of known network links and nonlinks is used to train a classifier for predicting new links. Many types of data are relev...

Descripción completa

Detalles Bibliográficos
Autores principales:	Rogers, Mark F., Campbell, Colin, Ying, Yiming
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi Publishing Corporation 2015
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4385617/ https://www.ncbi.nlm.nih.gov/pubmed/25874225 http://dx.doi.org/10.1155/2015/707453

_version_	1782365059235708928
author	Rogers, Mark F. Campbell, Colin Ying, Yiming
author_facet	Rogers, Mark F. Campbell, Colin Ying, Yiming
author_sort	Rogers, Mark F.
collection	PubMed
description	There is significant interest in inferring the structure of subcellular networks of interaction. Here we consider supervised interactive network inference in which a reference set of known network links and nonlinks is used to train a classifier for predicting new links. Many types of data are relevant to inferring functional links between genes, motivating the use of data integration. We use pairwise kernels to predict novel links, along with multiple kernel learning to integrate distinct sources of data into a decision function. We evaluate various pairwise kernels to establish which are most informative and compare individual kernel accuracies with accuracies for weighted combinations. By associating a probability measure with classifier predictions, we enable cautious classification, which can increase accuracy by restricting predictions to high-confidence instances, and data cleaning that can mitigate the influence of mislabeled training instances. Although one pairwise kernel (the tensor product pairwise kernel) appears to work best, different kernels may contribute complimentary information about interactions: experiments in S. cerevisiae (yeast) reveal that a weighted combination of pairwise kernels applied to different types of data yields the highest predictive accuracy. Combined with cautious classification and data cleaning, we can achieve predictive accuracies of up to 99.6%.
format	Online Article Text
id	pubmed-4385617
institution	National Center for Biotechnology Information
language	English
publishDate	2015
publisher	Hindawi Publishing Corporation
record_format	MEDLINE/PubMed
spelling	pubmed-43856172015-04-13 Probabilistic Inference of Biological Networks via Data Integration Rogers, Mark F. Campbell, Colin Ying, Yiming Biomed Res Int Research Article There is significant interest in inferring the structure of subcellular networks of interaction. Here we consider supervised interactive network inference in which a reference set of known network links and nonlinks is used to train a classifier for predicting new links. Many types of data are relevant to inferring functional links between genes, motivating the use of data integration. We use pairwise kernels to predict novel links, along with multiple kernel learning to integrate distinct sources of data into a decision function. We evaluate various pairwise kernels to establish which are most informative and compare individual kernel accuracies with accuracies for weighted combinations. By associating a probability measure with classifier predictions, we enable cautious classification, which can increase accuracy by restricting predictions to high-confidence instances, and data cleaning that can mitigate the influence of mislabeled training instances. Although one pairwise kernel (the tensor product pairwise kernel) appears to work best, different kernels may contribute complimentary information about interactions: experiments in S. cerevisiae (yeast) reveal that a weighted combination of pairwise kernels applied to different types of data yields the highest predictive accuracy. Combined with cautious classification and data cleaning, we can achieve predictive accuracies of up to 99.6%. Hindawi Publishing Corporation 2015 2015-03-22 /pmc/articles/PMC4385617/ /pubmed/25874225 http://dx.doi.org/10.1155/2015/707453 Text en Copyright © 2015 Mark F. Rogers et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Rogers, Mark F. Campbell, Colin Ying, Yiming Probabilistic Inference of Biological Networks via Data Integration
title	Probabilistic Inference of Biological Networks via Data Integration
title_full	Probabilistic Inference of Biological Networks via Data Integration
title_fullStr	Probabilistic Inference of Biological Networks via Data Integration
title_full_unstemmed	Probabilistic Inference of Biological Networks via Data Integration
title_short	Probabilistic Inference of Biological Networks via Data Integration
title_sort	probabilistic inference of biological networks via data integration
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4385617/ https://www.ncbi.nlm.nih.gov/pubmed/25874225 http://dx.doi.org/10.1155/2015/707453
work_keys_str_mv	AT rogersmarkf probabilisticinferenceofbiologicalnetworksviadataintegration AT campbellcolin probabilisticinferenceofbiologicalnetworksviadataintegration AT yingyiming probabilisticinferenceofbiologicalnetworksviadataintegration

Probabilistic Inference of Biological Networks via Data Integration

Ejemplares similares