Cargando…

Probability Weighted Ensemble Transfer Learning for Predicting Interactions between HIV-1 and Human Proteins

Reconstruction of host-pathogen protein interaction networks is of great significance to reveal the underlying microbic pathogenesis. However, the current experimentally-derived networks are generally small and should be augmented by computational methods for less-biased biological inference. From t...

Descripción completa

Detalles Bibliográficos
Autor principal: Mei, Suyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3832534/
https://www.ncbi.nlm.nih.gov/pubmed/24260261
http://dx.doi.org/10.1371/journal.pone.0079606
_version_ 1782291700787445760
author Mei, Suyu
author_facet Mei, Suyu
author_sort Mei, Suyu
collection PubMed
description Reconstruction of host-pathogen protein interaction networks is of great significance to reveal the underlying microbic pathogenesis. However, the current experimentally-derived networks are generally small and should be augmented by computational methods for less-biased biological inference. From the point of view of computational modelling, data scarcity, data unavailability and negative data sampling are the three major problems for host-pathogen protein interaction networks reconstruction. In this work, we are motivated to address the three concerns and propose a probability weighted ensemble transfer learning model for HIV-human protein interaction prediction (PWEN-TLM), where support vector machine (SVM) is adopted as the individual classifier of the ensemble model. In the model, data scarcity and data unavailability are tackled by homolog knowledge transfer. The importance of homolog knowledge is measured by the ROC-AUC metric of the individual classifiers, whose outputs are probability weighted to yield the final decision. In addition, we further validate the assumption that only the homolog knowledge is sufficient to train a satisfactory model for host-pathogen protein interaction prediction. Thus the model is more robust against data unavailability with less demanding data constraint. As regards with negative data construction, experiments show that exclusiveness of subcellular co-localized proteins is unbiased and more reliable than random sampling. Last, we conduct analysis of overlapped predictions between our model and the existing models, and apply the model to novel host-pathogen PPIs recognition for further biological research.
format Online
Article
Text
id pubmed-3832534
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38325342013-11-20 Probability Weighted Ensemble Transfer Learning for Predicting Interactions between HIV-1 and Human Proteins Mei, Suyu PLoS One Research Article Reconstruction of host-pathogen protein interaction networks is of great significance to reveal the underlying microbic pathogenesis. However, the current experimentally-derived networks are generally small and should be augmented by computational methods for less-biased biological inference. From the point of view of computational modelling, data scarcity, data unavailability and negative data sampling are the three major problems for host-pathogen protein interaction networks reconstruction. In this work, we are motivated to address the three concerns and propose a probability weighted ensemble transfer learning model for HIV-human protein interaction prediction (PWEN-TLM), where support vector machine (SVM) is adopted as the individual classifier of the ensemble model. In the model, data scarcity and data unavailability are tackled by homolog knowledge transfer. The importance of homolog knowledge is measured by the ROC-AUC metric of the individual classifiers, whose outputs are probability weighted to yield the final decision. In addition, we further validate the assumption that only the homolog knowledge is sufficient to train a satisfactory model for host-pathogen protein interaction prediction. Thus the model is more robust against data unavailability with less demanding data constraint. As regards with negative data construction, experiments show that exclusiveness of subcellular co-localized proteins is unbiased and more reliable than random sampling. Last, we conduct analysis of overlapped predictions between our model and the existing models, and apply the model to novel host-pathogen PPIs recognition for further biological research. Public Library of Science 2013-11-18 /pmc/articles/PMC3832534/ /pubmed/24260261 http://dx.doi.org/10.1371/journal.pone.0079606 Text en © 2013 Suyu Mei http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Mei, Suyu
Probability Weighted Ensemble Transfer Learning for Predicting Interactions between HIV-1 and Human Proteins
title Probability Weighted Ensemble Transfer Learning for Predicting Interactions between HIV-1 and Human Proteins
title_full Probability Weighted Ensemble Transfer Learning for Predicting Interactions between HIV-1 and Human Proteins
title_fullStr Probability Weighted Ensemble Transfer Learning for Predicting Interactions between HIV-1 and Human Proteins
title_full_unstemmed Probability Weighted Ensemble Transfer Learning for Predicting Interactions between HIV-1 and Human Proteins
title_short Probability Weighted Ensemble Transfer Learning for Predicting Interactions between HIV-1 and Human Proteins
title_sort probability weighted ensemble transfer learning for predicting interactions between hiv-1 and human proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3832534/
https://www.ncbi.nlm.nih.gov/pubmed/24260261
http://dx.doi.org/10.1371/journal.pone.0079606
work_keys_str_mv AT meisuyu probabilityweightedensembletransferlearningforpredictinginteractionsbetweenhiv1andhumanproteins