Cargando…

Semi-supervised multi-task learning for predicting interactions between HIV-1 and human proteins

Motivation: Protein–protein interactions (PPIs) are critical for virtually every biological function. Recently, researchers suggested to use supervised learning for the task of classifying pairs of proteins as interacting or not. However, its performance is largely restricted by the availability of...

Descripción completa

Detalles Bibliográficos
Autores principales: Qi, Yanjun, Tastan, Oznur, Carbonell, Jaime G., Klein-Seetharaman, Judith, Weston, Jason
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935441/
https://www.ncbi.nlm.nih.gov/pubmed/20823334
http://dx.doi.org/10.1093/bioinformatics/btq394
_version_ 1782186403821518848
author Qi, Yanjun
Tastan, Oznur
Carbonell, Jaime G.
Klein-Seetharaman, Judith
Weston, Jason
author_facet Qi, Yanjun
Tastan, Oznur
Carbonell, Jaime G.
Klein-Seetharaman, Judith
Weston, Jason
author_sort Qi, Yanjun
collection PubMed
description Motivation: Protein–protein interactions (PPIs) are critical for virtually every biological function. Recently, researchers suggested to use supervised learning for the task of classifying pairs of proteins as interacting or not. However, its performance is largely restricted by the availability of truly interacting proteins (labeled). Meanwhile, there exists a considerable amount of protein pairs where an association appears between two partners, but not enough experimental evidence to support it as a direct interaction (partially labeled). Results: We propose a semi-supervised multi-task framework for predicting PPIs from not only labeled, but also partially labeled reference sets. The basic idea is to perform multi-task learning on a supervised classification task and a semi-supervised auxiliary task. The supervised classifier trains a multi-layer perceptron network for PPI predictions from labeled examples. The semi-supervised auxiliary task shares network layers of the supervised classifier and trains with partially labeled examples. Semi-supervision could be utilized in multiple ways. We tried three approaches in this article, (i) classification (to distinguish partial positives with negatives); (ii) ranking (to rate partial positive more likely than negatives); (iii) embedding (to make data clusters get similar labels). We applied this framework to improve the identification of interacting pairs between HIV-1 and human proteins. Our method improved upon the state-of-the-art method for this task indicating the benefits of semi-supervised multi-task learning using auxiliary information. Availability: http://www.cs.cmu.edu/∼qyj/HIVsemi Contact: qyj@cs.cmu.edu
format Text
id pubmed-2935441
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-29354412010-09-08 Semi-supervised multi-task learning for predicting interactions between HIV-1 and human proteins Qi, Yanjun Tastan, Oznur Carbonell, Jaime G. Klein-Seetharaman, Judith Weston, Jason Bioinformatics Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium Motivation: Protein–protein interactions (PPIs) are critical for virtually every biological function. Recently, researchers suggested to use supervised learning for the task of classifying pairs of proteins as interacting or not. However, its performance is largely restricted by the availability of truly interacting proteins (labeled). Meanwhile, there exists a considerable amount of protein pairs where an association appears between two partners, but not enough experimental evidence to support it as a direct interaction (partially labeled). Results: We propose a semi-supervised multi-task framework for predicting PPIs from not only labeled, but also partially labeled reference sets. The basic idea is to perform multi-task learning on a supervised classification task and a semi-supervised auxiliary task. The supervised classifier trains a multi-layer perceptron network for PPI predictions from labeled examples. The semi-supervised auxiliary task shares network layers of the supervised classifier and trains with partially labeled examples. Semi-supervision could be utilized in multiple ways. We tried three approaches in this article, (i) classification (to distinguish partial positives with negatives); (ii) ranking (to rate partial positive more likely than negatives); (iii) embedding (to make data clusters get similar labels). We applied this framework to improve the identification of interacting pairs between HIV-1 and human proteins. Our method improved upon the state-of-the-art method for this task indicating the benefits of semi-supervised multi-task learning using auxiliary information. Availability: http://www.cs.cmu.edu/∼qyj/HIVsemi Contact: qyj@cs.cmu.edu Oxford University Press 2010-09-15 2010-09-04 /pmc/articles/PMC2935441/ /pubmed/20823334 http://dx.doi.org/10.1093/bioinformatics/btq394 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium
Qi, Yanjun
Tastan, Oznur
Carbonell, Jaime G.
Klein-Seetharaman, Judith
Weston, Jason
Semi-supervised multi-task learning for predicting interactions between HIV-1 and human proteins
title Semi-supervised multi-task learning for predicting interactions between HIV-1 and human proteins
title_full Semi-supervised multi-task learning for predicting interactions between HIV-1 and human proteins
title_fullStr Semi-supervised multi-task learning for predicting interactions between HIV-1 and human proteins
title_full_unstemmed Semi-supervised multi-task learning for predicting interactions between HIV-1 and human proteins
title_short Semi-supervised multi-task learning for predicting interactions between HIV-1 and human proteins
title_sort semi-supervised multi-task learning for predicting interactions between hiv-1 and human proteins
topic Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935441/
https://www.ncbi.nlm.nih.gov/pubmed/20823334
http://dx.doi.org/10.1093/bioinformatics/btq394
work_keys_str_mv AT qiyanjun semisupervisedmultitasklearningforpredictinginteractionsbetweenhiv1andhumanproteins
AT tastanoznur semisupervisedmultitasklearningforpredictinginteractionsbetweenhiv1andhumanproteins
AT carbonelljaimeg semisupervisedmultitasklearningforpredictinginteractionsbetweenhiv1andhumanproteins
AT kleinseetharamanjudith semisupervisedmultitasklearningforpredictinginteractionsbetweenhiv1andhumanproteins
AT westonjason semisupervisedmultitasklearningforpredictinginteractionsbetweenhiv1andhumanproteins