Cargando…

Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction

Multi-Instance (MI) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with multiple instances. Many studies in this literature attempted to find an appropriate Multi-Instance Learning (MIL) method for genome-wi...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Yonghui, Min, Huaqing, Wu, Qingyao, Song, Hengjie, Ye, Bicui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5292966/
https://www.ncbi.nlm.nih.gov/pubmed/28165495
http://dx.doi.org/10.1038/srep41831
_version_ 1782505020802990080
author Xu, Yonghui
Min, Huaqing
Wu, Qingyao
Song, Hengjie
Ye, Bicui
author_facet Xu, Yonghui
Min, Huaqing
Wu, Qingyao
Song, Hengjie
Ye, Bicui
author_sort Xu, Yonghui
collection PubMed
description Multi-Instance (MI) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with multiple instances. Many studies in this literature attempted to find an appropriate Multi-Instance Learning (MIL) method for genome-wide protein function prediction under a usual assumption, the underlying distribution from testing data (target domain, i.e., TD) is the same as that from training data (source domain, i.e., SD). However, this assumption may be violated in real practice. To tackle this problem, in this paper, we propose a Multi-Instance Metric Transfer Learning (MIMTL) approach for genome-wide protein function prediction. In MIMTL, we first transfer the source domain distribution to the target domain distribution by utilizing the bag weights. Then, we construct a distance metric learning method with the reweighted bags. At last, we develop an alternative optimization scheme for MIMTL. Comprehensive experimental evidence on seven real-world organisms verifies the effectiveness and efficiency of the proposed MIMTL approach over several state-of-the-art methods.
format Online
Article
Text
id pubmed-5292966
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-52929662017-02-10 Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction Xu, Yonghui Min, Huaqing Wu, Qingyao Song, Hengjie Ye, Bicui Sci Rep Article Multi-Instance (MI) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with multiple instances. Many studies in this literature attempted to find an appropriate Multi-Instance Learning (MIL) method for genome-wide protein function prediction under a usual assumption, the underlying distribution from testing data (target domain, i.e., TD) is the same as that from training data (source domain, i.e., SD). However, this assumption may be violated in real practice. To tackle this problem, in this paper, we propose a Multi-Instance Metric Transfer Learning (MIMTL) approach for genome-wide protein function prediction. In MIMTL, we first transfer the source domain distribution to the target domain distribution by utilizing the bag weights. Then, we construct a distance metric learning method with the reweighted bags. At last, we develop an alternative optimization scheme for MIMTL. Comprehensive experimental evidence on seven real-world organisms verifies the effectiveness and efficiency of the proposed MIMTL approach over several state-of-the-art methods. Nature Publishing Group 2017-02-06 /pmc/articles/PMC5292966/ /pubmed/28165495 http://dx.doi.org/10.1038/srep41831 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Xu, Yonghui
Min, Huaqing
Wu, Qingyao
Song, Hengjie
Ye, Bicui
Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction
title Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction
title_full Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction
title_fullStr Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction
title_full_unstemmed Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction
title_short Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction
title_sort multi-instance metric transfer learning for genome-wide protein function prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5292966/
https://www.ncbi.nlm.nih.gov/pubmed/28165495
http://dx.doi.org/10.1038/srep41831
work_keys_str_mv AT xuyonghui multiinstancemetrictransferlearningforgenomewideproteinfunctionprediction
AT minhuaqing multiinstancemetrictransferlearningforgenomewideproteinfunctionprediction
AT wuqingyao multiinstancemetrictransferlearningforgenomewideproteinfunctionprediction
AT songhengjie multiinstancemetrictransferlearningforgenomewideproteinfunctionprediction
AT yebicui multiinstancemetrictransferlearningforgenomewideproteinfunctionprediction