Cargando…
Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction
Multi-Instance (MI) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with multiple instances. Many studies in this literature attempted to find an appropriate Multi-Instance Learning (MIL) method for genome-wi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5292966/ https://www.ncbi.nlm.nih.gov/pubmed/28165495 http://dx.doi.org/10.1038/srep41831 |
_version_ | 1782505020802990080 |
---|---|
author | Xu, Yonghui Min, Huaqing Wu, Qingyao Song, Hengjie Ye, Bicui |
author_facet | Xu, Yonghui Min, Huaqing Wu, Qingyao Song, Hengjie Ye, Bicui |
author_sort | Xu, Yonghui |
collection | PubMed |
description | Multi-Instance (MI) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with multiple instances. Many studies in this literature attempted to find an appropriate Multi-Instance Learning (MIL) method for genome-wide protein function prediction under a usual assumption, the underlying distribution from testing data (target domain, i.e., TD) is the same as that from training data (source domain, i.e., SD). However, this assumption may be violated in real practice. To tackle this problem, in this paper, we propose a Multi-Instance Metric Transfer Learning (MIMTL) approach for genome-wide protein function prediction. In MIMTL, we first transfer the source domain distribution to the target domain distribution by utilizing the bag weights. Then, we construct a distance metric learning method with the reweighted bags. At last, we develop an alternative optimization scheme for MIMTL. Comprehensive experimental evidence on seven real-world organisms verifies the effectiveness and efficiency of the proposed MIMTL approach over several state-of-the-art methods. |
format | Online Article Text |
id | pubmed-5292966 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-52929662017-02-10 Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction Xu, Yonghui Min, Huaqing Wu, Qingyao Song, Hengjie Ye, Bicui Sci Rep Article Multi-Instance (MI) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with multiple instances. Many studies in this literature attempted to find an appropriate Multi-Instance Learning (MIL) method for genome-wide protein function prediction under a usual assumption, the underlying distribution from testing data (target domain, i.e., TD) is the same as that from training data (source domain, i.e., SD). However, this assumption may be violated in real practice. To tackle this problem, in this paper, we propose a Multi-Instance Metric Transfer Learning (MIMTL) approach for genome-wide protein function prediction. In MIMTL, we first transfer the source domain distribution to the target domain distribution by utilizing the bag weights. Then, we construct a distance metric learning method with the reweighted bags. At last, we develop an alternative optimization scheme for MIMTL. Comprehensive experimental evidence on seven real-world organisms verifies the effectiveness and efficiency of the proposed MIMTL approach over several state-of-the-art methods. Nature Publishing Group 2017-02-06 /pmc/articles/PMC5292966/ /pubmed/28165495 http://dx.doi.org/10.1038/srep41831 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Xu, Yonghui Min, Huaqing Wu, Qingyao Song, Hengjie Ye, Bicui Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction |
title | Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction |
title_full | Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction |
title_fullStr | Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction |
title_full_unstemmed | Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction |
title_short | Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction |
title_sort | multi-instance metric transfer learning for genome-wide protein function prediction |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5292966/ https://www.ncbi.nlm.nih.gov/pubmed/28165495 http://dx.doi.org/10.1038/srep41831 |
work_keys_str_mv | AT xuyonghui multiinstancemetrictransferlearningforgenomewideproteinfunctionprediction AT minhuaqing multiinstancemetrictransferlearningforgenomewideproteinfunctionprediction AT wuqingyao multiinstancemetrictransferlearningforgenomewideproteinfunctionprediction AT songhengjie multiinstancemetrictransferlearningforgenomewideproteinfunctionprediction AT yebicui multiinstancemetrictransferlearningforgenomewideproteinfunctionprediction |