
Multi-Source Selection Transfer Learning with Privacy-Preserving

Transfer learning can build a learner for a weakly labeled or unlabeled target domain by drawing on knowledge from a source domain, which can effectively improve performance on the target learning task. At present, the increased awareness of privacy protection restricts access to data so...


Bibliographic Details
Main author: Wu, Weifei
Format: Online Article Text
Language: English
Published: Springer US 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9077647/
https://www.ncbi.nlm.nih.gov/pubmed/35573261
http://dx.doi.org/10.1007/s11063-022-10841-6
_version_ 1784702154757373952
author Wu, Weifei
author_facet Wu, Weifei
author_sort Wu, Weifei
collection PubMed
description Transfer learning can build a learner for a weakly labeled or unlabeled target domain by drawing on knowledge from a source domain, which can effectively improve performance on the target learning task. At present, the increased awareness of privacy protection restricts access to data sources and poses new challenges to the development of transfer learning. However, research on privacy protection in transfer learning is very rare. Existing work mainly uses differential privacy and either does not consider the distribution difference between data sources or does not consider the conditional probability distribution of the data, which leads to negative transfer and degrades the algorithm's performance. Therefore, this paper proposes MultiSTLP, a multi-source selection transfer learning algorithm with privacy preservation, for scenarios in which the target domain contains unlabeled data sets with only a small amount of group probability information and the multiple source domains contain a large number of labeled data sets. Group probability means that the class label of each sample in the target data set is unknown, but the probability of each class within a given data group is available. Multiple source domains means that there are more than two source domains, so the data comprise more than two source-domain data sets and one target-domain data set. The algorithm adapts to differences in both the marginal and the conditional probability distributions between domains, and it protects the privacy of the target data and improves classification accuracy by fusing multi-source transfer learning and group probability into a support vector machine. At the same time, it can select representative data sets from the source domains, which improves efficiency by speeding up training. Experimental results on several real data sets show the effectiveness of MultiSTLP, and it also has some advantages compared with state-of-the-art transfer learning algorithms.
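The notion of group probability used in the description can be illustrated with a small sketch. This is not the MultiSTLP algorithm and is not taken from the paper; it only shows how per-group class proportions can be published while individual labels stay hidden. The function name `group_probabilities`, the group names, and the toy labels are all hypothetical.

```python
from collections import Counter


def group_probabilities(labels_by_group, classes):
    """Return, for each group, the fraction of its samples in each class.

    Only these fractions would be shared with a learner; the
    per-sample labels themselves stay private.
    """
    result = {}
    for group, labels in labels_by_group.items():
        if not labels:
            raise ValueError(f"group {group!r} has no samples")
        counts = Counter(labels)
        total = len(labels)
        # Counter returns 0 for classes that never occur in the group.
        result[group] = {c: counts[c] / total for c in classes}
    return result


# Hypothetical toy data: two groups with binary labels (+1 / -1).
hidden_labels = {
    "group_a": [1, 1, -1, 1],
    "group_b": [-1, -1, 1, -1, -1],
}
probs = group_probabilities(hidden_labels, classes=(1, -1))
for group, dist in probs.items():
    print(group, dist)
```

A learner in this setting would see only `probs`, never `hidden_labels`, which is the sense in which the target labels remain private.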
format Online
Article
Text
id pubmed-9077647
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-9077647 2022-05-09 Multi-Source Selection Transfer Learning with Privacy-Preserving Wu, Weifei Neural Process Lett Article Springer US 2022-05-07 2022 /pmc/articles/PMC9077647/ /pubmed/35573261 http://dx.doi.org/10.1007/s11063-022-10841-6 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Wu, Weifei
Multi-Source Selection Transfer Learning with Privacy-Preserving
title Multi-Source Selection Transfer Learning with Privacy-Preserving
title_full Multi-Source Selection Transfer Learning with Privacy-Preserving
title_fullStr Multi-Source Selection Transfer Learning with Privacy-Preserving
title_full_unstemmed Multi-Source Selection Transfer Learning with Privacy-Preserving
title_short Multi-Source Selection Transfer Learning with Privacy-Preserving
title_sort multi-source selection transfer learning with privacy-preserving
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9077647/
https://www.ncbi.nlm.nih.gov/pubmed/35573261
http://dx.doi.org/10.1007/s11063-022-10841-6
work_keys_str_mv AT wuweifei multisourceselectiontransferlearningwithprivacypreserving