Cargando…

Efficient differentially private learning improves drug sensitivity prediction

BACKGROUND: Users of a personalised recommendation system face a dilemma: recommendations can be improved by learning from data, but only if other users are willing to share their private information. Good personalised predictions are vitally important in precision medicine, but genomic information...

Descripción completa

Detalles Bibliográficos
Autores principales: Honkela, Antti, Das, Mrinal, Nieminen, Arttu, Dikmen, Onur, Kaski, Samuel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5801888/
https://www.ncbi.nlm.nih.gov/pubmed/29409513
http://dx.doi.org/10.1186/s13062-017-0203-4
_version_ 1783298429141647360
author Honkela, Antti
Das, Mrinal
Nieminen, Arttu
Dikmen, Onur
Kaski, Samuel
author_facet Honkela, Antti
Das, Mrinal
Nieminen, Arttu
Dikmen, Onur
Kaski, Samuel
author_sort Honkela, Antti
collection PubMed
description BACKGROUND: Users of a personalised recommendation system face a dilemma: recommendations can be improved by learning from data, but only if other users are willing to share their private information. Good personalised predictions are vitally important in precision medicine, but genomic information on which the predictions are based is also particularly sensitive, as it directly identifies the patients and hence cannot easily be anonymised. Differential privacy has emerged as a potentially promising solution: privacy is considered sufficient if presence of individual patients cannot be distinguished. However, differentially private learning with current methods does not improve predictions with feasible data sizes and dimensionalities. RESULTS: We show that useful predictors can be learned under powerful differential privacy guarantees, and even from moderately-sized data sets, by demonstrating significant improvements in the accuracy of private drug sensitivity prediction with a new robust private regression method. Our method matches the predictive accuracy of the state-of-the-art non-private lasso regression using only 4x more samples under relatively strong differential privacy guarantees. Good performance with limited data is achieved by limiting the sharing of private information by decreasing the dimensionality and by projecting outliers to fit tighter bounds, therefore needing to add less noise for equal privacy. CONCLUSIONS: The proposed differentially private regression method combines theoretical appeal and asymptotic efficiency with good prediction accuracy even with moderate-sized data. As already the simple-to-implement method shows promise on the challenging genomic data, we anticipate rapid progress towards practical applications in many fields. REVIEWERS: This article was reviewed by Zoltan Gaspari and David Kreil. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13062-017-0203-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5801888
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-58018882018-02-14 Efficient differentially private learning improves drug sensitivity prediction Honkela, Antti Das, Mrinal Nieminen, Arttu Dikmen, Onur Kaski, Samuel Biol Direct Research BACKGROUND: Users of a personalised recommendation system face a dilemma: recommendations can be improved by learning from data, but only if other users are willing to share their private information. Good personalised predictions are vitally important in precision medicine, but genomic information on which the predictions are based is also particularly sensitive, as it directly identifies the patients and hence cannot easily be anonymised. Differential privacy has emerged as a potentially promising solution: privacy is considered sufficient if presence of individual patients cannot be distinguished. However, differentially private learning with current methods does not improve predictions with feasible data sizes and dimensionalities. RESULTS: We show that useful predictors can be learned under powerful differential privacy guarantees, and even from moderately-sized data sets, by demonstrating significant improvements in the accuracy of private drug sensitivity prediction with a new robust private regression method. Our method matches the predictive accuracy of the state-of-the-art non-private lasso regression using only 4x more samples under relatively strong differential privacy guarantees. Good performance with limited data is achieved by limiting the sharing of private information by decreasing the dimensionality and by projecting outliers to fit tighter bounds, therefore needing to add less noise for equal privacy. CONCLUSIONS: The proposed differentially private regression method combines theoretical appeal and asymptotic efficiency with good prediction accuracy even with moderate-sized data. As already the simple-to-implement method shows promise on the challenging genomic data, we anticipate rapid progress towards practical applications in many fields. REVIEWERS: This article was reviewed by Zoltan Gaspari and David Kreil. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13062-017-0203-4) contains supplementary material, which is available to authorized users. BioMed Central 2018-02-06 /pmc/articles/PMC5801888/ /pubmed/29409513 http://dx.doi.org/10.1186/s13062-017-0203-4 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Honkela, Antti
Das, Mrinal
Nieminen, Arttu
Dikmen, Onur
Kaski, Samuel
Efficient differentially private learning improves drug sensitivity prediction
title Efficient differentially private learning improves drug sensitivity prediction
title_full Efficient differentially private learning improves drug sensitivity prediction
title_fullStr Efficient differentially private learning improves drug sensitivity prediction
title_full_unstemmed Efficient differentially private learning improves drug sensitivity prediction
title_short Efficient differentially private learning improves drug sensitivity prediction
title_sort efficient differentially private learning improves drug sensitivity prediction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5801888/
https://www.ncbi.nlm.nih.gov/pubmed/29409513
http://dx.doi.org/10.1186/s13062-017-0203-4
work_keys_str_mv AT honkelaantti efficientdifferentiallyprivatelearningimprovesdrugsensitivityprediction
AT dasmrinal efficientdifferentiallyprivatelearningimprovesdrugsensitivityprediction
AT nieminenarttu efficientdifferentiallyprivatelearningimprovesdrugsensitivityprediction
AT dikmenonur efficientdifferentiallyprivatelearningimprovesdrugsensitivityprediction
AT kaskisamuel efficientdifferentiallyprivatelearningimprovesdrugsensitivityprediction