Cargando…

Secure Logistic Regression Based on Homomorphic Encryption: Design and Evaluation

BACKGROUND: Learning a model without accessing raw data has been an intriguing idea to security and machine learning researchers for years. In an ideal setting, we want to encrypt sensitive data to store them on a commercial cloud and run certain analyses without ever decrypting the data to preserve...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Miran, Song, Yongsoo, Wang, Shuang, Xia, Yuhou, Jiang, Xiaoqian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5930176/
https://www.ncbi.nlm.nih.gov/pubmed/29666041
http://dx.doi.org/10.2196/medinform.8805
_version_ 1783319454344544256
author Kim, Miran
Song, Yongsoo
Wang, Shuang
Xia, Yuhou
Jiang, Xiaoqian
author_facet Kim, Miran
Song, Yongsoo
Wang, Shuang
Xia, Yuhou
Jiang, Xiaoqian
author_sort Kim, Miran
collection PubMed
description BACKGROUND: Learning a model without accessing raw data has been an intriguing idea to security and machine learning researchers for years. In an ideal setting, we want to encrypt sensitive data to store them on a commercial cloud and run certain analyses without ever decrypting the data to preserve privacy. Homomorphic encryption technique is a promising candidate for secure data outsourcing, but it is a very challenging task to support real-world machine learning tasks. Existing frameworks can only handle simplified cases with low-degree polynomials such as linear means classifier and linear discriminative analysis. OBJECTIVE: The goal of this study is to provide a practical support to the mainstream learning models (eg, logistic regression). METHODS: We adapted a novel homomorphic encryption scheme optimized for real numbers computation. We devised (1) the least squares approximation of the logistic function for accuracy and efficiency (ie, reduce computation cost) and (2) new packing and parallelization techniques. RESULTS: Using real-world datasets, we evaluated the performance of our model and demonstrated its feasibility in speed and memory consumption. For example, it took approximately 116 minutes to obtain the training model from the homomorphically encrypted Edinburgh dataset. In addition, it gives fairly accurate predictions on the testing dataset. CONCLUSIONS: We present the first homomorphically encrypted logistic regression outsourcing model based on the critical observation that the precision loss of classification models is sufficiently small so that the decision plan stays still.
format Online
Article
Text
id pubmed-5930176
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-59301762018-05-09 Secure Logistic Regression Based on Homomorphic Encryption: Design and Evaluation Kim, Miran Song, Yongsoo Wang, Shuang Xia, Yuhou Jiang, Xiaoqian JMIR Med Inform Original Paper BACKGROUND: Learning a model without accessing raw data has been an intriguing idea to security and machine learning researchers for years. In an ideal setting, we want to encrypt sensitive data to store them on a commercial cloud and run certain analyses without ever decrypting the data to preserve privacy. Homomorphic encryption technique is a promising candidate for secure data outsourcing, but it is a very challenging task to support real-world machine learning tasks. Existing frameworks can only handle simplified cases with low-degree polynomials such as linear means classifier and linear discriminative analysis. OBJECTIVE: The goal of this study is to provide a practical support to the mainstream learning models (eg, logistic regression). METHODS: We adapted a novel homomorphic encryption scheme optimized for real numbers computation. We devised (1) the least squares approximation of the logistic function for accuracy and efficiency (ie, reduce computation cost) and (2) new packing and parallelization techniques. RESULTS: Using real-world datasets, we evaluated the performance of our model and demonstrated its feasibility in speed and memory consumption. For example, it took approximately 116 minutes to obtain the training model from the homomorphically encrypted Edinburgh dataset. In addition, it gives fairly accurate predictions on the testing dataset. CONCLUSIONS: We present the first homomorphically encrypted logistic regression outsourcing model based on the critical observation that the precision loss of classification models is sufficiently small so that the decision plan stays still. JMIR Publications 2018-04-17 /pmc/articles/PMC5930176/ /pubmed/29666041 http://dx.doi.org/10.2196/medinform.8805 Text en ©Miran Kim, Yongsoo Song, Shuang Wang, Yuhou Xia, Xiaoqian Jiang. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 17.04.2018. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Kim, Miran
Song, Yongsoo
Wang, Shuang
Xia, Yuhou
Jiang, Xiaoqian
Secure Logistic Regression Based on Homomorphic Encryption: Design and Evaluation
title Secure Logistic Regression Based on Homomorphic Encryption: Design and Evaluation
title_full Secure Logistic Regression Based on Homomorphic Encryption: Design and Evaluation
title_fullStr Secure Logistic Regression Based on Homomorphic Encryption: Design and Evaluation
title_full_unstemmed Secure Logistic Regression Based on Homomorphic Encryption: Design and Evaluation
title_short Secure Logistic Regression Based on Homomorphic Encryption: Design and Evaluation
title_sort secure logistic regression based on homomorphic encryption: design and evaluation
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5930176/
https://www.ncbi.nlm.nih.gov/pubmed/29666041
http://dx.doi.org/10.2196/medinform.8805
work_keys_str_mv AT kimmiran securelogisticregressionbasedonhomomorphicencryptiondesignandevaluation
AT songyongsoo securelogisticregressionbasedonhomomorphicencryptiondesignandevaluation
AT wangshuang securelogisticregressionbasedonhomomorphicencryptiondesignandevaluation
AT xiayuhou securelogisticregressionbasedonhomomorphicencryptiondesignandevaluation
AT jiangxiaoqian securelogisticregressionbasedonhomomorphicencryptiondesignandevaluation