Cargando…

Secure tumor classification by shallow neural network using homomorphic encryption

BACKGROUND: Disclosure of patients’ genetic information in the process of applying machine learning techniques for tumor classification hinders the privacy of personal information. Homomorphic Encryption (HE), which supports operations between encrypted data, can be used as one of the tools to perfo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Hong, Seungwan, Park, Jai Hyun, Cho, Wonhee, Choe, Hyeongmin, Cheon, Jung Hee
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2022
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8994372/ https://www.ncbi.nlm.nih.gov/pubmed/35395714 http://dx.doi.org/10.1186/s12864-022-08469-w

_version_	1784684094305599488
author	Hong, Seungwan Park, Jai Hyun Cho, Wonhee Choe, Hyeongmin Cheon, Jung Hee
author_facet	Hong, Seungwan Park, Jai Hyun Cho, Wonhee Choe, Hyeongmin Cheon, Jung Hee
author_sort	Hong, Seungwan
collection	PubMed
description	BACKGROUND: Disclosure of patients’ genetic information in the process of applying machine learning techniques for tumor classification hinders the privacy of personal information. Homomorphic Encryption (HE), which supports operations between encrypted data, can be used as one of the tools to perform such computation without information leakage, but it brings great challenges for directly applying general machine learning algorithms due to the limitations of operations supported by HE. In particular, non-polynomial activation functions, including softmax functions, are difficult to implement with HE and require a suitable approximation method to minimize the loss of accuracy. In the secure genome analysis competition called iDASH 2020, it is presented as a competition task that a multi-label tumor classification method that predicts the class of samples based on genetic information using HE. METHODS: We develop a secure multi-label tumor classification method using HE to ensure privacy during all the computations of the model inference process. Our solution is based on a 1-layer neural network with the softmax activation function model and uses the approximate HE scheme. We present an approximation method that enables softmax activation in the model using HE and a technique for efficiently encoding data to reduce computational costs. In addition, we propose a HE-friendly data filtering method to reduce the size of large-scale genetic data. RESULTS: We aim to analyze the dataset from The Cancer Genome Atlas (TCGA) dataset, which consists of 3,622 samples from 11 types of cancers, genetic features from 25,128 genes. Our preprocessing method reduces the number of genes to 4,096 or less and achieves a microAUC value of 0.9882 (85% accuracy) with a 1-layer shallow neural network. Using our model, we successfully compute the tumor classification inference steps on the encrypted test data in 3.75 minutes. As a result of exceptionally high microAUC values, our solution was awarded co-first place in iDASH 2020 Track 1: “Secure multi-label Tumor classification using Homomorphic Encryption”. CONCLUSIONS: Our solution is the first result of implementing a neural network model with softmax activation using HE. Also, HE optimization methods presented in this work enable machine learning implementation using HE or other challenging HE applications.
format	Online Article Text
id	pubmed-8994372
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-89943722022-04-10 Secure tumor classification by shallow neural network using homomorphic encryption Hong, Seungwan Park, Jai Hyun Cho, Wonhee Choe, Hyeongmin Cheon, Jung Hee BMC Genomics Research BACKGROUND: Disclosure of patients’ genetic information in the process of applying machine learning techniques for tumor classification hinders the privacy of personal information. Homomorphic Encryption (HE), which supports operations between encrypted data, can be used as one of the tools to perform such computation without information leakage, but it brings great challenges for directly applying general machine learning algorithms due to the limitations of operations supported by HE. In particular, non-polynomial activation functions, including softmax functions, are difficult to implement with HE and require a suitable approximation method to minimize the loss of accuracy. In the secure genome analysis competition called iDASH 2020, it is presented as a competition task that a multi-label tumor classification method that predicts the class of samples based on genetic information using HE. METHODS: We develop a secure multi-label tumor classification method using HE to ensure privacy during all the computations of the model inference process. Our solution is based on a 1-layer neural network with the softmax activation function model and uses the approximate HE scheme. We present an approximation method that enables softmax activation in the model using HE and a technique for efficiently encoding data to reduce computational costs. In addition, we propose a HE-friendly data filtering method to reduce the size of large-scale genetic data. RESULTS: We aim to analyze the dataset from The Cancer Genome Atlas (TCGA) dataset, which consists of 3,622 samples from 11 types of cancers, genetic features from 25,128 genes. Our preprocessing method reduces the number of genes to 4,096 or less and achieves a microAUC value of 0.9882 (85% accuracy) with a 1-layer shallow neural network. Using our model, we successfully compute the tumor classification inference steps on the encrypted test data in 3.75 minutes. As a result of exceptionally high microAUC values, our solution was awarded co-first place in iDASH 2020 Track 1: “Secure multi-label Tumor classification using Homomorphic Encryption”. CONCLUSIONS: Our solution is the first result of implementing a neural network model with softmax activation using HE. Also, HE optimization methods presented in this work enable machine learning implementation using HE or other challenging HE applications. BioMed Central 2022-04-09 /pmc/articles/PMC8994372/ /pubmed/35395714 http://dx.doi.org/10.1186/s12864-022-08469-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Hong, Seungwan Park, Jai Hyun Cho, Wonhee Choe, Hyeongmin Cheon, Jung Hee Secure tumor classification by shallow neural network using homomorphic encryption
title	Secure tumor classification by shallow neural network using homomorphic encryption
title_full	Secure tumor classification by shallow neural network using homomorphic encryption
title_fullStr	Secure tumor classification by shallow neural network using homomorphic encryption
title_full_unstemmed	Secure tumor classification by shallow neural network using homomorphic encryption
title_short	Secure tumor classification by shallow neural network using homomorphic encryption
title_sort	secure tumor classification by shallow neural network using homomorphic encryption
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8994372/ https://www.ncbi.nlm.nih.gov/pubmed/35395714 http://dx.doi.org/10.1186/s12864-022-08469-w
work_keys_str_mv	AT hongseungwan securetumorclassificationbyshallowneuralnetworkusinghomomorphicencryption AT parkjaihyun securetumorclassificationbyshallowneuralnetworkusinghomomorphicencryption AT chowonhee securetumorclassificationbyshallowneuralnetworkusinghomomorphicencryption AT choehyeongmin securetumorclassificationbyshallowneuralnetworkusinghomomorphicencryption AT cheonjunghee securetumorclassificationbyshallowneuralnetworkusinghomomorphicencryption

Secure tumor classification by shallow neural network using homomorphic encryption

Ejemplares similares