Cargando…

Secure tumor classification by shallow neural network using homomorphic encryption

BACKGROUND: Disclosure of patients’ genetic information in the process of applying machine learning techniques for tumor classification hinders the privacy of personal information. Homomorphic Encryption (HE), which supports operations between encrypted data, can be used as one of the tools to perfo...

Descripción completa

Detalles Bibliográficos
Autores principales: Hong, Seungwan, Park, Jai Hyun, Cho, Wonhee, Choe, Hyeongmin, Cheon, Jung Hee
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8994372/
https://www.ncbi.nlm.nih.gov/pubmed/35395714
http://dx.doi.org/10.1186/s12864-022-08469-w
_version_ 1784684094305599488
author Hong, Seungwan
Park, Jai Hyun
Cho, Wonhee
Choe, Hyeongmin
Cheon, Jung Hee
author_facet Hong, Seungwan
Park, Jai Hyun
Cho, Wonhee
Choe, Hyeongmin
Cheon, Jung Hee
author_sort Hong, Seungwan
collection PubMed
description BACKGROUND: Disclosure of patients’ genetic information in the process of applying machine learning techniques for tumor classification hinders the privacy of personal information. Homomorphic Encryption (HE), which supports operations between encrypted data, can be used as one of the tools to perform such computation without information leakage, but it brings great challenges for directly applying general machine learning algorithms due to the limitations of operations supported by HE. In particular, non-polynomial activation functions, including softmax functions, are difficult to implement with HE and require a suitable approximation method to minimize the loss of accuracy. In the secure genome analysis competition called iDASH 2020, it is presented as a competition task that a multi-label tumor classification method that predicts the class of samples based on genetic information using HE. METHODS: We develop a secure multi-label tumor classification method using HE to ensure privacy during all the computations of the model inference process. Our solution is based on a 1-layer neural network with the softmax activation function model and uses the approximate HE scheme. We present an approximation method that enables softmax activation in the model using HE and a technique for efficiently encoding data to reduce computational costs. In addition, we propose a HE-friendly data filtering method to reduce the size of large-scale genetic data. RESULTS: We aim to analyze the dataset from The Cancer Genome Atlas (TCGA) dataset, which consists of 3,622 samples from 11 types of cancers, genetic features from 25,128 genes. Our preprocessing method reduces the number of genes to 4,096 or less and achieves a microAUC value of 0.9882 (85% accuracy) with a 1-layer shallow neural network. Using our model, we successfully compute the tumor classification inference steps on the encrypted test data in 3.75 minutes. As a result of exceptionally high microAUC values, our solution was awarded co-first place in iDASH 2020 Track 1: “Secure multi-label Tumor classification using Homomorphic Encryption”. CONCLUSIONS: Our solution is the first result of implementing a neural network model with softmax activation using HE. Also, HE optimization methods presented in this work enable machine learning implementation using HE or other challenging HE applications.
format Online
Article
Text
id pubmed-8994372
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-89943722022-04-10 Secure tumor classification by shallow neural network using homomorphic encryption Hong, Seungwan Park, Jai Hyun Cho, Wonhee Choe, Hyeongmin Cheon, Jung Hee BMC Genomics Research BACKGROUND: Disclosure of patients’ genetic information in the process of applying machine learning techniques for tumor classification hinders the privacy of personal information. Homomorphic Encryption (HE), which supports operations between encrypted data, can be used as one of the tools to perform such computation without information leakage, but it brings great challenges for directly applying general machine learning algorithms due to the limitations of operations supported by HE. In particular, non-polynomial activation functions, including softmax functions, are difficult to implement with HE and require a suitable approximation method to minimize the loss of accuracy. In the secure genome analysis competition called iDASH 2020, it is presented as a competition task that a multi-label tumor classification method that predicts the class of samples based on genetic information using HE. METHODS: We develop a secure multi-label tumor classification method using HE to ensure privacy during all the computations of the model inference process. Our solution is based on a 1-layer neural network with the softmax activation function model and uses the approximate HE scheme. We present an approximation method that enables softmax activation in the model using HE and a technique for efficiently encoding data to reduce computational costs. In addition, we propose a HE-friendly data filtering method to reduce the size of large-scale genetic data. RESULTS: We aim to analyze the dataset from The Cancer Genome Atlas (TCGA) dataset, which consists of 3,622 samples from 11 types of cancers, genetic features from 25,128 genes. Our preprocessing method reduces the number of genes to 4,096 or less and achieves a microAUC value of 0.9882 (85% accuracy) with a 1-layer shallow neural network. Using our model, we successfully compute the tumor classification inference steps on the encrypted test data in 3.75 minutes. As a result of exceptionally high microAUC values, our solution was awarded co-first place in iDASH 2020 Track 1: “Secure multi-label Tumor classification using Homomorphic Encryption”. CONCLUSIONS: Our solution is the first result of implementing a neural network model with softmax activation using HE. Also, HE optimization methods presented in this work enable machine learning implementation using HE or other challenging HE applications. BioMed Central 2022-04-09 /pmc/articles/PMC8994372/ /pubmed/35395714 http://dx.doi.org/10.1186/s12864-022-08469-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Hong, Seungwan
Park, Jai Hyun
Cho, Wonhee
Choe, Hyeongmin
Cheon, Jung Hee
Secure tumor classification by shallow neural network using homomorphic encryption
title Secure tumor classification by shallow neural network using homomorphic encryption
title_full Secure tumor classification by shallow neural network using homomorphic encryption
title_fullStr Secure tumor classification by shallow neural network using homomorphic encryption
title_full_unstemmed Secure tumor classification by shallow neural network using homomorphic encryption
title_short Secure tumor classification by shallow neural network using homomorphic encryption
title_sort secure tumor classification by shallow neural network using homomorphic encryption
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8994372/
https://www.ncbi.nlm.nih.gov/pubmed/35395714
http://dx.doi.org/10.1186/s12864-022-08469-w
work_keys_str_mv AT hongseungwan securetumorclassificationbyshallowneuralnetworkusinghomomorphicencryption
AT parkjaihyun securetumorclassificationbyshallowneuralnetworkusinghomomorphicencryption
AT chowonhee securetumorclassificationbyshallowneuralnetworkusinghomomorphicencryption
AT choehyeongmin securetumorclassificationbyshallowneuralnetworkusinghomomorphicencryption
AT cheonjunghee securetumorclassificationbyshallowneuralnetworkusinghomomorphicencryption