Cargando…
Secure tumor classification by shallow neural network using homomorphic encryption
BACKGROUND: Disclosure of patients’ genetic information in the process of applying machine learning techniques for tumor classification hinders the privacy of personal information. Homomorphic Encryption (HE), which supports operations between encrypted data, can be used as one of the tools to perfo...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8994372/ https://www.ncbi.nlm.nih.gov/pubmed/35395714 http://dx.doi.org/10.1186/s12864-022-08469-w |
_version_ | 1784684094305599488 |
---|---|
author | Hong, Seungwan Park, Jai Hyun Cho, Wonhee Choe, Hyeongmin Cheon, Jung Hee |
author_facet | Hong, Seungwan Park, Jai Hyun Cho, Wonhee Choe, Hyeongmin Cheon, Jung Hee |
author_sort | Hong, Seungwan |
collection | PubMed |
description | BACKGROUND: Disclosure of patients’ genetic information in the process of applying machine learning techniques for tumor classification hinders the privacy of personal information. Homomorphic Encryption (HE), which supports operations between encrypted data, can be used as one of the tools to perform such computation without information leakage, but it brings great challenges for directly applying general machine learning algorithms due to the limitations of operations supported by HE. In particular, non-polynomial activation functions, including softmax functions, are difficult to implement with HE and require a suitable approximation method to minimize the loss of accuracy. In the secure genome analysis competition called iDASH 2020, it is presented as a competition task that a multi-label tumor classification method that predicts the class of samples based on genetic information using HE. METHODS: We develop a secure multi-label tumor classification method using HE to ensure privacy during all the computations of the model inference process. Our solution is based on a 1-layer neural network with the softmax activation function model and uses the approximate HE scheme. We present an approximation method that enables softmax activation in the model using HE and a technique for efficiently encoding data to reduce computational costs. In addition, we propose a HE-friendly data filtering method to reduce the size of large-scale genetic data. RESULTS: We aim to analyze the dataset from The Cancer Genome Atlas (TCGA) dataset, which consists of 3,622 samples from 11 types of cancers, genetic features from 25,128 genes. Our preprocessing method reduces the number of genes to 4,096 or less and achieves a microAUC value of 0.9882 (85% accuracy) with a 1-layer shallow neural network. Using our model, we successfully compute the tumor classification inference steps on the encrypted test data in 3.75 minutes. As a result of exceptionally high microAUC values, our solution was awarded co-first place in iDASH 2020 Track 1: “Secure multi-label Tumor classification using Homomorphic Encryption”. CONCLUSIONS: Our solution is the first result of implementing a neural network model with softmax activation using HE. Also, HE optimization methods presented in this work enable machine learning implementation using HE or other challenging HE applications. |
format | Online Article Text |
id | pubmed-8994372 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-89943722022-04-10 Secure tumor classification by shallow neural network using homomorphic encryption Hong, Seungwan Park, Jai Hyun Cho, Wonhee Choe, Hyeongmin Cheon, Jung Hee BMC Genomics Research BACKGROUND: Disclosure of patients’ genetic information in the process of applying machine learning techniques for tumor classification hinders the privacy of personal information. Homomorphic Encryption (HE), which supports operations between encrypted data, can be used as one of the tools to perform such computation without information leakage, but it brings great challenges for directly applying general machine learning algorithms due to the limitations of operations supported by HE. In particular, non-polynomial activation functions, including softmax functions, are difficult to implement with HE and require a suitable approximation method to minimize the loss of accuracy. In the secure genome analysis competition called iDASH 2020, it is presented as a competition task that a multi-label tumor classification method that predicts the class of samples based on genetic information using HE. METHODS: We develop a secure multi-label tumor classification method using HE to ensure privacy during all the computations of the model inference process. Our solution is based on a 1-layer neural network with the softmax activation function model and uses the approximate HE scheme. We present an approximation method that enables softmax activation in the model using HE and a technique for efficiently encoding data to reduce computational costs. In addition, we propose a HE-friendly data filtering method to reduce the size of large-scale genetic data. RESULTS: We aim to analyze the dataset from The Cancer Genome Atlas (TCGA) dataset, which consists of 3,622 samples from 11 types of cancers, genetic features from 25,128 genes. Our preprocessing method reduces the number of genes to 4,096 or less and achieves a microAUC value of 0.9882 (85% accuracy) with a 1-layer shallow neural network. Using our model, we successfully compute the tumor classification inference steps on the encrypted test data in 3.75 minutes. As a result of exceptionally high microAUC values, our solution was awarded co-first place in iDASH 2020 Track 1: “Secure multi-label Tumor classification using Homomorphic Encryption”. CONCLUSIONS: Our solution is the first result of implementing a neural network model with softmax activation using HE. Also, HE optimization methods presented in this work enable machine learning implementation using HE or other challenging HE applications. BioMed Central 2022-04-09 /pmc/articles/PMC8994372/ /pubmed/35395714 http://dx.doi.org/10.1186/s12864-022-08469-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Hong, Seungwan Park, Jai Hyun Cho, Wonhee Choe, Hyeongmin Cheon, Jung Hee Secure tumor classification by shallow neural network using homomorphic encryption |
title | Secure tumor classification by shallow neural network using homomorphic encryption |
title_full | Secure tumor classification by shallow neural network using homomorphic encryption |
title_fullStr | Secure tumor classification by shallow neural network using homomorphic encryption |
title_full_unstemmed | Secure tumor classification by shallow neural network using homomorphic encryption |
title_short | Secure tumor classification by shallow neural network using homomorphic encryption |
title_sort | secure tumor classification by shallow neural network using homomorphic encryption |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8994372/ https://www.ncbi.nlm.nih.gov/pubmed/35395714 http://dx.doi.org/10.1186/s12864-022-08469-w |
work_keys_str_mv | AT hongseungwan securetumorclassificationbyshallowneuralnetworkusinghomomorphicencryption AT parkjaihyun securetumorclassificationbyshallowneuralnetworkusinghomomorphicencryption AT chowonhee securetumorclassificationbyshallowneuralnetworkusinghomomorphicencryption AT choehyeongmin securetumorclassificationbyshallowneuralnetworkusinghomomorphicencryption AT cheonjunghee securetumorclassificationbyshallowneuralnetworkusinghomomorphicencryption |