Cargando…
MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data
Deep learning has massive potential in predicting phenotype from different omics profiles. However, deep neural networks are viewed as black boxes, providing predictions without explanation. Therefore, the requirements for these models to become interpretable are increasing, especially in the medica...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353773/ https://www.ncbi.nlm.nih.gov/pubmed/35936793 http://dx.doi.org/10.3389/fmolb.2022.913602 |
_version_ | 1784762927134277632 |
---|---|
author | Albaradei, Somayah Albaradei, Abdurhman Alsaedi, Asim Uludag, Mahmut Thafar, Maha A. Gojobori, Takashi Essack, Magbubah Gao, Xin |
author_facet | Albaradei, Somayah Albaradei, Abdurhman Alsaedi, Asim Uludag, Mahmut Thafar, Maha A. Gojobori, Takashi Essack, Magbubah Gao, Xin |
author_sort | Albaradei, Somayah |
collection | PubMed |
description | Deep learning has massive potential in predicting phenotype from different omics profiles. However, deep neural networks are viewed as black boxes, providing predictions without explanation. Therefore, the requirements for these models to become interpretable are increasing, especially in the medical field. Here we propose a computational framework that takes the gene expression profile of any primary cancer sample and predicts whether patients’ samples are primary (localized) or metastasized to the brain, bone, lung, or liver based on deep learning architecture. Specifically, we first constructed an AutoEncoder framework to learn the non-linear relationship between genes, and then DeepLIFT was applied to calculate genes’ importance scores. Next, to mine the top essential genes that can distinguish the primary and metastasized tumors, we iteratively added ten top-ranked genes based upon their importance score to train a DNN model. Then we trained a final multi-class DNN that uses the output from the previous part as an input and predicts whether samples are primary or metastasized to the brain, bone, lung, or liver. The prediction performances ranged from AUC of 0.93–0.82. We further designed the model’s workflow to provide a second functionality beyond metastasis site prediction, i.e., to identify the biological functions that the DL model uses to perform the prediction. To our knowledge, this is the first multi-class DNN model developed for the generic prediction of metastasis to various sites. |
format | Online Article Text |
id | pubmed-9353773 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-93537732022-08-06 MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data Albaradei, Somayah Albaradei, Abdurhman Alsaedi, Asim Uludag, Mahmut Thafar, Maha A. Gojobori, Takashi Essack, Magbubah Gao, Xin Front Mol Biosci Molecular Biosciences Deep learning has massive potential in predicting phenotype from different omics profiles. However, deep neural networks are viewed as black boxes, providing predictions without explanation. Therefore, the requirements for these models to become interpretable are increasing, especially in the medical field. Here we propose a computational framework that takes the gene expression profile of any primary cancer sample and predicts whether patients’ samples are primary (localized) or metastasized to the brain, bone, lung, or liver based on deep learning architecture. Specifically, we first constructed an AutoEncoder framework to learn the non-linear relationship between genes, and then DeepLIFT was applied to calculate genes’ importance scores. Next, to mine the top essential genes that can distinguish the primary and metastasized tumors, we iteratively added ten top-ranked genes based upon their importance score to train a DNN model. Then we trained a final multi-class DNN that uses the output from the previous part as an input and predicts whether samples are primary or metastasized to the brain, bone, lung, or liver. The prediction performances ranged from AUC of 0.93–0.82. We further designed the model’s workflow to provide a second functionality beyond metastasis site prediction, i.e., to identify the biological functions that the DL model uses to perform the prediction. To our knowledge, this is the first multi-class DNN model developed for the generic prediction of metastasis to various sites. Frontiers Media S.A. 2022-07-22 /pmc/articles/PMC9353773/ /pubmed/35936793 http://dx.doi.org/10.3389/fmolb.2022.913602 Text en Copyright © 2022 Albaradei, Albaradei, Alsaedi, Uludag, Thafar, Gojobori, Essack and Gao. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Molecular Biosciences Albaradei, Somayah Albaradei, Abdurhman Alsaedi, Asim Uludag, Mahmut Thafar, Maha A. Gojobori, Takashi Essack, Magbubah Gao, Xin MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data |
title |
MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data |
title_full |
MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data |
title_fullStr |
MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data |
title_full_unstemmed |
MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data |
title_short |
MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data |
title_sort | metastasite: predicting metastasis to different sites using deep learning with gene expression data |
topic | Molecular Biosciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353773/ https://www.ncbi.nlm.nih.gov/pubmed/35936793 http://dx.doi.org/10.3389/fmolb.2022.913602 |
work_keys_str_mv | AT albaradeisomayah metastasitepredictingmetastasistodifferentsitesusingdeeplearningwithgeneexpressiondata AT albaradeiabdurhman metastasitepredictingmetastasistodifferentsitesusingdeeplearningwithgeneexpressiondata AT alsaediasim metastasitepredictingmetastasistodifferentsitesusingdeeplearningwithgeneexpressiondata AT uludagmahmut metastasitepredictingmetastasistodifferentsitesusingdeeplearningwithgeneexpressiondata AT thafarmahaa metastasitepredictingmetastasistodifferentsitesusingdeeplearningwithgeneexpressiondata AT gojoboritakashi metastasitepredictingmetastasistodifferentsitesusingdeeplearningwithgeneexpressiondata AT essackmagbubah metastasitepredictingmetastasistodifferentsitesusingdeeplearningwithgeneexpressiondata AT gaoxin metastasitepredictingmetastasistodifferentsitesusingdeeplearningwithgeneexpressiondata |