Cargando…

MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data

Deep learning has massive potential in predicting phenotype from different omics profiles. However, deep neural networks are viewed as black boxes, providing predictions without explanation. Therefore, the requirements for these models to become interpretable are increasing, especially in the medica...

Descripción completa

Detalles Bibliográficos
Autores principales: Albaradei, Somayah, Albaradei, Abdurhman, Alsaedi, Asim, Uludag, Mahmut, Thafar, Maha A., Gojobori, Takashi, Essack, Magbubah, Gao, Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353773/
https://www.ncbi.nlm.nih.gov/pubmed/35936793
http://dx.doi.org/10.3389/fmolb.2022.913602
_version_ 1784762927134277632
author Albaradei, Somayah
Albaradei, Abdurhman
Alsaedi, Asim
Uludag, Mahmut
Thafar, Maha A.
Gojobori, Takashi
Essack, Magbubah
Gao, Xin
author_facet Albaradei, Somayah
Albaradei, Abdurhman
Alsaedi, Asim
Uludag, Mahmut
Thafar, Maha A.
Gojobori, Takashi
Essack, Magbubah
Gao, Xin
author_sort Albaradei, Somayah
collection PubMed
description Deep learning has massive potential in predicting phenotype from different omics profiles. However, deep neural networks are viewed as black boxes, providing predictions without explanation. Therefore, the requirements for these models to become interpretable are increasing, especially in the medical field. Here we propose a computational framework that takes the gene expression profile of any primary cancer sample and predicts whether patients’ samples are primary (localized) or metastasized to the brain, bone, lung, or liver based on deep learning architecture. Specifically, we first constructed an AutoEncoder framework to learn the non-linear relationship between genes, and then DeepLIFT was applied to calculate genes’ importance scores. Next, to mine the top essential genes that can distinguish the primary and metastasized tumors, we iteratively added ten top-ranked genes based upon their importance score to train a DNN model. Then we trained a final multi-class DNN that uses the output from the previous part as an input and predicts whether samples are primary or metastasized to the brain, bone, lung, or liver. The prediction performances ranged from AUC of 0.93–0.82. We further designed the model’s workflow to provide a second functionality beyond metastasis site prediction, i.e., to identify the biological functions that the DL model uses to perform the prediction. To our knowledge, this is the first multi-class DNN model developed for the generic prediction of metastasis to various sites.
format Online
Article
Text
id pubmed-9353773
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-93537732022-08-06 MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data Albaradei, Somayah Albaradei, Abdurhman Alsaedi, Asim Uludag, Mahmut Thafar, Maha A. Gojobori, Takashi Essack, Magbubah Gao, Xin Front Mol Biosci Molecular Biosciences Deep learning has massive potential in predicting phenotype from different omics profiles. However, deep neural networks are viewed as black boxes, providing predictions without explanation. Therefore, the requirements for these models to become interpretable are increasing, especially in the medical field. Here we propose a computational framework that takes the gene expression profile of any primary cancer sample and predicts whether patients’ samples are primary (localized) or metastasized to the brain, bone, lung, or liver based on deep learning architecture. Specifically, we first constructed an AutoEncoder framework to learn the non-linear relationship between genes, and then DeepLIFT was applied to calculate genes’ importance scores. Next, to mine the top essential genes that can distinguish the primary and metastasized tumors, we iteratively added ten top-ranked genes based upon their importance score to train a DNN model. Then we trained a final multi-class DNN that uses the output from the previous part as an input and predicts whether samples are primary or metastasized to the brain, bone, lung, or liver. The prediction performances ranged from AUC of 0.93–0.82. We further designed the model’s workflow to provide a second functionality beyond metastasis site prediction, i.e., to identify the biological functions that the DL model uses to perform the prediction. To our knowledge, this is the first multi-class DNN model developed for the generic prediction of metastasis to various sites. Frontiers Media S.A. 2022-07-22 /pmc/articles/PMC9353773/ /pubmed/35936793 http://dx.doi.org/10.3389/fmolb.2022.913602 Text en Copyright © 2022 Albaradei, Albaradei, Alsaedi, Uludag, Thafar, Gojobori, Essack and Gao. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Molecular Biosciences
Albaradei, Somayah
Albaradei, Abdurhman
Alsaedi, Asim
Uludag, Mahmut
Thafar, Maha A.
Gojobori, Takashi
Essack, Magbubah
Gao, Xin
MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data
title MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data
title_full MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data
title_fullStr MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data
title_full_unstemmed MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data
title_short MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data
title_sort metastasite: predicting metastasis to different sites using deep learning with gene expression data
topic Molecular Biosciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353773/
https://www.ncbi.nlm.nih.gov/pubmed/35936793
http://dx.doi.org/10.3389/fmolb.2022.913602
work_keys_str_mv AT albaradeisomayah metastasitepredictingmetastasistodifferentsitesusingdeeplearningwithgeneexpressiondata
AT albaradeiabdurhman metastasitepredictingmetastasistodifferentsitesusingdeeplearningwithgeneexpressiondata
AT alsaediasim metastasitepredictingmetastasistodifferentsitesusingdeeplearningwithgeneexpressiondata
AT uludagmahmut metastasitepredictingmetastasistodifferentsitesusingdeeplearningwithgeneexpressiondata
AT thafarmahaa metastasitepredictingmetastasistodifferentsitesusingdeeplearningwithgeneexpressiondata
AT gojoboritakashi metastasitepredictingmetastasistodifferentsitesusingdeeplearningwithgeneexpressiondata
AT essackmagbubah metastasitepredictingmetastasistodifferentsitesusingdeeplearningwithgeneexpressiondata
AT gaoxin metastasitepredictingmetastasistodifferentsitesusingdeeplearningwithgeneexpressiondata