Cargando…

DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues

BACKGROUND: P4 medicine (predict, prevent, personalize, and participate) is a new approach to diagnosing and predicting diseases on a patient-by-patient basis. For the prevention and treatment of diseases, prediction plays a fundamental role. One of the intelligent strategies is the design of deep l...

Descripción completa

Detalles Bibliográficos
Autores principales: Mahdi-Esferizi, Roohallah, Haji Molla Hoseyni, Behnaz, Mehrpanah, Amir, Golzade, Yazdan, Najafi, Ali, Elahian, Fatemeh, Zadeh Shirazi, Amin, Gomez, Guillermo A., Tahmasebian, Shahram
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10320882/
https://www.ncbi.nlm.nih.gov/pubmed/37403016
http://dx.doi.org/10.1186/s12859-023-05400-2
_version_ 1785068520562753536
author Mahdi-Esferizi, Roohallah
Haji Molla Hoseyni, Behnaz
Mehrpanah, Amir
Golzade, Yazdan
Najafi, Ali
Elahian, Fatemeh
Zadeh Shirazi, Amin
Gomez, Guillermo A.
Tahmasebian, Shahram
author_facet Mahdi-Esferizi, Roohallah
Haji Molla Hoseyni, Behnaz
Mehrpanah, Amir
Golzade, Yazdan
Najafi, Ali
Elahian, Fatemeh
Zadeh Shirazi, Amin
Gomez, Guillermo A.
Tahmasebian, Shahram
author_sort Mahdi-Esferizi, Roohallah
collection PubMed
description BACKGROUND: P4 medicine (predict, prevent, personalize, and participate) is a new approach to diagnosing and predicting diseases on a patient-by-patient basis. For the prevention and treatment of diseases, prediction plays a fundamental role. One of the intelligent strategies is the design of deep learning models that can predict the state of the disease using gene expression data. RESULTS: We create an autoencoder deep learning model called DeeP4med, including a Classifier and a Transferor that predicts cancer's gene expression (mRNA) matrix from its matched normal sample and vice versa. The range of the F1 score of the model, depending on tissue type in the Classifier, is from 0.935 to 0.999 and in Transferor from 0.944 to 0.999. The accuracy of DeeP4med for tissue and disease classification was 0.986 and 0.992, respectively, which performed better compared to seven classic machine learning models (Support Vector Classifier, Logistic Regression, Linear Discriminant Analysis, Naive Bayes, Decision Tree, Random Forest, K Nearest Neighbors). CONCLUSIONS: Based on the idea of DeeP4med, by having the gene expression matrix of a normal tissue, we can predict its tumor gene expression matrix and, in this way, find effective genes in transforming a normal tissue into a tumor tissue. Results of Differentially Expressed Genes (DEGs) and enrichment analysis on the predicted matrices for 13 types of cancer showed a good correlation with the literature and biological databases. This led that by using the gene expression matrix, to train the model with features of each person in a normal and cancer state, this model could predict diagnosis based on gene expression data from healthy tissue and be used to identify possible therapeutic interventions for those patients. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05400-2.
format Online
Article
Text
id pubmed-10320882
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-103208822023-07-06 DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues Mahdi-Esferizi, Roohallah Haji Molla Hoseyni, Behnaz Mehrpanah, Amir Golzade, Yazdan Najafi, Ali Elahian, Fatemeh Zadeh Shirazi, Amin Gomez, Guillermo A. Tahmasebian, Shahram BMC Bioinformatics Research BACKGROUND: P4 medicine (predict, prevent, personalize, and participate) is a new approach to diagnosing and predicting diseases on a patient-by-patient basis. For the prevention and treatment of diseases, prediction plays a fundamental role. One of the intelligent strategies is the design of deep learning models that can predict the state of the disease using gene expression data. RESULTS: We create an autoencoder deep learning model called DeeP4med, including a Classifier and a Transferor that predicts cancer's gene expression (mRNA) matrix from its matched normal sample and vice versa. The range of the F1 score of the model, depending on tissue type in the Classifier, is from 0.935 to 0.999 and in Transferor from 0.944 to 0.999. The accuracy of DeeP4med for tissue and disease classification was 0.986 and 0.992, respectively, which performed better compared to seven classic machine learning models (Support Vector Classifier, Logistic Regression, Linear Discriminant Analysis, Naive Bayes, Decision Tree, Random Forest, K Nearest Neighbors). CONCLUSIONS: Based on the idea of DeeP4med, by having the gene expression matrix of a normal tissue, we can predict its tumor gene expression matrix and, in this way, find effective genes in transforming a normal tissue into a tumor tissue. Results of Differentially Expressed Genes (DEGs) and enrichment analysis on the predicted matrices for 13 types of cancer showed a good correlation with the literature and biological databases. This led that by using the gene expression matrix, to train the model with features of each person in a normal and cancer state, this model could predict diagnosis based on gene expression data from healthy tissue and be used to identify possible therapeutic interventions for those patients. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05400-2. BioMed Central 2023-07-04 /pmc/articles/PMC10320882/ /pubmed/37403016 http://dx.doi.org/10.1186/s12859-023-05400-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Mahdi-Esferizi, Roohallah
Haji Molla Hoseyni, Behnaz
Mehrpanah, Amir
Golzade, Yazdan
Najafi, Ali
Elahian, Fatemeh
Zadeh Shirazi, Amin
Gomez, Guillermo A.
Tahmasebian, Shahram
DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues
title DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues
title_full DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues
title_fullStr DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues
title_full_unstemmed DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues
title_short DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues
title_sort deep4med: deep learning for p4 medicine to predict normal and cancer transcriptome in multiple human tissues
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10320882/
https://www.ncbi.nlm.nih.gov/pubmed/37403016
http://dx.doi.org/10.1186/s12859-023-05400-2
work_keys_str_mv AT mahdiesferiziroohallah deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues
AT hajimollahoseynibehnaz deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues
AT mehrpanahamir deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues
AT golzadeyazdan deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues
AT najafiali deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues
AT elahianfatemeh deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues
AT zadehshiraziamin deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues
AT gomezguillermoa deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues
AT tahmasebianshahram deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues