Cargando…
DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues
BACKGROUND: P4 medicine (predict, prevent, personalize, and participate) is a new approach to diagnosing and predicting diseases on a patient-by-patient basis. For the prevention and treatment of diseases, prediction plays a fundamental role. One of the intelligent strategies is the design of deep l...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10320882/ https://www.ncbi.nlm.nih.gov/pubmed/37403016 http://dx.doi.org/10.1186/s12859-023-05400-2 |
_version_ | 1785068520562753536 |
---|---|
author | Mahdi-Esferizi, Roohallah Haji Molla Hoseyni, Behnaz Mehrpanah, Amir Golzade, Yazdan Najafi, Ali Elahian, Fatemeh Zadeh Shirazi, Amin Gomez, Guillermo A. Tahmasebian, Shahram |
author_facet | Mahdi-Esferizi, Roohallah Haji Molla Hoseyni, Behnaz Mehrpanah, Amir Golzade, Yazdan Najafi, Ali Elahian, Fatemeh Zadeh Shirazi, Amin Gomez, Guillermo A. Tahmasebian, Shahram |
author_sort | Mahdi-Esferizi, Roohallah |
collection | PubMed |
description | BACKGROUND: P4 medicine (predict, prevent, personalize, and participate) is a new approach to diagnosing and predicting diseases on a patient-by-patient basis. For the prevention and treatment of diseases, prediction plays a fundamental role. One of the intelligent strategies is the design of deep learning models that can predict the state of the disease using gene expression data. RESULTS: We create an autoencoder deep learning model called DeeP4med, including a Classifier and a Transferor that predicts cancer's gene expression (mRNA) matrix from its matched normal sample and vice versa. The range of the F1 score of the model, depending on tissue type in the Classifier, is from 0.935 to 0.999 and in Transferor from 0.944 to 0.999. The accuracy of DeeP4med for tissue and disease classification was 0.986 and 0.992, respectively, which performed better compared to seven classic machine learning models (Support Vector Classifier, Logistic Regression, Linear Discriminant Analysis, Naive Bayes, Decision Tree, Random Forest, K Nearest Neighbors). CONCLUSIONS: Based on the idea of DeeP4med, by having the gene expression matrix of a normal tissue, we can predict its tumor gene expression matrix and, in this way, find effective genes in transforming a normal tissue into a tumor tissue. Results of Differentially Expressed Genes (DEGs) and enrichment analysis on the predicted matrices for 13 types of cancer showed a good correlation with the literature and biological databases. This led that by using the gene expression matrix, to train the model with features of each person in a normal and cancer state, this model could predict diagnosis based on gene expression data from healthy tissue and be used to identify possible therapeutic interventions for those patients. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05400-2. |
format | Online Article Text |
id | pubmed-10320882 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-103208822023-07-06 DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues Mahdi-Esferizi, Roohallah Haji Molla Hoseyni, Behnaz Mehrpanah, Amir Golzade, Yazdan Najafi, Ali Elahian, Fatemeh Zadeh Shirazi, Amin Gomez, Guillermo A. Tahmasebian, Shahram BMC Bioinformatics Research BACKGROUND: P4 medicine (predict, prevent, personalize, and participate) is a new approach to diagnosing and predicting diseases on a patient-by-patient basis. For the prevention and treatment of diseases, prediction plays a fundamental role. One of the intelligent strategies is the design of deep learning models that can predict the state of the disease using gene expression data. RESULTS: We create an autoencoder deep learning model called DeeP4med, including a Classifier and a Transferor that predicts cancer's gene expression (mRNA) matrix from its matched normal sample and vice versa. The range of the F1 score of the model, depending on tissue type in the Classifier, is from 0.935 to 0.999 and in Transferor from 0.944 to 0.999. The accuracy of DeeP4med for tissue and disease classification was 0.986 and 0.992, respectively, which performed better compared to seven classic machine learning models (Support Vector Classifier, Logistic Regression, Linear Discriminant Analysis, Naive Bayes, Decision Tree, Random Forest, K Nearest Neighbors). CONCLUSIONS: Based on the idea of DeeP4med, by having the gene expression matrix of a normal tissue, we can predict its tumor gene expression matrix and, in this way, find effective genes in transforming a normal tissue into a tumor tissue. Results of Differentially Expressed Genes (DEGs) and enrichment analysis on the predicted matrices for 13 types of cancer showed a good correlation with the literature and biological databases. This led that by using the gene expression matrix, to train the model with features of each person in a normal and cancer state, this model could predict diagnosis based on gene expression data from healthy tissue and be used to identify possible therapeutic interventions for those patients. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05400-2. BioMed Central 2023-07-04 /pmc/articles/PMC10320882/ /pubmed/37403016 http://dx.doi.org/10.1186/s12859-023-05400-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Mahdi-Esferizi, Roohallah Haji Molla Hoseyni, Behnaz Mehrpanah, Amir Golzade, Yazdan Najafi, Ali Elahian, Fatemeh Zadeh Shirazi, Amin Gomez, Guillermo A. Tahmasebian, Shahram DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues |
title | DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues |
title_full | DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues |
title_fullStr | DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues |
title_full_unstemmed | DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues |
title_short | DeeP4med: deep learning for P4 medicine to predict normal and cancer transcriptome in multiple human tissues |
title_sort | deep4med: deep learning for p4 medicine to predict normal and cancer transcriptome in multiple human tissues |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10320882/ https://www.ncbi.nlm.nih.gov/pubmed/37403016 http://dx.doi.org/10.1186/s12859-023-05400-2 |
work_keys_str_mv | AT mahdiesferiziroohallah deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues AT hajimollahoseynibehnaz deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues AT mehrpanahamir deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues AT golzadeyazdan deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues AT najafiali deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues AT elahianfatemeh deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues AT zadehshiraziamin deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues AT gomezguillermoa deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues AT tahmasebianshahram deep4meddeeplearningforp4medicinetopredictnormalandcancertranscriptomeinmultiplehumantissues |