Cargando…

Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration

Early diagnosis of lung cancer to increase the survival rate, which is currently at a low range of mid-30%, remains a critical need. Despite this, multi-omics data have rarely been applied to non-small-cell lung cancer (NSCLC) diagnosis. We developed a multi-omics data-affinitive artificial intellig...

Descripción completa

Detalles Bibliográficos
Autores principales: Park, Min-Koo, Lim, Jin-Muk, Jeong, Jinwoo, Jang, Yeongjae, Lee, Ji-Won, Lee, Jeong-Chan, Kim, Hyungyu, Koh, Euiyul, Hwang, Sung-Joo, Kim, Hong-Gee, Kim, Keun-Cheol
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9775093/
https://www.ncbi.nlm.nih.gov/pubmed/36551266
http://dx.doi.org/10.3390/biom12121839
_version_ 1784855559444365312
author Park, Min-Koo
Lim, Jin-Muk
Jeong, Jinwoo
Jang, Yeongjae
Lee, Ji-Won
Lee, Jeong-Chan
Kim, Hyungyu
Koh, Euiyul
Hwang, Sung-Joo
Kim, Hong-Gee
Kim, Keun-Cheol
author_facet Park, Min-Koo
Lim, Jin-Muk
Jeong, Jinwoo
Jang, Yeongjae
Lee, Ji-Won
Lee, Jeong-Chan
Kim, Hyungyu
Koh, Euiyul
Hwang, Sung-Joo
Kim, Hong-Gee
Kim, Keun-Cheol
author_sort Park, Min-Koo
collection PubMed
description Early diagnosis of lung cancer to increase the survival rate, which is currently at a low range of mid-30%, remains a critical need. Despite this, multi-omics data have rarely been applied to non-small-cell lung cancer (NSCLC) diagnosis. We developed a multi-omics data-affinitive artificial intelligence algorithm based on the graph convolutional network that integrates mRNA expression, DNA methylation, and DNA sequencing data. This NSCLC prediction model achieved a 93.7% macro F1-score, indicating that values for false positives and negatives were substantially low, which is desirable for accurate classification. Gene ontology enrichment and pathway analysis of features revealed that two major subtypes of NSCLC, lung adenocarcinoma and lung squamous cell carcinoma, have both specific and common GO biological processes. Numerous biomarkers (i.e., microRNA, long non-coding RNA, differentially methylated regions) were newly identified, whereas some biomarkers were consistent with previous findings in NSCLC (e.g., SPRR1B). Thus, using multi-omics data integration, we developed a promising cancer prediction algorithm.
format Online
Article
Text
id pubmed-9775093
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-97750932022-12-23 Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration Park, Min-Koo Lim, Jin-Muk Jeong, Jinwoo Jang, Yeongjae Lee, Ji-Won Lee, Jeong-Chan Kim, Hyungyu Koh, Euiyul Hwang, Sung-Joo Kim, Hong-Gee Kim, Keun-Cheol Biomolecules Article Early diagnosis of lung cancer to increase the survival rate, which is currently at a low range of mid-30%, remains a critical need. Despite this, multi-omics data have rarely been applied to non-small-cell lung cancer (NSCLC) diagnosis. We developed a multi-omics data-affinitive artificial intelligence algorithm based on the graph convolutional network that integrates mRNA expression, DNA methylation, and DNA sequencing data. This NSCLC prediction model achieved a 93.7% macro F1-score, indicating that values for false positives and negatives were substantially low, which is desirable for accurate classification. Gene ontology enrichment and pathway analysis of features revealed that two major subtypes of NSCLC, lung adenocarcinoma and lung squamous cell carcinoma, have both specific and common GO biological processes. Numerous biomarkers (i.e., microRNA, long non-coding RNA, differentially methylated regions) were newly identified, whereas some biomarkers were consistent with previous findings in NSCLC (e.g., SPRR1B). Thus, using multi-omics data integration, we developed a promising cancer prediction algorithm. MDPI 2022-12-08 /pmc/articles/PMC9775093/ /pubmed/36551266 http://dx.doi.org/10.3390/biom12121839 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Park, Min-Koo
Lim, Jin-Muk
Jeong, Jinwoo
Jang, Yeongjae
Lee, Ji-Won
Lee, Jeong-Chan
Kim, Hyungyu
Koh, Euiyul
Hwang, Sung-Joo
Kim, Hong-Gee
Kim, Keun-Cheol
Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration
title Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration
title_full Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration
title_fullStr Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration
title_full_unstemmed Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration
title_short Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration
title_sort deep-learning algorithm and concomitant biomarker identification for nsclc prediction using multi-omics data integration
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9775093/
https://www.ncbi.nlm.nih.gov/pubmed/36551266
http://dx.doi.org/10.3390/biom12121839
work_keys_str_mv AT parkminkoo deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT limjinmuk deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT jeongjinwoo deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT jangyeongjae deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT leejiwon deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT leejeongchan deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT kimhyungyu deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT koheuiyul deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT hwangsungjoo deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT kimhonggee deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT kimkeuncheol deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration