Cargando…

Machine-Learning-Based Late Fusion on Multi-Omics and Multi-Scale Data for Non-Small-Cell Lung Cancer Diagnosis

Differentiation between the various non-small-cell lung cancer subtypes is crucial for providing an effective treatment to the patient. For this purpose, machine learning techniques have been used in recent years over the available biological data from patients. However, in most cases this problem h...

Descripción completa

Detalles Bibliográficos
Autores principales: Carrillo-Perez, Francisco, Morales, Juan Carlos, Castillo-Secilla, Daniel, Gevaert, Olivier, Rojas, Ignacio, Herrera, Luis Javier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9025878/
https://www.ncbi.nlm.nih.gov/pubmed/35455716
http://dx.doi.org/10.3390/jpm12040601
_version_ 1784690983975256064
author Carrillo-Perez, Francisco
Morales, Juan Carlos
Castillo-Secilla, Daniel
Gevaert, Olivier
Rojas, Ignacio
Herrera, Luis Javier
author_facet Carrillo-Perez, Francisco
Morales, Juan Carlos
Castillo-Secilla, Daniel
Gevaert, Olivier
Rojas, Ignacio
Herrera, Luis Javier
author_sort Carrillo-Perez, Francisco
collection PubMed
description Differentiation between the various non-small-cell lung cancer subtypes is crucial for providing an effective treatment to the patient. For this purpose, machine learning techniques have been used in recent years over the available biological data from patients. However, in most cases this problem has been treated using a single-modality approach, not exploring the potential of the multi-scale and multi-omic nature of cancer data for the classification. In this work, we study the fusion of five multi-scale and multi-omic modalities (RNA-Seq, miRNA-Seq, whole-slide imaging, copy number variation, and DNA methylation) by using a late fusion strategy and machine learning techniques. We train an independent machine learning model for each modality and we explore the interactions and gains that can be obtained by fusing their outputs in an increasing manner, by using a novel optimization approach to compute the parameters of the late fusion. The final classification model, using all modalities, obtains an F1 score of [Formula: see text] , an AUC of [Formula: see text] , and an AUPRC of [Formula: see text] , improving those results that each independent model obtains and those presented in the literature for this problem. These obtained results show that leveraging the multi-scale and multi-omic nature of cancer data can enhance the performance of single-modality clinical decision support systems in personalized medicine, consequently improving the diagnosis of the patient.
format Online
Article
Text
id pubmed-9025878
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-90258782022-04-23 Machine-Learning-Based Late Fusion on Multi-Omics and Multi-Scale Data for Non-Small-Cell Lung Cancer Diagnosis Carrillo-Perez, Francisco Morales, Juan Carlos Castillo-Secilla, Daniel Gevaert, Olivier Rojas, Ignacio Herrera, Luis Javier J Pers Med Article Differentiation between the various non-small-cell lung cancer subtypes is crucial for providing an effective treatment to the patient. For this purpose, machine learning techniques have been used in recent years over the available biological data from patients. However, in most cases this problem has been treated using a single-modality approach, not exploring the potential of the multi-scale and multi-omic nature of cancer data for the classification. In this work, we study the fusion of five multi-scale and multi-omic modalities (RNA-Seq, miRNA-Seq, whole-slide imaging, copy number variation, and DNA methylation) by using a late fusion strategy and machine learning techniques. We train an independent machine learning model for each modality and we explore the interactions and gains that can be obtained by fusing their outputs in an increasing manner, by using a novel optimization approach to compute the parameters of the late fusion. The final classification model, using all modalities, obtains an F1 score of [Formula: see text] , an AUC of [Formula: see text] , and an AUPRC of [Formula: see text] , improving those results that each independent model obtains and those presented in the literature for this problem. These obtained results show that leveraging the multi-scale and multi-omic nature of cancer data can enhance the performance of single-modality clinical decision support systems in personalized medicine, consequently improving the diagnosis of the patient. MDPI 2022-04-08 /pmc/articles/PMC9025878/ /pubmed/35455716 http://dx.doi.org/10.3390/jpm12040601 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Carrillo-Perez, Francisco
Morales, Juan Carlos
Castillo-Secilla, Daniel
Gevaert, Olivier
Rojas, Ignacio
Herrera, Luis Javier
Machine-Learning-Based Late Fusion on Multi-Omics and Multi-Scale Data for Non-Small-Cell Lung Cancer Diagnosis
title Machine-Learning-Based Late Fusion on Multi-Omics and Multi-Scale Data for Non-Small-Cell Lung Cancer Diagnosis
title_full Machine-Learning-Based Late Fusion on Multi-Omics and Multi-Scale Data for Non-Small-Cell Lung Cancer Diagnosis
title_fullStr Machine-Learning-Based Late Fusion on Multi-Omics and Multi-Scale Data for Non-Small-Cell Lung Cancer Diagnosis
title_full_unstemmed Machine-Learning-Based Late Fusion on Multi-Omics and Multi-Scale Data for Non-Small-Cell Lung Cancer Diagnosis
title_short Machine-Learning-Based Late Fusion on Multi-Omics and Multi-Scale Data for Non-Small-Cell Lung Cancer Diagnosis
title_sort machine-learning-based late fusion on multi-omics and multi-scale data for non-small-cell lung cancer diagnosis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9025878/
https://www.ncbi.nlm.nih.gov/pubmed/35455716
http://dx.doi.org/10.3390/jpm12040601
work_keys_str_mv AT carrilloperezfrancisco machinelearningbasedlatefusiononmultiomicsandmultiscaledatafornonsmallcelllungcancerdiagnosis
AT moralesjuancarlos machinelearningbasedlatefusiononmultiomicsandmultiscaledatafornonsmallcelllungcancerdiagnosis
AT castillosecilladaniel machinelearningbasedlatefusiononmultiomicsandmultiscaledatafornonsmallcelllungcancerdiagnosis
AT gevaertolivier machinelearningbasedlatefusiononmultiomicsandmultiscaledatafornonsmallcelllungcancerdiagnosis
AT rojasignacio machinelearningbasedlatefusiononmultiomicsandmultiscaledatafornonsmallcelllungcancerdiagnosis
AT herreraluisjavier machinelearningbasedlatefusiononmultiomicsandmultiscaledatafornonsmallcelllungcancerdiagnosis