Cargando…

A Radiogenomics Ensemble to Predict EGFR and KRAS Mutations in NSCLC

Lung cancer causes more deaths globally than any other type of cancer. To determine the best treatment, detecting EGFR and KRAS mutations is of interest. However, non-invasive ways to obtain this information are not available. Furthermore, many times there is a lack of big enough relevant public dat...

Descripción completa

Detalles Bibliográficos
Autores principales: Moreno, Silvia, Bonfante, Mario, Zurek, Eduardo, Cherezov, Dmitry, Goldgof, Dmitry, Hall, Lawrence, Schabath, Matthew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8162978/
https://www.ncbi.nlm.nih.gov/pubmed/33946756
http://dx.doi.org/10.3390/tomography7020014
_version_ 1783700815541698560
author Moreno, Silvia
Bonfante, Mario
Zurek, Eduardo
Cherezov, Dmitry
Goldgof, Dmitry
Hall, Lawrence
Schabath, Matthew
author_facet Moreno, Silvia
Bonfante, Mario
Zurek, Eduardo
Cherezov, Dmitry
Goldgof, Dmitry
Hall, Lawrence
Schabath, Matthew
author_sort Moreno, Silvia
collection PubMed
description Lung cancer causes more deaths globally than any other type of cancer. To determine the best treatment, detecting EGFR and KRAS mutations is of interest. However, non-invasive ways to obtain this information are not available. Furthermore, many times there is a lack of big enough relevant public datasets, so the performance of single classifiers is not outstanding. In this paper, an ensemble approach is applied to increase the performance of EGFR and KRAS mutation prediction using a small dataset. A new voting scheme, Selective Class Average Voting (SCAV), is proposed and its performance is assessed both for machine learning models and CNNs. For the EGFR mutation, in the machine learning approach, there was an increase in the sensitivity from 0.66 to 0.75, and an increase in AUC from 0.68 to 0.70. With the deep learning approach, an AUC of 0.846 was obtained, and with SCAV, the accuracy of the model was increased from 0.80 to 0.857. For the KRAS mutation, both in the machine learning models (0.65 to 0.71 AUC) and the deep learning models (0.739 to 0.778 AUC), a significant increase in performance was found. The results obtained in this work show how to effectively learn from small image datasets to predict EGFR and KRAS mutations, and that using ensembles with SCAV increases the performance of machine learning classifiers and CNNs. The results provide confidence that as large datasets become available, tools to augment clinical capabilities can be fielded.
format Online
Article
Text
id pubmed-8162978
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-81629782021-05-29 A Radiogenomics Ensemble to Predict EGFR and KRAS Mutations in NSCLC Moreno, Silvia Bonfante, Mario Zurek, Eduardo Cherezov, Dmitry Goldgof, Dmitry Hall, Lawrence Schabath, Matthew Tomography Article Lung cancer causes more deaths globally than any other type of cancer. To determine the best treatment, detecting EGFR and KRAS mutations is of interest. However, non-invasive ways to obtain this information are not available. Furthermore, many times there is a lack of big enough relevant public datasets, so the performance of single classifiers is not outstanding. In this paper, an ensemble approach is applied to increase the performance of EGFR and KRAS mutation prediction using a small dataset. A new voting scheme, Selective Class Average Voting (SCAV), is proposed and its performance is assessed both for machine learning models and CNNs. For the EGFR mutation, in the machine learning approach, there was an increase in the sensitivity from 0.66 to 0.75, and an increase in AUC from 0.68 to 0.70. With the deep learning approach, an AUC of 0.846 was obtained, and with SCAV, the accuracy of the model was increased from 0.80 to 0.857. For the KRAS mutation, both in the machine learning models (0.65 to 0.71 AUC) and the deep learning models (0.739 to 0.778 AUC), a significant increase in performance was found. The results obtained in this work show how to effectively learn from small image datasets to predict EGFR and KRAS mutations, and that using ensembles with SCAV increases the performance of machine learning classifiers and CNNs. The results provide confidence that as large datasets become available, tools to augment clinical capabilities can be fielded. MDPI 2021-04-29 /pmc/articles/PMC8162978/ /pubmed/33946756 http://dx.doi.org/10.3390/tomography7020014 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Moreno, Silvia
Bonfante, Mario
Zurek, Eduardo
Cherezov, Dmitry
Goldgof, Dmitry
Hall, Lawrence
Schabath, Matthew
A Radiogenomics Ensemble to Predict EGFR and KRAS Mutations in NSCLC
title A Radiogenomics Ensemble to Predict EGFR and KRAS Mutations in NSCLC
title_full A Radiogenomics Ensemble to Predict EGFR and KRAS Mutations in NSCLC
title_fullStr A Radiogenomics Ensemble to Predict EGFR and KRAS Mutations in NSCLC
title_full_unstemmed A Radiogenomics Ensemble to Predict EGFR and KRAS Mutations in NSCLC
title_short A Radiogenomics Ensemble to Predict EGFR and KRAS Mutations in NSCLC
title_sort radiogenomics ensemble to predict egfr and kras mutations in nsclc
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8162978/
https://www.ncbi.nlm.nih.gov/pubmed/33946756
http://dx.doi.org/10.3390/tomography7020014
work_keys_str_mv AT morenosilvia aradiogenomicsensembletopredictegfrandkrasmutationsinnsclc
AT bonfantemario aradiogenomicsensembletopredictegfrandkrasmutationsinnsclc
AT zurekeduardo aradiogenomicsensembletopredictegfrandkrasmutationsinnsclc
AT cherezovdmitry aradiogenomicsensembletopredictegfrandkrasmutationsinnsclc
AT goldgofdmitry aradiogenomicsensembletopredictegfrandkrasmutationsinnsclc
AT halllawrence aradiogenomicsensembletopredictegfrandkrasmutationsinnsclc
AT schabathmatthew aradiogenomicsensembletopredictegfrandkrasmutationsinnsclc
AT morenosilvia radiogenomicsensembletopredictegfrandkrasmutationsinnsclc
AT bonfantemario radiogenomicsensembletopredictegfrandkrasmutationsinnsclc
AT zurekeduardo radiogenomicsensembletopredictegfrandkrasmutationsinnsclc
AT cherezovdmitry radiogenomicsensembletopredictegfrandkrasmutationsinnsclc
AT goldgofdmitry radiogenomicsensembletopredictegfrandkrasmutationsinnsclc
AT halllawrence radiogenomicsensembletopredictegfrandkrasmutationsinnsclc
AT schabathmatthew radiogenomicsensembletopredictegfrandkrasmutationsinnsclc