Cargando…

Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer

Neural networks are frequently applied to medical data. We describe how complex and imbalanced data can be modelled with simple but accurate neural networks that are transparent to the user. In the case of a data set on cervical cancer with 753 observations excluding, missing values, and 32 covariat...

Descripción completa

Detalles Bibliográficos
Autores principales: Lisboa, Paulo J. G., Ortega-Martorell, Sandra, Olier, Ivan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7274310/
http://dx.doi.org/10.1007/978-3-030-50146-4_43
_version_ 1783542553241452544
author Lisboa, Paulo J. G.
Ortega-Martorell, Sandra
Olier, Ivan
author_facet Lisboa, Paulo J. G.
Ortega-Martorell, Sandra
Olier, Ivan
author_sort Lisboa, Paulo J. G.
collection PubMed
description Neural networks are frequently applied to medical data. We describe how complex and imbalanced data can be modelled with simple but accurate neural networks that are transparent to the user. In the case of a data set on cervical cancer with 753 observations excluding, missing values, and 32 covariates, with a prevalence of 73 cases (9.69%), we explain how model selection can be applied to the Multi-Layer Perceptron (MLP) by deriving a representation using a General Additive Neural Network. The model achieves an AUROC of 0.621 CI [0.519,0.721] for predicting positive diagnosis with Schiller’s test. This is comparable with the performance obtained by a deep learning network with an AUROC of 0.667 [1]. Instead of using all covariates, the Partial Response Network (PRN) involves just 2 variables, namely the number of years on Hormonal Contraceptives and the number of years using IUD, in a fully explained model. This is consistent with an additive non-linear statistical approach, the Sparse Additive Model [2] which estimates non-linear components in a logistic regression classifier using the backfitting algorithm applied to an ANOVA functional expansion. This paper shows how the PRN, applied to a challenging classification task, can provide insights into the influential variables, in this case correlated with incidence of cervical cancer, so reducing the number of unnecessary variables to be collected for screening. It does so by exploiting the efficiency of sparse statistical models to select features from an ANOVA decomposition of the MLP, in the process deriving a fully interpretable model.
format Online
Article
Text
id pubmed-7274310
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72743102020-06-05 Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer Lisboa, Paulo J. G. Ortega-Martorell, Sandra Olier, Ivan Information Processing and Management of Uncertainty in Knowledge-Based Systems Article Neural networks are frequently applied to medical data. We describe how complex and imbalanced data can be modelled with simple but accurate neural networks that are transparent to the user. In the case of a data set on cervical cancer with 753 observations excluding, missing values, and 32 covariates, with a prevalence of 73 cases (9.69%), we explain how model selection can be applied to the Multi-Layer Perceptron (MLP) by deriving a representation using a General Additive Neural Network. The model achieves an AUROC of 0.621 CI [0.519,0.721] for predicting positive diagnosis with Schiller’s test. This is comparable with the performance obtained by a deep learning network with an AUROC of 0.667 [1]. Instead of using all covariates, the Partial Response Network (PRN) involves just 2 variables, namely the number of years on Hormonal Contraceptives and the number of years using IUD, in a fully explained model. This is consistent with an additive non-linear statistical approach, the Sparse Additive Model [2] which estimates non-linear components in a logistic regression classifier using the backfitting algorithm applied to an ANOVA functional expansion. This paper shows how the PRN, applied to a challenging classification task, can provide insights into the influential variables, in this case correlated with incidence of cervical cancer, so reducing the number of unnecessary variables to be collected for screening. It does so by exploiting the efficiency of sparse statistical models to select features from an ANOVA decomposition of the MLP, in the process deriving a fully interpretable model. 2020-05-18 /pmc/articles/PMC7274310/ http://dx.doi.org/10.1007/978-3-030-50146-4_43 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Lisboa, Paulo J. G.
Ortega-Martorell, Sandra
Olier, Ivan
Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer
title Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer
title_full Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer
title_fullStr Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer
title_full_unstemmed Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer
title_short Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer
title_sort explaining the neural network: a case study to model the incidence of cervical cancer
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7274310/
http://dx.doi.org/10.1007/978-3-030-50146-4_43
work_keys_str_mv AT lisboapaulojg explainingtheneuralnetworkacasestudytomodeltheincidenceofcervicalcancer
AT ortegamartorellsandra explainingtheneuralnetworkacasestudytomodeltheincidenceofcervicalcancer
AT olierivan explainingtheneuralnetworkacasestudytomodeltheincidenceofcervicalcancer