Cargando…
Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer
Neural networks are frequently applied to medical data. We describe how complex and imbalanced data can be modelled with simple but accurate neural networks that are transparent to the user. In the case of a data set on cervical cancer with 753 observations excluding, missing values, and 32 covariat...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7274310/ http://dx.doi.org/10.1007/978-3-030-50146-4_43 |
_version_ | 1783542553241452544 |
---|---|
author | Lisboa, Paulo J. G. Ortega-Martorell, Sandra Olier, Ivan |
author_facet | Lisboa, Paulo J. G. Ortega-Martorell, Sandra Olier, Ivan |
author_sort | Lisboa, Paulo J. G. |
collection | PubMed |
description | Neural networks are frequently applied to medical data. We describe how complex and imbalanced data can be modelled with simple but accurate neural networks that are transparent to the user. In the case of a data set on cervical cancer with 753 observations excluding, missing values, and 32 covariates, with a prevalence of 73 cases (9.69%), we explain how model selection can be applied to the Multi-Layer Perceptron (MLP) by deriving a representation using a General Additive Neural Network. The model achieves an AUROC of 0.621 CI [0.519,0.721] for predicting positive diagnosis with Schiller’s test. This is comparable with the performance obtained by a deep learning network with an AUROC of 0.667 [1]. Instead of using all covariates, the Partial Response Network (PRN) involves just 2 variables, namely the number of years on Hormonal Contraceptives and the number of years using IUD, in a fully explained model. This is consistent with an additive non-linear statistical approach, the Sparse Additive Model [2] which estimates non-linear components in a logistic regression classifier using the backfitting algorithm applied to an ANOVA functional expansion. This paper shows how the PRN, applied to a challenging classification task, can provide insights into the influential variables, in this case correlated with incidence of cervical cancer, so reducing the number of unnecessary variables to be collected for screening. It does so by exploiting the efficiency of sparse statistical models to select features from an ANOVA decomposition of the MLP, in the process deriving a fully interpretable model. |
format | Online Article Text |
id | pubmed-7274310 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-72743102020-06-05 Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer Lisboa, Paulo J. G. Ortega-Martorell, Sandra Olier, Ivan Information Processing and Management of Uncertainty in Knowledge-Based Systems Article Neural networks are frequently applied to medical data. We describe how complex and imbalanced data can be modelled with simple but accurate neural networks that are transparent to the user. In the case of a data set on cervical cancer with 753 observations excluding, missing values, and 32 covariates, with a prevalence of 73 cases (9.69%), we explain how model selection can be applied to the Multi-Layer Perceptron (MLP) by deriving a representation using a General Additive Neural Network. The model achieves an AUROC of 0.621 CI [0.519,0.721] for predicting positive diagnosis with Schiller’s test. This is comparable with the performance obtained by a deep learning network with an AUROC of 0.667 [1]. Instead of using all covariates, the Partial Response Network (PRN) involves just 2 variables, namely the number of years on Hormonal Contraceptives and the number of years using IUD, in a fully explained model. This is consistent with an additive non-linear statistical approach, the Sparse Additive Model [2] which estimates non-linear components in a logistic regression classifier using the backfitting algorithm applied to an ANOVA functional expansion. This paper shows how the PRN, applied to a challenging classification task, can provide insights into the influential variables, in this case correlated with incidence of cervical cancer, so reducing the number of unnecessary variables to be collected for screening. It does so by exploiting the efficiency of sparse statistical models to select features from an ANOVA decomposition of the MLP, in the process deriving a fully interpretable model. 2020-05-18 /pmc/articles/PMC7274310/ http://dx.doi.org/10.1007/978-3-030-50146-4_43 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Lisboa, Paulo J. G. Ortega-Martorell, Sandra Olier, Ivan Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer |
title | Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer |
title_full | Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer |
title_fullStr | Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer |
title_full_unstemmed | Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer |
title_short | Explaining the Neural Network: A Case Study to Model the Incidence of Cervical Cancer |
title_sort | explaining the neural network: a case study to model the incidence of cervical cancer |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7274310/ http://dx.doi.org/10.1007/978-3-030-50146-4_43 |
work_keys_str_mv | AT lisboapaulojg explainingtheneuralnetworkacasestudytomodeltheincidenceofcervicalcancer AT ortegamartorellsandra explainingtheneuralnetworkacasestudytomodeltheincidenceofcervicalcancer AT olierivan explainingtheneuralnetworkacasestudytomodeltheincidenceofcervicalcancer |