Cargando…
Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data
Early stopping is an extremely common tool to minimize overfitting, which would otherwise be a cause of poor generalization of the model to novel data. However, early stopping is a heuristic that, while effective, primarily relies on ad hoc parameters and metrics. Optimizing when to stop remains a c...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Nature Singapore
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9836982/ https://www.ncbi.nlm.nih.gov/pubmed/36647373 http://dx.doi.org/10.1007/s42979-022-01553-8 |
_version_ | 1784868975981625344 |
---|---|
author | Bartlett, Christopher W. Bossenbroek, Jamie Ueyama, Yukie McCallinhart, Patricia Peters, Olivia A. Santillan, Donna A. Santillan, Mark K. Trask, Aaron J. Ray, William C. |
author_facet | Bartlett, Christopher W. Bossenbroek, Jamie Ueyama, Yukie McCallinhart, Patricia Peters, Olivia A. Santillan, Donna A. Santillan, Mark K. Trask, Aaron J. Ray, William C. |
author_sort | Bartlett, Christopher W. |
collection | PubMed |
description | Early stopping is an extremely common tool to minimize overfitting, which would otherwise be a cause of poor generalization of the model to novel data. However, early stopping is a heuristic that, while effective, primarily relies on ad hoc parameters and metrics. Optimizing when to stop remains a challenge. In this paper, we suggest that for some biomedical applications, a natural dichotomy of invasive/non-invasive measurements, or more generally proximal vs distal measurements of a biological system can be exploited to provide objective advice on early stopping. We discuss the conditions where invasive measurements of a biological process should provide better predictions than non-invasive measurements, or at best offer parity. Hence, if data from an invasive measurement are available locally, or from the literature, that information can be leveraged to know with high certainty whether a model of non-invasive data is overfitted. We present paired invasive/non-invasive cardiac and coronary artery measurements from two mouse strains, one of which spontaneously develops type 2 diabetes, posed as a classification problem. Examination of the various stopping rules shows that generalization is reduced with more training epochs and commonly applied stopping rules give widely different generalization error estimates. The use of an empirically derived training ceiling is demonstrated to be helpful as added information to leverage early stopping in order to reduce overfitting. |
format | Online Article Text |
id | pubmed-9836982 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer Nature Singapore |
record_format | MEDLINE/PubMed |
spelling | pubmed-98369822023-01-14 Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data Bartlett, Christopher W. Bossenbroek, Jamie Ueyama, Yukie McCallinhart, Patricia Peters, Olivia A. Santillan, Donna A. Santillan, Mark K. Trask, Aaron J. Ray, William C. SN Comput Sci Original Research Early stopping is an extremely common tool to minimize overfitting, which would otherwise be a cause of poor generalization of the model to novel data. However, early stopping is a heuristic that, while effective, primarily relies on ad hoc parameters and metrics. Optimizing when to stop remains a challenge. In this paper, we suggest that for some biomedical applications, a natural dichotomy of invasive/non-invasive measurements, or more generally proximal vs distal measurements of a biological system can be exploited to provide objective advice on early stopping. We discuss the conditions where invasive measurements of a biological process should provide better predictions than non-invasive measurements, or at best offer parity. Hence, if data from an invasive measurement are available locally, or from the literature, that information can be leveraged to know with high certainty whether a model of non-invasive data is overfitted. We present paired invasive/non-invasive cardiac and coronary artery measurements from two mouse strains, one of which spontaneously develops type 2 diabetes, posed as a classification problem. Examination of the various stopping rules shows that generalization is reduced with more training epochs and commonly applied stopping rules give widely different generalization error estimates. The use of an empirically derived training ceiling is demonstrated to be helpful as added information to leverage early stopping in order to reduce overfitting. Springer Nature Singapore 2023-01-12 2023 /pmc/articles/PMC9836982/ /pubmed/36647373 http://dx.doi.org/10.1007/s42979-022-01553-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Original Research Bartlett, Christopher W. Bossenbroek, Jamie Ueyama, Yukie McCallinhart, Patricia Peters, Olivia A. Santillan, Donna A. Santillan, Mark K. Trask, Aaron J. Ray, William C. Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data |
title | Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data |
title_full | Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data |
title_fullStr | Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data |
title_full_unstemmed | Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data |
title_short | Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data |
title_sort | invasive or more direct measurements can provide an objective early-stopping ceiling for training deep neural networks on non-invasive or less-direct biomedical data |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9836982/ https://www.ncbi.nlm.nih.gov/pubmed/36647373 http://dx.doi.org/10.1007/s42979-022-01553-8 |
work_keys_str_mv | AT bartlettchristopherw invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata AT bossenbroekjamie invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata AT ueyamayukie invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata AT mccallinhartpatricia invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata AT petersoliviaa invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata AT santillandonnaa invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata AT santillanmarkk invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata AT traskaaronj invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata AT raywilliamc invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata |