Cargando…

Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data

Early stopping is an extremely common tool to minimize overfitting, which would otherwise be a cause of poor generalization of the model to novel data. However, early stopping is a heuristic that, while effective, primarily relies on ad hoc parameters and metrics. Optimizing when to stop remains a c...

Descripción completa

Detalles Bibliográficos
Autores principales: Bartlett, Christopher W., Bossenbroek, Jamie, Ueyama, Yukie, McCallinhart, Patricia, Peters, Olivia A., Santillan, Donna A., Santillan, Mark K., Trask, Aaron J., Ray, William C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Nature Singapore 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9836982/
https://www.ncbi.nlm.nih.gov/pubmed/36647373
http://dx.doi.org/10.1007/s42979-022-01553-8
_version_ 1784868975981625344
author Bartlett, Christopher W.
Bossenbroek, Jamie
Ueyama, Yukie
McCallinhart, Patricia
Peters, Olivia A.
Santillan, Donna A.
Santillan, Mark K.
Trask, Aaron J.
Ray, William C.
author_facet Bartlett, Christopher W.
Bossenbroek, Jamie
Ueyama, Yukie
McCallinhart, Patricia
Peters, Olivia A.
Santillan, Donna A.
Santillan, Mark K.
Trask, Aaron J.
Ray, William C.
author_sort Bartlett, Christopher W.
collection PubMed
description Early stopping is an extremely common tool to minimize overfitting, which would otherwise be a cause of poor generalization of the model to novel data. However, early stopping is a heuristic that, while effective, primarily relies on ad hoc parameters and metrics. Optimizing when to stop remains a challenge. In this paper, we suggest that for some biomedical applications, a natural dichotomy of invasive/non-invasive measurements, or more generally proximal vs distal measurements of a biological system can be exploited to provide objective advice on early stopping. We discuss the conditions where invasive measurements of a biological process should provide better predictions than non-invasive measurements, or at best offer parity. Hence, if data from an invasive measurement are available locally, or from the literature, that information can be leveraged to know with high certainty whether a model of non-invasive data is overfitted. We present paired invasive/non-invasive cardiac and coronary artery measurements from two mouse strains, one of which spontaneously develops type 2 diabetes, posed as a classification problem. Examination of the various stopping rules shows that generalization is reduced with more training epochs and commonly applied stopping rules give widely different generalization error estimates. The use of an empirically derived training ceiling is demonstrated to be helpful as added information to leverage early stopping in order to reduce overfitting.
format Online
Article
Text
id pubmed-9836982
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer Nature Singapore
record_format MEDLINE/PubMed
spelling pubmed-98369822023-01-14 Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data Bartlett, Christopher W. Bossenbroek, Jamie Ueyama, Yukie McCallinhart, Patricia Peters, Olivia A. Santillan, Donna A. Santillan, Mark K. Trask, Aaron J. Ray, William C. SN Comput Sci Original Research Early stopping is an extremely common tool to minimize overfitting, which would otherwise be a cause of poor generalization of the model to novel data. However, early stopping is a heuristic that, while effective, primarily relies on ad hoc parameters and metrics. Optimizing when to stop remains a challenge. In this paper, we suggest that for some biomedical applications, a natural dichotomy of invasive/non-invasive measurements, or more generally proximal vs distal measurements of a biological system can be exploited to provide objective advice on early stopping. We discuss the conditions where invasive measurements of a biological process should provide better predictions than non-invasive measurements, or at best offer parity. Hence, if data from an invasive measurement are available locally, or from the literature, that information can be leveraged to know with high certainty whether a model of non-invasive data is overfitted. We present paired invasive/non-invasive cardiac and coronary artery measurements from two mouse strains, one of which spontaneously develops type 2 diabetes, posed as a classification problem. Examination of the various stopping rules shows that generalization is reduced with more training epochs and commonly applied stopping rules give widely different generalization error estimates. The use of an empirically derived training ceiling is demonstrated to be helpful as added information to leverage early stopping in order to reduce overfitting. Springer Nature Singapore 2023-01-12 2023 /pmc/articles/PMC9836982/ /pubmed/36647373 http://dx.doi.org/10.1007/s42979-022-01553-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Research
Bartlett, Christopher W.
Bossenbroek, Jamie
Ueyama, Yukie
McCallinhart, Patricia
Peters, Olivia A.
Santillan, Donna A.
Santillan, Mark K.
Trask, Aaron J.
Ray, William C.
Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data
title Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data
title_full Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data
title_fullStr Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data
title_full_unstemmed Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data
title_short Invasive or More Direct Measurements Can Provide an Objective Early-Stopping Ceiling for Training Deep Neural Networks on Non-invasive or Less-Direct Biomedical Data
title_sort invasive or more direct measurements can provide an objective early-stopping ceiling for training deep neural networks on non-invasive or less-direct biomedical data
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9836982/
https://www.ncbi.nlm.nih.gov/pubmed/36647373
http://dx.doi.org/10.1007/s42979-022-01553-8
work_keys_str_mv AT bartlettchristopherw invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata
AT bossenbroekjamie invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata
AT ueyamayukie invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata
AT mccallinhartpatricia invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata
AT petersoliviaa invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata
AT santillandonnaa invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata
AT santillanmarkk invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata
AT traskaaronj invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata
AT raywilliamc invasiveormoredirectmeasurementscanprovideanobjectiveearlystoppingceilingfortrainingdeepneuralnetworksonnoninvasiveorlessdirectbiomedicaldata