Cargando…

Multi-scored sleep databases: how to exploit the multiple-labels in automated sleep scoring

STUDY OBJECTIVES: Inter-scorer variability in scoring polysomnograms is a well-known problem. Most of the existing automated sleep scoring systems are trained using labels annotated by a single-scorer, whose subjective evaluation is transferred to the model. When annotations from two or more scorers...

Descripción completa

Detalles Bibliográficos
Autores principales: Fiorillo, Luigi, Pedroncelli, Davide, Agostini, Valentina, Favaro, Paolo, Faraci, Francesca Dalia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10171642/
https://www.ncbi.nlm.nih.gov/pubmed/36762998
http://dx.doi.org/10.1093/sleep/zsad028
_version_ 1785039465111093248
author Fiorillo, Luigi
Pedroncelli, Davide
Agostini, Valentina
Favaro, Paolo
Faraci, Francesca Dalia
author_facet Fiorillo, Luigi
Pedroncelli, Davide
Agostini, Valentina
Favaro, Paolo
Faraci, Francesca Dalia
author_sort Fiorillo, Luigi
collection PubMed
description STUDY OBJECTIVES: Inter-scorer variability in scoring polysomnograms is a well-known problem. Most of the existing automated sleep scoring systems are trained using labels annotated by a single-scorer, whose subjective evaluation is transferred to the model. When annotations from two or more scorers are available, the scoring models are usually trained on the scorer consensus. The averaged scorer’s subjectivity is transferred into the model, losing information about the internal variability among different scorers. In this study, we aim to insert the multiple-knowledge of the different physicians into the training procedure. The goal is to optimize a model training, exploiting the full information that can be extracted from the consensus of a group of scorers. METHODS: We train two lightweight deep learning-based models on three different multi-scored databases. We exploit the label smoothing technique together with a soft-consensus (LS(SC)) distribution to insert the multiple-knowledge in the training procedure of the model. We introduce the averaged cosine similarity metric (ACS) to quantify the similarity between the hypnodensity-graph generated by the models with-LS(SC) and the hypnodensity-graph generated by the scorer consensus. RESULTS: The performance of the models improves on all the databases when we train the models with our LS(SC). We found an increase in ACS (up to 6.4%) between the hypnodensity-graph generated by the models trained with-LS(SC) and the hypnodensity-graph generated by the consensus. CONCLUSION: Our approach definitely enables a model to better adapt to the consensus of the group of scorers. Future work will focus on further investigations on different scoring architectures and hopefully large-scale-heterogeneous multi-scored datasets.
format Online
Article
Text
id pubmed-10171642
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101716422023-05-11 Multi-scored sleep databases: how to exploit the multiple-labels in automated sleep scoring Fiorillo, Luigi Pedroncelli, Davide Agostini, Valentina Favaro, Paolo Faraci, Francesca Dalia Sleep Sleep, Health, and Disease STUDY OBJECTIVES: Inter-scorer variability in scoring polysomnograms is a well-known problem. Most of the existing automated sleep scoring systems are trained using labels annotated by a single-scorer, whose subjective evaluation is transferred to the model. When annotations from two or more scorers are available, the scoring models are usually trained on the scorer consensus. The averaged scorer’s subjectivity is transferred into the model, losing information about the internal variability among different scorers. In this study, we aim to insert the multiple-knowledge of the different physicians into the training procedure. The goal is to optimize a model training, exploiting the full information that can be extracted from the consensus of a group of scorers. METHODS: We train two lightweight deep learning-based models on three different multi-scored databases. We exploit the label smoothing technique together with a soft-consensus (LS(SC)) distribution to insert the multiple-knowledge in the training procedure of the model. We introduce the averaged cosine similarity metric (ACS) to quantify the similarity between the hypnodensity-graph generated by the models with-LS(SC) and the hypnodensity-graph generated by the scorer consensus. RESULTS: The performance of the models improves on all the databases when we train the models with our LS(SC). We found an increase in ACS (up to 6.4%) between the hypnodensity-graph generated by the models trained with-LS(SC) and the hypnodensity-graph generated by the consensus. CONCLUSION: Our approach definitely enables a model to better adapt to the consensus of the group of scorers. Future work will focus on further investigations on different scoring architectures and hopefully large-scale-heterogeneous multi-scored datasets. Oxford University Press 2023-02-10 /pmc/articles/PMC10171642/ /pubmed/36762998 http://dx.doi.org/10.1093/sleep/zsad028 Text en © Sleep Research Society 2023. Published by Oxford University Press on behalf of the Sleep Research Society. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Sleep, Health, and Disease
Fiorillo, Luigi
Pedroncelli, Davide
Agostini, Valentina
Favaro, Paolo
Faraci, Francesca Dalia
Multi-scored sleep databases: how to exploit the multiple-labels in automated sleep scoring
title Multi-scored sleep databases: how to exploit the multiple-labels in automated sleep scoring
title_full Multi-scored sleep databases: how to exploit the multiple-labels in automated sleep scoring
title_fullStr Multi-scored sleep databases: how to exploit the multiple-labels in automated sleep scoring
title_full_unstemmed Multi-scored sleep databases: how to exploit the multiple-labels in automated sleep scoring
title_short Multi-scored sleep databases: how to exploit the multiple-labels in automated sleep scoring
title_sort multi-scored sleep databases: how to exploit the multiple-labels in automated sleep scoring
topic Sleep, Health, and Disease
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10171642/
https://www.ncbi.nlm.nih.gov/pubmed/36762998
http://dx.doi.org/10.1093/sleep/zsad028
work_keys_str_mv AT fiorilloluigi multiscoredsleepdatabaseshowtoexploitthemultiplelabelsinautomatedsleepscoring
AT pedroncellidavide multiscoredsleepdatabaseshowtoexploitthemultiplelabelsinautomatedsleepscoring
AT agostinivalentina multiscoredsleepdatabaseshowtoexploitthemultiplelabelsinautomatedsleepscoring
AT favaropaolo multiscoredsleepdatabaseshowtoexploitthemultiplelabelsinautomatedsleepscoring
AT faracifrancescadalia multiscoredsleepdatabaseshowtoexploitthemultiplelabelsinautomatedsleepscoring