Cargando…

Confidence-Based Framework Using Deep Learning for Automated Sleep Stage Scoring

STUDY OBJECTIVES: Automated sleep stage scoring is not yet vigorously used in practice because of the black-box nature and the risk of wrong predictions. The objective of this study was to introduce a confidence-based framework to detect the possibly wrong predictions that would inform clinicians ab...

Descripción completa

Detalles Bibliográficos
Autores principales: Hong, Jung Kyung, Lee, Taeyoung, Delos Reyes, Roben Deocampo, Hong, Joonki, Tran, Hai Hong, Lee, Dongheon, Jung, Jinhwan, Yoon, In-Young
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Dove 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8721741/
https://www.ncbi.nlm.nih.gov/pubmed/35002345
http://dx.doi.org/10.2147/NSS.S333566
_version_ 1784625405440819200
author Hong, Jung Kyung
Lee, Taeyoung
Delos Reyes, Roben Deocampo
Hong, Joonki
Tran, Hai Hong
Lee, Dongheon
Jung, Jinhwan
Yoon, In-Young
author_facet Hong, Jung Kyung
Lee, Taeyoung
Delos Reyes, Roben Deocampo
Hong, Joonki
Tran, Hai Hong
Lee, Dongheon
Jung, Jinhwan
Yoon, In-Young
author_sort Hong, Jung Kyung
collection PubMed
description STUDY OBJECTIVES: Automated sleep stage scoring is not yet vigorously used in practice because of the black-box nature and the risk of wrong predictions. The objective of this study was to introduce a confidence-based framework to detect the possibly wrong predictions that would inform clinicians about which epochs would require a manual review and investigate the potential to improve accuracy for automated sleep stage scoring. METHODS: We used 702 polysomnography studies from a local clinical dataset (SNUBH dataset) and 2804 from an open dataset (SHHS dataset) for experiments. We adapted the state-of-the-art TinySleepNet architecture to train the classifier and modified the ConfidNet architecture to train an auxiliary confidence model. For the confidence model, we developed a novel method, Dropout Correct Rate (DCR), and the performance of it was compared with other existing methods. RESULTS: Confidence estimates (0.754) reflected accuracy (0.758) well in general. The best performance for differentiating correct and wrong predictions was shown when using the DCR method (AUROC: 0.812) compared to the existing approaches which largely failed to detect wrong predictions. By reviewing only 20% of epochs that received the lowest confidence values, the overall accuracy of sleep stage scoring was improved from 76% to 87%. For patients with reduced accuracy (ie, individuals with obesity or severe sleep apnea), the possible improvement range after applying confidence estimation was even greater. CONCLUSION: To the best of our knowledge, this is the first study applying confidence estimation on automated sleep stage scoring. Reliable confidence estimates by the DCR method help screen out most of the wrong predictions, which would increase the reliability and interpretability of automated sleep stage scoring.
format Online
Article
Text
id pubmed-8721741
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Dove
record_format MEDLINE/PubMed
spelling pubmed-87217412022-01-06 Confidence-Based Framework Using Deep Learning for Automated Sleep Stage Scoring Hong, Jung Kyung Lee, Taeyoung Delos Reyes, Roben Deocampo Hong, Joonki Tran, Hai Hong Lee, Dongheon Jung, Jinhwan Yoon, In-Young Nat Sci Sleep Original Research STUDY OBJECTIVES: Automated sleep stage scoring is not yet vigorously used in practice because of the black-box nature and the risk of wrong predictions. The objective of this study was to introduce a confidence-based framework to detect the possibly wrong predictions that would inform clinicians about which epochs would require a manual review and investigate the potential to improve accuracy for automated sleep stage scoring. METHODS: We used 702 polysomnography studies from a local clinical dataset (SNUBH dataset) and 2804 from an open dataset (SHHS dataset) for experiments. We adapted the state-of-the-art TinySleepNet architecture to train the classifier and modified the ConfidNet architecture to train an auxiliary confidence model. For the confidence model, we developed a novel method, Dropout Correct Rate (DCR), and the performance of it was compared with other existing methods. RESULTS: Confidence estimates (0.754) reflected accuracy (0.758) well in general. The best performance for differentiating correct and wrong predictions was shown when using the DCR method (AUROC: 0.812) compared to the existing approaches which largely failed to detect wrong predictions. By reviewing only 20% of epochs that received the lowest confidence values, the overall accuracy of sleep stage scoring was improved from 76% to 87%. For patients with reduced accuracy (ie, individuals with obesity or severe sleep apnea), the possible improvement range after applying confidence estimation was even greater. CONCLUSION: To the best of our knowledge, this is the first study applying confidence estimation on automated sleep stage scoring. Reliable confidence estimates by the DCR method help screen out most of the wrong predictions, which would increase the reliability and interpretability of automated sleep stage scoring. Dove 2021-12-24 /pmc/articles/PMC8721741/ /pubmed/35002345 http://dx.doi.org/10.2147/NSS.S333566 Text en © 2021 Hong et al. https://creativecommons.org/licenses/by-nc/3.0/This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/ (https://creativecommons.org/licenses/by-nc/3.0/) ). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms (https://www.dovepress.com/terms.php).
spellingShingle Original Research
Hong, Jung Kyung
Lee, Taeyoung
Delos Reyes, Roben Deocampo
Hong, Joonki
Tran, Hai Hong
Lee, Dongheon
Jung, Jinhwan
Yoon, In-Young
Confidence-Based Framework Using Deep Learning for Automated Sleep Stage Scoring
title Confidence-Based Framework Using Deep Learning for Automated Sleep Stage Scoring
title_full Confidence-Based Framework Using Deep Learning for Automated Sleep Stage Scoring
title_fullStr Confidence-Based Framework Using Deep Learning for Automated Sleep Stage Scoring
title_full_unstemmed Confidence-Based Framework Using Deep Learning for Automated Sleep Stage Scoring
title_short Confidence-Based Framework Using Deep Learning for Automated Sleep Stage Scoring
title_sort confidence-based framework using deep learning for automated sleep stage scoring
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8721741/
https://www.ncbi.nlm.nih.gov/pubmed/35002345
http://dx.doi.org/10.2147/NSS.S333566
work_keys_str_mv AT hongjungkyung confidencebasedframeworkusingdeeplearningforautomatedsleepstagescoring
AT leetaeyoung confidencebasedframeworkusingdeeplearningforautomatedsleepstagescoring
AT delosreyesrobendeocampo confidencebasedframeworkusingdeeplearningforautomatedsleepstagescoring
AT hongjoonki confidencebasedframeworkusingdeeplearningforautomatedsleepstagescoring
AT tranhaihong confidencebasedframeworkusingdeeplearningforautomatedsleepstagescoring
AT leedongheon confidencebasedframeworkusingdeeplearningforautomatedsleepstagescoring
AT jungjinhwan confidencebasedframeworkusingdeeplearningforautomatedsleepstagescoring
AT yooninyoung confidencebasedframeworkusingdeeplearningforautomatedsleepstagescoring