Cargando…
Confidence-Based Framework Using Deep Learning for Automated Sleep Stage Scoring
STUDY OBJECTIVES: Automated sleep stage scoring is not yet vigorously used in practice because of the black-box nature and the risk of wrong predictions. The objective of this study was to introduce a confidence-based framework to detect the possibly wrong predictions that would inform clinicians ab...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Dove
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8721741/ https://www.ncbi.nlm.nih.gov/pubmed/35002345 http://dx.doi.org/10.2147/NSS.S333566 |
_version_ | 1784625405440819200 |
---|---|
author | Hong, Jung Kyung Lee, Taeyoung Delos Reyes, Roben Deocampo Hong, Joonki Tran, Hai Hong Lee, Dongheon Jung, Jinhwan Yoon, In-Young |
author_facet | Hong, Jung Kyung Lee, Taeyoung Delos Reyes, Roben Deocampo Hong, Joonki Tran, Hai Hong Lee, Dongheon Jung, Jinhwan Yoon, In-Young |
author_sort | Hong, Jung Kyung |
collection | PubMed |
description | STUDY OBJECTIVES: Automated sleep stage scoring is not yet vigorously used in practice because of the black-box nature and the risk of wrong predictions. The objective of this study was to introduce a confidence-based framework to detect the possibly wrong predictions that would inform clinicians about which epochs would require a manual review and investigate the potential to improve accuracy for automated sleep stage scoring. METHODS: We used 702 polysomnography studies from a local clinical dataset (SNUBH dataset) and 2804 from an open dataset (SHHS dataset) for experiments. We adapted the state-of-the-art TinySleepNet architecture to train the classifier and modified the ConfidNet architecture to train an auxiliary confidence model. For the confidence model, we developed a novel method, Dropout Correct Rate (DCR), and the performance of it was compared with other existing methods. RESULTS: Confidence estimates (0.754) reflected accuracy (0.758) well in general. The best performance for differentiating correct and wrong predictions was shown when using the DCR method (AUROC: 0.812) compared to the existing approaches which largely failed to detect wrong predictions. By reviewing only 20% of epochs that received the lowest confidence values, the overall accuracy of sleep stage scoring was improved from 76% to 87%. For patients with reduced accuracy (ie, individuals with obesity or severe sleep apnea), the possible improvement range after applying confidence estimation was even greater. CONCLUSION: To the best of our knowledge, this is the first study applying confidence estimation on automated sleep stage scoring. Reliable confidence estimates by the DCR method help screen out most of the wrong predictions, which would increase the reliability and interpretability of automated sleep stage scoring. |
format | Online Article Text |
id | pubmed-8721741 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Dove |
record_format | MEDLINE/PubMed |
spelling | pubmed-87217412022-01-06 Confidence-Based Framework Using Deep Learning for Automated Sleep Stage Scoring Hong, Jung Kyung Lee, Taeyoung Delos Reyes, Roben Deocampo Hong, Joonki Tran, Hai Hong Lee, Dongheon Jung, Jinhwan Yoon, In-Young Nat Sci Sleep Original Research STUDY OBJECTIVES: Automated sleep stage scoring is not yet vigorously used in practice because of the black-box nature and the risk of wrong predictions. The objective of this study was to introduce a confidence-based framework to detect the possibly wrong predictions that would inform clinicians about which epochs would require a manual review and investigate the potential to improve accuracy for automated sleep stage scoring. METHODS: We used 702 polysomnography studies from a local clinical dataset (SNUBH dataset) and 2804 from an open dataset (SHHS dataset) for experiments. We adapted the state-of-the-art TinySleepNet architecture to train the classifier and modified the ConfidNet architecture to train an auxiliary confidence model. For the confidence model, we developed a novel method, Dropout Correct Rate (DCR), and the performance of it was compared with other existing methods. RESULTS: Confidence estimates (0.754) reflected accuracy (0.758) well in general. The best performance for differentiating correct and wrong predictions was shown when using the DCR method (AUROC: 0.812) compared to the existing approaches which largely failed to detect wrong predictions. By reviewing only 20% of epochs that received the lowest confidence values, the overall accuracy of sleep stage scoring was improved from 76% to 87%. For patients with reduced accuracy (ie, individuals with obesity or severe sleep apnea), the possible improvement range after applying confidence estimation was even greater. CONCLUSION: To the best of our knowledge, this is the first study applying confidence estimation on automated sleep stage scoring. Reliable confidence estimates by the DCR method help screen out most of the wrong predictions, which would increase the reliability and interpretability of automated sleep stage scoring. Dove 2021-12-24 /pmc/articles/PMC8721741/ /pubmed/35002345 http://dx.doi.org/10.2147/NSS.S333566 Text en © 2021 Hong et al. https://creativecommons.org/licenses/by-nc/3.0/This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/ (https://creativecommons.org/licenses/by-nc/3.0/) ). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms (https://www.dovepress.com/terms.php). |
spellingShingle | Original Research Hong, Jung Kyung Lee, Taeyoung Delos Reyes, Roben Deocampo Hong, Joonki Tran, Hai Hong Lee, Dongheon Jung, Jinhwan Yoon, In-Young Confidence-Based Framework Using Deep Learning for Automated Sleep Stage Scoring |
title | Confidence-Based Framework Using Deep Learning for Automated Sleep Stage Scoring |
title_full | Confidence-Based Framework Using Deep Learning for Automated Sleep Stage Scoring |
title_fullStr | Confidence-Based Framework Using Deep Learning for Automated Sleep Stage Scoring |
title_full_unstemmed | Confidence-Based Framework Using Deep Learning for Automated Sleep Stage Scoring |
title_short | Confidence-Based Framework Using Deep Learning for Automated Sleep Stage Scoring |
title_sort | confidence-based framework using deep learning for automated sleep stage scoring |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8721741/ https://www.ncbi.nlm.nih.gov/pubmed/35002345 http://dx.doi.org/10.2147/NSS.S333566 |
work_keys_str_mv | AT hongjungkyung confidencebasedframeworkusingdeeplearningforautomatedsleepstagescoring AT leetaeyoung confidencebasedframeworkusingdeeplearningforautomatedsleepstagescoring AT delosreyesrobendeocampo confidencebasedframeworkusingdeeplearningforautomatedsleepstagescoring AT hongjoonki confidencebasedframeworkusingdeeplearningforautomatedsleepstagescoring AT tranhaihong confidencebasedframeworkusingdeeplearningforautomatedsleepstagescoring AT leedongheon confidencebasedframeworkusingdeeplearningforautomatedsleepstagescoring AT jungjinhwan confidencebasedframeworkusingdeeplearningforautomatedsleepstagescoring AT yooninyoung confidencebasedframeworkusingdeeplearningforautomatedsleepstagescoring |