Cargando…
Impact of Label Noise on the Learning Based Models for a Binary Classification of Physiological Signal
Label noise is omnipresent in the annotations process and has an impact on supervised learning algorithms. This work focuses on the impact of label noise on the performance of learning models by examining the effect of random and class-dependent label noise on a binary classification task: quality a...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9572105/ https://www.ncbi.nlm.nih.gov/pubmed/36236265 http://dx.doi.org/10.3390/s22197166 |
_version_ | 1784810530540617728 |
---|---|
author | Ding, Cheng Pereira, Tania Xiao, Ran Lee, Randall J. Hu, Xiao |
author_facet | Ding, Cheng Pereira, Tania Xiao, Ran Lee, Randall J. Hu, Xiao |
author_sort | Ding, Cheng |
collection | PubMed |
description | Label noise is omnipresent in the annotations process and has an impact on supervised learning algorithms. This work focuses on the impact of label noise on the performance of learning models by examining the effect of random and class-dependent label noise on a binary classification task: quality assessment for photoplethysmography (PPG). PPG signal is used to detect physiological changes and its quality can have a significant impact on the subsequent tasks, which makes PPG quality assessment a particularly good target for examining the impact of label noise in the field of biomedicine. Random and class-dependent label noise was introduced separately into the training set to emulate the errors associated with fatigue and bias in labeling data samples. We also tested different representations of the PPG, including features defined by domain experts, 1D raw signal and 2D image. Three different classifiers are tested on the noisy training data, including support vector machine (SVM), XGBoost, 1D Resnet and 2D Resnet, which handle three representations, respectively. The results showed that the two deep learning models were more robust than the two traditional machine learning models for both the random and class-dependent label noise. From the representation perspective, the 2D image shows better robustness compared to the 1D raw signal. The logits from three classifiers are also analyzed, the predicted probabilities intend to be more dispersed when more label noise is introduced. From this work, we investigated various factors related to label noise, including representations, label noise type, and data imbalance, which can be a good guidebook for designing more robust methods for label noise in future work. |
format | Online Article Text |
id | pubmed-9572105 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-95721052022-10-17 Impact of Label Noise on the Learning Based Models for a Binary Classification of Physiological Signal Ding, Cheng Pereira, Tania Xiao, Ran Lee, Randall J. Hu, Xiao Sensors (Basel) Article Label noise is omnipresent in the annotations process and has an impact on supervised learning algorithms. This work focuses on the impact of label noise on the performance of learning models by examining the effect of random and class-dependent label noise on a binary classification task: quality assessment for photoplethysmography (PPG). PPG signal is used to detect physiological changes and its quality can have a significant impact on the subsequent tasks, which makes PPG quality assessment a particularly good target for examining the impact of label noise in the field of biomedicine. Random and class-dependent label noise was introduced separately into the training set to emulate the errors associated with fatigue and bias in labeling data samples. We also tested different representations of the PPG, including features defined by domain experts, 1D raw signal and 2D image. Three different classifiers are tested on the noisy training data, including support vector machine (SVM), XGBoost, 1D Resnet and 2D Resnet, which handle three representations, respectively. The results showed that the two deep learning models were more robust than the two traditional machine learning models for both the random and class-dependent label noise. From the representation perspective, the 2D image shows better robustness compared to the 1D raw signal. The logits from three classifiers are also analyzed, the predicted probabilities intend to be more dispersed when more label noise is introduced. From this work, we investigated various factors related to label noise, including representations, label noise type, and data imbalance, which can be a good guidebook for designing more robust methods for label noise in future work. MDPI 2022-09-21 /pmc/articles/PMC9572105/ /pubmed/36236265 http://dx.doi.org/10.3390/s22197166 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Ding, Cheng Pereira, Tania Xiao, Ran Lee, Randall J. Hu, Xiao Impact of Label Noise on the Learning Based Models for a Binary Classification of Physiological Signal |
title | Impact of Label Noise on the Learning Based Models for a Binary Classification of Physiological Signal |
title_full | Impact of Label Noise on the Learning Based Models for a Binary Classification of Physiological Signal |
title_fullStr | Impact of Label Noise on the Learning Based Models for a Binary Classification of Physiological Signal |
title_full_unstemmed | Impact of Label Noise on the Learning Based Models for a Binary Classification of Physiological Signal |
title_short | Impact of Label Noise on the Learning Based Models for a Binary Classification of Physiological Signal |
title_sort | impact of label noise on the learning based models for a binary classification of physiological signal |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9572105/ https://www.ncbi.nlm.nih.gov/pubmed/36236265 http://dx.doi.org/10.3390/s22197166 |
work_keys_str_mv | AT dingcheng impactoflabelnoiseonthelearningbasedmodelsforabinaryclassificationofphysiologicalsignal AT pereiratania impactoflabelnoiseonthelearningbasedmodelsforabinaryclassificationofphysiologicalsignal AT xiaoran impactoflabelnoiseonthelearningbasedmodelsforabinaryclassificationofphysiologicalsignal AT leerandallj impactoflabelnoiseonthelearningbasedmodelsforabinaryclassificationofphysiologicalsignal AT huxiao impactoflabelnoiseonthelearningbasedmodelsforabinaryclassificationofphysiologicalsignal |