Cargando…

Tri-Training Algorithm for Adaptive Nearest Neighbor Density Editing and Cross Entropy Evaluation

Tri-training expands the training set by adding pseudo-labels to unlabeled data, which effectively improves the generalization ability of the classifier, but it is easy to mislabel unlabeled data into training noise, which damages the learning efficiency of the classifier, and the explicit decision...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Jia, Luo, Yuhang, Xiao, Renbin, Wu, Runxiu, Fan, Tanghuai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10047771/
https://www.ncbi.nlm.nih.gov/pubmed/36981368
http://dx.doi.org/10.3390/e25030480
_version_ 1785014010233487360
author Zhao, Jia
Luo, Yuhang
Xiao, Renbin
Wu, Runxiu
Fan, Tanghuai
author_facet Zhao, Jia
Luo, Yuhang
Xiao, Renbin
Wu, Runxiu
Fan, Tanghuai
author_sort Zhao, Jia
collection PubMed
description Tri-training expands the training set by adding pseudo-labels to unlabeled data, which effectively improves the generalization ability of the classifier, but it is easy to mislabel unlabeled data into training noise, which damages the learning efficiency of the classifier, and the explicit decision mechanism tends to make the training noise degrade the accuracy of the classification model in the prediction stage. This study proposes the Tri-training algorithm for adaptive nearest neighbor density editing and cross-entropy evaluation (TTADEC), which is used to reduce the training noise formed during the classifier iteration and to solve the problem of inaccurate prediction by explicit decision mechanism. First, the TTADEC algorithm uses the nearest neighbor editing to label high-confidence samples. Then, combined with the relative nearest neighbor to define the local density of samples to screen the pre-training samples, and then dynamically expand the training set by adaptive technique. Finally, the decision process uses cross-entropy to evaluate the completed base classifier of training and assign appropriate weights to it to construct a decision function. The effectiveness of the TTADEC algorithm is verified on the UCI dataset, and the experimental results show that compared with the standard Tri-training algorithm and its improvement algorithm, the TTADEC algorithm has better classification performance and can effectively deal with the semi-supervised classification problem where the training set is insufficient.
format Online
Article
Text
id pubmed-10047771
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100477712023-03-29 Tri-Training Algorithm for Adaptive Nearest Neighbor Density Editing and Cross Entropy Evaluation Zhao, Jia Luo, Yuhang Xiao, Renbin Wu, Runxiu Fan, Tanghuai Entropy (Basel) Article Tri-training expands the training set by adding pseudo-labels to unlabeled data, which effectively improves the generalization ability of the classifier, but it is easy to mislabel unlabeled data into training noise, which damages the learning efficiency of the classifier, and the explicit decision mechanism tends to make the training noise degrade the accuracy of the classification model in the prediction stage. This study proposes the Tri-training algorithm for adaptive nearest neighbor density editing and cross-entropy evaluation (TTADEC), which is used to reduce the training noise formed during the classifier iteration and to solve the problem of inaccurate prediction by explicit decision mechanism. First, the TTADEC algorithm uses the nearest neighbor editing to label high-confidence samples. Then, combined with the relative nearest neighbor to define the local density of samples to screen the pre-training samples, and then dynamically expand the training set by adaptive technique. Finally, the decision process uses cross-entropy to evaluate the completed base classifier of training and assign appropriate weights to it to construct a decision function. The effectiveness of the TTADEC algorithm is verified on the UCI dataset, and the experimental results show that compared with the standard Tri-training algorithm and its improvement algorithm, the TTADEC algorithm has better classification performance and can effectively deal with the semi-supervised classification problem where the training set is insufficient. MDPI 2023-03-09 /pmc/articles/PMC10047771/ /pubmed/36981368 http://dx.doi.org/10.3390/e25030480 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhao, Jia
Luo, Yuhang
Xiao, Renbin
Wu, Runxiu
Fan, Tanghuai
Tri-Training Algorithm for Adaptive Nearest Neighbor Density Editing and Cross Entropy Evaluation
title Tri-Training Algorithm for Adaptive Nearest Neighbor Density Editing and Cross Entropy Evaluation
title_full Tri-Training Algorithm for Adaptive Nearest Neighbor Density Editing and Cross Entropy Evaluation
title_fullStr Tri-Training Algorithm for Adaptive Nearest Neighbor Density Editing and Cross Entropy Evaluation
title_full_unstemmed Tri-Training Algorithm for Adaptive Nearest Neighbor Density Editing and Cross Entropy Evaluation
title_short Tri-Training Algorithm for Adaptive Nearest Neighbor Density Editing and Cross Entropy Evaluation
title_sort tri-training algorithm for adaptive nearest neighbor density editing and cross entropy evaluation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10047771/
https://www.ncbi.nlm.nih.gov/pubmed/36981368
http://dx.doi.org/10.3390/e25030480
work_keys_str_mv AT zhaojia tritrainingalgorithmforadaptivenearestneighbordensityeditingandcrossentropyevaluation
AT luoyuhang tritrainingalgorithmforadaptivenearestneighbordensityeditingandcrossentropyevaluation
AT xiaorenbin tritrainingalgorithmforadaptivenearestneighbordensityeditingandcrossentropyevaluation
AT wurunxiu tritrainingalgorithmforadaptivenearestneighbordensityeditingandcrossentropyevaluation
AT fantanghuai tritrainingalgorithmforadaptivenearestneighbordensityeditingandcrossentropyevaluation