Cargando…
Tri-Training Algorithm for Adaptive Nearest Neighbor Density Editing and Cross Entropy Evaluation
Tri-training expands the training set by adding pseudo-labels to unlabeled data, which effectively improves the generalization ability of the classifier, but it is easy to mislabel unlabeled data into training noise, which damages the learning efficiency of the classifier, and the explicit decision...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10047771/ https://www.ncbi.nlm.nih.gov/pubmed/36981368 http://dx.doi.org/10.3390/e25030480 |
_version_ | 1785014010233487360 |
---|---|
author | Zhao, Jia Luo, Yuhang Xiao, Renbin Wu, Runxiu Fan, Tanghuai |
author_facet | Zhao, Jia Luo, Yuhang Xiao, Renbin Wu, Runxiu Fan, Tanghuai |
author_sort | Zhao, Jia |
collection | PubMed |
description | Tri-training expands the training set by adding pseudo-labels to unlabeled data, which effectively improves the generalization ability of the classifier, but it is easy to mislabel unlabeled data into training noise, which damages the learning efficiency of the classifier, and the explicit decision mechanism tends to make the training noise degrade the accuracy of the classification model in the prediction stage. This study proposes the Tri-training algorithm for adaptive nearest neighbor density editing and cross-entropy evaluation (TTADEC), which is used to reduce the training noise formed during the classifier iteration and to solve the problem of inaccurate prediction by explicit decision mechanism. First, the TTADEC algorithm uses the nearest neighbor editing to label high-confidence samples. Then, combined with the relative nearest neighbor to define the local density of samples to screen the pre-training samples, and then dynamically expand the training set by adaptive technique. Finally, the decision process uses cross-entropy to evaluate the completed base classifier of training and assign appropriate weights to it to construct a decision function. The effectiveness of the TTADEC algorithm is verified on the UCI dataset, and the experimental results show that compared with the standard Tri-training algorithm and its improvement algorithm, the TTADEC algorithm has better classification performance and can effectively deal with the semi-supervised classification problem where the training set is insufficient. |
format | Online Article Text |
id | pubmed-10047771 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-100477712023-03-29 Tri-Training Algorithm for Adaptive Nearest Neighbor Density Editing and Cross Entropy Evaluation Zhao, Jia Luo, Yuhang Xiao, Renbin Wu, Runxiu Fan, Tanghuai Entropy (Basel) Article Tri-training expands the training set by adding pseudo-labels to unlabeled data, which effectively improves the generalization ability of the classifier, but it is easy to mislabel unlabeled data into training noise, which damages the learning efficiency of the classifier, and the explicit decision mechanism tends to make the training noise degrade the accuracy of the classification model in the prediction stage. This study proposes the Tri-training algorithm for adaptive nearest neighbor density editing and cross-entropy evaluation (TTADEC), which is used to reduce the training noise formed during the classifier iteration and to solve the problem of inaccurate prediction by explicit decision mechanism. First, the TTADEC algorithm uses the nearest neighbor editing to label high-confidence samples. Then, combined with the relative nearest neighbor to define the local density of samples to screen the pre-training samples, and then dynamically expand the training set by adaptive technique. Finally, the decision process uses cross-entropy to evaluate the completed base classifier of training and assign appropriate weights to it to construct a decision function. The effectiveness of the TTADEC algorithm is verified on the UCI dataset, and the experimental results show that compared with the standard Tri-training algorithm and its improvement algorithm, the TTADEC algorithm has better classification performance and can effectively deal with the semi-supervised classification problem where the training set is insufficient. MDPI 2023-03-09 /pmc/articles/PMC10047771/ /pubmed/36981368 http://dx.doi.org/10.3390/e25030480 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Zhao, Jia Luo, Yuhang Xiao, Renbin Wu, Runxiu Fan, Tanghuai Tri-Training Algorithm for Adaptive Nearest Neighbor Density Editing and Cross Entropy Evaluation |
title | Tri-Training Algorithm for Adaptive Nearest Neighbor Density Editing and Cross Entropy Evaluation |
title_full | Tri-Training Algorithm for Adaptive Nearest Neighbor Density Editing and Cross Entropy Evaluation |
title_fullStr | Tri-Training Algorithm for Adaptive Nearest Neighbor Density Editing and Cross Entropy Evaluation |
title_full_unstemmed | Tri-Training Algorithm for Adaptive Nearest Neighbor Density Editing and Cross Entropy Evaluation |
title_short | Tri-Training Algorithm for Adaptive Nearest Neighbor Density Editing and Cross Entropy Evaluation |
title_sort | tri-training algorithm for adaptive nearest neighbor density editing and cross entropy evaluation |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10047771/ https://www.ncbi.nlm.nih.gov/pubmed/36981368 http://dx.doi.org/10.3390/e25030480 |
work_keys_str_mv | AT zhaojia tritrainingalgorithmforadaptivenearestneighbordensityeditingandcrossentropyevaluation AT luoyuhang tritrainingalgorithmforadaptivenearestneighbordensityeditingandcrossentropyevaluation AT xiaorenbin tritrainingalgorithmforadaptivenearestneighbordensityeditingandcrossentropyevaluation AT wurunxiu tritrainingalgorithmforadaptivenearestneighbordensityeditingandcrossentropyevaluation AT fantanghuai tritrainingalgorithmforadaptivenearestneighbordensityeditingandcrossentropyevaluation |