Cargando…

Learning to Classify DWDM Optical Channels from Tiny and Imbalanced Data

Applying machine learning algorithms for assessing the transmission quality in optical networks is associated with substantial challenges. Datasets that could provide training instances tend to be small and heavily imbalanced. This requires applying imbalanced compensation techniques when using bina...

Descripción completa

Detalles Bibliográficos
Autores principales:	Cichosz, Paweł, Kozdrowski, Stanisław, Sujecki, Sławomir
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8623617/ https://www.ncbi.nlm.nih.gov/pubmed/34828202 http://dx.doi.org/10.3390/e23111504

_version_	1784605975424008192
author	Cichosz, Paweł Kozdrowski, Stanisław Sujecki, Sławomir
author_facet	Cichosz, Paweł Kozdrowski, Stanisław Sujecki, Sławomir
author_sort	Cichosz, Paweł
collection	PubMed
description	Applying machine learning algorithms for assessing the transmission quality in optical networks is associated with substantial challenges. Datasets that could provide training instances tend to be small and heavily imbalanced. This requires applying imbalanced compensation techniques when using binary classification algorithms, but it also makes one-class classification, learning only from instances of the majority class, a noteworthy alternative. This work examines the utility of both these approaches using a real dataset from a Dense Wavelength Division Multiplexing network operator, gathered through the network control plane. The dataset is indeed of a very small size and contains very few examples of “bad” paths that do not deliver the required level of transmission quality. Two binary classification algorithms, random forest and extreme gradient boosting, are used in combination with two imbalance handling methods, instance weighting and synthetic minority class instance generation. Their predictive performance is compared with that of four one-class classification algorithms: One-class SVM, one-class naive Bayes classifier, isolation forest, and maximum entropy modeling. The one-class approach turns out to be clearly superior, particularly with respect to the level of classification precision, making it possible to obtain more practically useful models.
format	Online Article Text
id	pubmed-8623617
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-86236172021-11-27 Learning to Classify DWDM Optical Channels from Tiny and Imbalanced Data Cichosz, Paweł Kozdrowski, Stanisław Sujecki, Sławomir Entropy (Basel) Article Applying machine learning algorithms for assessing the transmission quality in optical networks is associated with substantial challenges. Datasets that could provide training instances tend to be small and heavily imbalanced. This requires applying imbalanced compensation techniques when using binary classification algorithms, but it also makes one-class classification, learning only from instances of the majority class, a noteworthy alternative. This work examines the utility of both these approaches using a real dataset from a Dense Wavelength Division Multiplexing network operator, gathered through the network control plane. The dataset is indeed of a very small size and contains very few examples of “bad” paths that do not deliver the required level of transmission quality. Two binary classification algorithms, random forest and extreme gradient boosting, are used in combination with two imbalance handling methods, instance weighting and synthetic minority class instance generation. Their predictive performance is compared with that of four one-class classification algorithms: One-class SVM, one-class naive Bayes classifier, isolation forest, and maximum entropy modeling. The one-class approach turns out to be clearly superior, particularly with respect to the level of classification precision, making it possible to obtain more practically useful models. MDPI 2021-11-13 /pmc/articles/PMC8623617/ /pubmed/34828202 http://dx.doi.org/10.3390/e23111504 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Cichosz, Paweł Kozdrowski, Stanisław Sujecki, Sławomir Learning to Classify DWDM Optical Channels from Tiny and Imbalanced Data
title	Learning to Classify DWDM Optical Channels from Tiny and Imbalanced Data
title_full	Learning to Classify DWDM Optical Channels from Tiny and Imbalanced Data
title_fullStr	Learning to Classify DWDM Optical Channels from Tiny and Imbalanced Data
title_full_unstemmed	Learning to Classify DWDM Optical Channels from Tiny and Imbalanced Data
title_short	Learning to Classify DWDM Optical Channels from Tiny and Imbalanced Data
title_sort	learning to classify dwdm optical channels from tiny and imbalanced data
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8623617/ https://www.ncbi.nlm.nih.gov/pubmed/34828202 http://dx.doi.org/10.3390/e23111504
work_keys_str_mv	AT cichoszpaweł learningtoclassifydwdmopticalchannelsfromtinyandimbalanceddata AT kozdrowskistanisław learningtoclassifydwdmopticalchannelsfromtinyandimbalanceddata AT sujeckisławomir learningtoclassifydwdmopticalchannelsfromtinyandimbalanceddata

Learning to Classify DWDM Optical Channels from Tiny and Imbalanced Data

Ejemplares similares