Cargando…

Voice disorder classification using convolutional neural network based on deep transfer learning

Voice disorders are very common in the global population. Many researchers have conducted research on the identification and classification of voice disorders based on machine learning. As a data-driven algorithm, machine learning requires a large number of samples for training. However, due to the...

Descripción completa

Detalles Bibliográficos
Autores principales:	Peng, Xiangyu, Xu, Huoyao, Liu, Jie, Wang, Junlang, He, Chaoming
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10160111/ https://www.ncbi.nlm.nih.gov/pubmed/37142759 http://dx.doi.org/10.1038/s41598-023-34461-9

_version_	1785037217013432320
author	Peng, Xiangyu Xu, Huoyao Liu, Jie Wang, Junlang He, Chaoming
author_facet	Peng, Xiangyu Xu, Huoyao Liu, Jie Wang, Junlang He, Chaoming
author_sort	Peng, Xiangyu
collection	PubMed
description	Voice disorders are very common in the global population. Many researchers have conducted research on the identification and classification of voice disorders based on machine learning. As a data-driven algorithm, machine learning requires a large number of samples for training. However, due to the sensitivity and particularity of medical data, it is difficult to obtain sufficient samples for model learning. To address this challenge, this paper proposes a pretrained OpenL3-SVM transfer learning framework for the automatic recognition of multi-class voice disorders. The framework combines a pre-trained convolutional neural network, OpenL3, and a support vector machine (SVM) classifier. The Mel spectrum of the given voice signal is first extracted and then input into the OpenL3 network to obtain high-level feature embedding. Considering the effects of redundant and negative high-dimensional features, model overfitting easily occurs. Therefore, linear local tangent space alignment (LLTSA) is used for feature dimension reduction. Finally, the obtained dimensionality reduction features are used to train the SVM for voice disorder classification. Fivefold cross-validation is used to verify the classification performance of the OpenL3-SVM. The experimental results show that OpenL3-SVM can effectively classify voice disorders automatically, and its performance exceeds that of the existing methods. With continuous improvements in research, it is expected to be considered as auxiliary diagnostic tool for physicians in the future.
format	Online Article Text
id	pubmed-10160111
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-101601112023-05-06 Voice disorder classification using convolutional neural network based on deep transfer learning Peng, Xiangyu Xu, Huoyao Liu, Jie Wang, Junlang He, Chaoming Sci Rep Article Voice disorders are very common in the global population. Many researchers have conducted research on the identification and classification of voice disorders based on machine learning. As a data-driven algorithm, machine learning requires a large number of samples for training. However, due to the sensitivity and particularity of medical data, it is difficult to obtain sufficient samples for model learning. To address this challenge, this paper proposes a pretrained OpenL3-SVM transfer learning framework for the automatic recognition of multi-class voice disorders. The framework combines a pre-trained convolutional neural network, OpenL3, and a support vector machine (SVM) classifier. The Mel spectrum of the given voice signal is first extracted and then input into the OpenL3 network to obtain high-level feature embedding. Considering the effects of redundant and negative high-dimensional features, model overfitting easily occurs. Therefore, linear local tangent space alignment (LLTSA) is used for feature dimension reduction. Finally, the obtained dimensionality reduction features are used to train the SVM for voice disorder classification. Fivefold cross-validation is used to verify the classification performance of the OpenL3-SVM. The experimental results show that OpenL3-SVM can effectively classify voice disorders automatically, and its performance exceeds that of the existing methods. With continuous improvements in research, it is expected to be considered as auxiliary diagnostic tool for physicians in the future. Nature Publishing Group UK 2023-05-04 /pmc/articles/PMC10160111/ /pubmed/37142759 http://dx.doi.org/10.1038/s41598-023-34461-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Peng, Xiangyu Xu, Huoyao Liu, Jie Wang, Junlang He, Chaoming Voice disorder classification using convolutional neural network based on deep transfer learning
title	Voice disorder classification using convolutional neural network based on deep transfer learning
title_full	Voice disorder classification using convolutional neural network based on deep transfer learning
title_fullStr	Voice disorder classification using convolutional neural network based on deep transfer learning
title_full_unstemmed	Voice disorder classification using convolutional neural network based on deep transfer learning
title_short	Voice disorder classification using convolutional neural network based on deep transfer learning
title_sort	voice disorder classification using convolutional neural network based on deep transfer learning
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10160111/ https://www.ncbi.nlm.nih.gov/pubmed/37142759 http://dx.doi.org/10.1038/s41598-023-34461-9
work_keys_str_mv	AT pengxiangyu voicedisorderclassificationusingconvolutionalneuralnetworkbasedondeeptransferlearning AT xuhuoyao voicedisorderclassificationusingconvolutionalneuralnetworkbasedondeeptransferlearning AT liujie voicedisorderclassificationusingconvolutionalneuralnetworkbasedondeeptransferlearning AT wangjunlang voicedisorderclassificationusingconvolutionalneuralnetworkbasedondeeptransferlearning AT hechaoming voicedisorderclassificationusingconvolutionalneuralnetworkbasedondeeptransferlearning

Voice disorder classification using convolutional neural network based on deep transfer learning

Ejemplares similares