Cargando…

A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification

Understanding the reason for an infant's cry is the most difficult thing for parents. There might be various reasons behind the baby's cry. It may be due to hunger, pain, sleep, or diaper-related problems. The key concept behind identifying the reason behind the infant's cry is mainly...

Descripción completa

Detalles Bibliográficos
Autores principales: Joshi, Vinayak Ravi, Srinivasan, Kathiravan, Vincent, P. M. Durai Raj, Rajinikanth, Venkatesan, Chang, Chuan-Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8987163/
https://www.ncbi.nlm.nih.gov/pubmed/35400062
http://dx.doi.org/10.3389/fpubh.2022.819865
_version_ 1784682679076126720
author Joshi, Vinayak Ravi
Srinivasan, Kathiravan
Vincent, P. M. Durai Raj
Rajinikanth, Venkatesan
Chang, Chuan-Yu
author_facet Joshi, Vinayak Ravi
Srinivasan, Kathiravan
Vincent, P. M. Durai Raj
Rajinikanth, Venkatesan
Chang, Chuan-Yu
author_sort Joshi, Vinayak Ravi
collection PubMed
description Understanding the reason for an infant's cry is the most difficult thing for parents. There might be various reasons behind the baby's cry. It may be due to hunger, pain, sleep, or diaper-related problems. The key concept behind identifying the reason behind the infant's cry is mainly based on the varying patterns of the crying audio. The audio file comprises many features, which are highly important in classifying the results. It is important to convert the audio signals into the required spectrograms. In this article, we are trying to find efficient solutions to the problem of predicting the reason behind an infant's cry. In this article, we have used the Mel-frequency cepstral coefficients algorithm to generate the spectrograms and analyzed the varying feature vectors. We then came up with two approaches to obtain the experimental results. In the first approach, we used the Convolution Neural network (CNN) variants like VGG16 and YOLOv4 to classify the infant cry signals. In the second approach, a multistage heterogeneous stacking ensemble model was used for infant cry classification. Its major advantage was the inclusion of various advanced boosting algorithms at various levels. The proposed multistage heterogeneous stacking ensemble model had the edge over the other neural network models, especially in terms of overall performance and computing power. Finally, after many comparisons, the proposed model revealed the virtuoso performance and a mean classification accuracy of up to 93.7%.
format Online
Article
Text
id pubmed-8987163
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-89871632022-04-08 A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification Joshi, Vinayak Ravi Srinivasan, Kathiravan Vincent, P. M. Durai Raj Rajinikanth, Venkatesan Chang, Chuan-Yu Front Public Health Public Health Understanding the reason for an infant's cry is the most difficult thing for parents. There might be various reasons behind the baby's cry. It may be due to hunger, pain, sleep, or diaper-related problems. The key concept behind identifying the reason behind the infant's cry is mainly based on the varying patterns of the crying audio. The audio file comprises many features, which are highly important in classifying the results. It is important to convert the audio signals into the required spectrograms. In this article, we are trying to find efficient solutions to the problem of predicting the reason behind an infant's cry. In this article, we have used the Mel-frequency cepstral coefficients algorithm to generate the spectrograms and analyzed the varying feature vectors. We then came up with two approaches to obtain the experimental results. In the first approach, we used the Convolution Neural network (CNN) variants like VGG16 and YOLOv4 to classify the infant cry signals. In the second approach, a multistage heterogeneous stacking ensemble model was used for infant cry classification. Its major advantage was the inclusion of various advanced boosting algorithms at various levels. The proposed multistage heterogeneous stacking ensemble model had the edge over the other neural network models, especially in terms of overall performance and computing power. Finally, after many comparisons, the proposed model revealed the virtuoso performance and a mean classification accuracy of up to 93.7%. Frontiers Media S.A. 2022-03-24 /pmc/articles/PMC8987163/ /pubmed/35400062 http://dx.doi.org/10.3389/fpubh.2022.819865 Text en Copyright © 2022 Joshi, Srinivasan, Vincent, Rajinikanth and Chang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Public Health
Joshi, Vinayak Ravi
Srinivasan, Kathiravan
Vincent, P. M. Durai Raj
Rajinikanth, Venkatesan
Chang, Chuan-Yu
A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification
title A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification
title_full A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification
title_fullStr A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification
title_full_unstemmed A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification
title_short A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification
title_sort multistage heterogeneous stacking ensemble model for augmented infant cry classification
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8987163/
https://www.ncbi.nlm.nih.gov/pubmed/35400062
http://dx.doi.org/10.3389/fpubh.2022.819865
work_keys_str_mv AT joshivinayakravi amultistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification
AT srinivasankathiravan amultistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification
AT vincentpmdurairaj amultistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification
AT rajinikanthvenkatesan amultistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification
AT changchuanyu amultistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification
AT joshivinayakravi multistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification
AT srinivasankathiravan multistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification
AT vincentpmdurairaj multistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification
AT rajinikanthvenkatesan multistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification
AT changchuanyu multistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification