Cargando…
A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification
Understanding the reason for an infant's cry is the most difficult thing for parents. There might be various reasons behind the baby's cry. It may be due to hunger, pain, sleep, or diaper-related problems. The key concept behind identifying the reason behind the infant's cry is mainly...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8987163/ https://www.ncbi.nlm.nih.gov/pubmed/35400062 http://dx.doi.org/10.3389/fpubh.2022.819865 |
_version_ | 1784682679076126720 |
---|---|
author | Joshi, Vinayak Ravi Srinivasan, Kathiravan Vincent, P. M. Durai Raj Rajinikanth, Venkatesan Chang, Chuan-Yu |
author_facet | Joshi, Vinayak Ravi Srinivasan, Kathiravan Vincent, P. M. Durai Raj Rajinikanth, Venkatesan Chang, Chuan-Yu |
author_sort | Joshi, Vinayak Ravi |
collection | PubMed |
description | Understanding the reason for an infant's cry is the most difficult thing for parents. There might be various reasons behind the baby's cry. It may be due to hunger, pain, sleep, or diaper-related problems. The key concept behind identifying the reason behind the infant's cry is mainly based on the varying patterns of the crying audio. The audio file comprises many features, which are highly important in classifying the results. It is important to convert the audio signals into the required spectrograms. In this article, we are trying to find efficient solutions to the problem of predicting the reason behind an infant's cry. In this article, we have used the Mel-frequency cepstral coefficients algorithm to generate the spectrograms and analyzed the varying feature vectors. We then came up with two approaches to obtain the experimental results. In the first approach, we used the Convolution Neural network (CNN) variants like VGG16 and YOLOv4 to classify the infant cry signals. In the second approach, a multistage heterogeneous stacking ensemble model was used for infant cry classification. Its major advantage was the inclusion of various advanced boosting algorithms at various levels. The proposed multistage heterogeneous stacking ensemble model had the edge over the other neural network models, especially in terms of overall performance and computing power. Finally, after many comparisons, the proposed model revealed the virtuoso performance and a mean classification accuracy of up to 93.7%. |
format | Online Article Text |
id | pubmed-8987163 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-89871632022-04-08 A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification Joshi, Vinayak Ravi Srinivasan, Kathiravan Vincent, P. M. Durai Raj Rajinikanth, Venkatesan Chang, Chuan-Yu Front Public Health Public Health Understanding the reason for an infant's cry is the most difficult thing for parents. There might be various reasons behind the baby's cry. It may be due to hunger, pain, sleep, or diaper-related problems. The key concept behind identifying the reason behind the infant's cry is mainly based on the varying patterns of the crying audio. The audio file comprises many features, which are highly important in classifying the results. It is important to convert the audio signals into the required spectrograms. In this article, we are trying to find efficient solutions to the problem of predicting the reason behind an infant's cry. In this article, we have used the Mel-frequency cepstral coefficients algorithm to generate the spectrograms and analyzed the varying feature vectors. We then came up with two approaches to obtain the experimental results. In the first approach, we used the Convolution Neural network (CNN) variants like VGG16 and YOLOv4 to classify the infant cry signals. In the second approach, a multistage heterogeneous stacking ensemble model was used for infant cry classification. Its major advantage was the inclusion of various advanced boosting algorithms at various levels. The proposed multistage heterogeneous stacking ensemble model had the edge over the other neural network models, especially in terms of overall performance and computing power. Finally, after many comparisons, the proposed model revealed the virtuoso performance and a mean classification accuracy of up to 93.7%. Frontiers Media S.A. 2022-03-24 /pmc/articles/PMC8987163/ /pubmed/35400062 http://dx.doi.org/10.3389/fpubh.2022.819865 Text en Copyright © 2022 Joshi, Srinivasan, Vincent, Rajinikanth and Chang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Public Health Joshi, Vinayak Ravi Srinivasan, Kathiravan Vincent, P. M. Durai Raj Rajinikanth, Venkatesan Chang, Chuan-Yu A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification |
title | A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification |
title_full | A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification |
title_fullStr | A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification |
title_full_unstemmed | A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification |
title_short | A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification |
title_sort | multistage heterogeneous stacking ensemble model for augmented infant cry classification |
topic | Public Health |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8987163/ https://www.ncbi.nlm.nih.gov/pubmed/35400062 http://dx.doi.org/10.3389/fpubh.2022.819865 |
work_keys_str_mv | AT joshivinayakravi amultistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification AT srinivasankathiravan amultistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification AT vincentpmdurairaj amultistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification AT rajinikanthvenkatesan amultistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification AT changchuanyu amultistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification AT joshivinayakravi multistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification AT srinivasankathiravan multistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification AT vincentpmdurairaj multistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification AT rajinikanthvenkatesan multistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification AT changchuanyu multistageheterogeneousstackingensemblemodelforaugmentedinfantcryclassification |