Cargando…

Deep Learning for Infant Cry Recognition

Recognizing why an infant cries is challenging as babies cannot communicate verbally with others to express their wishes or needs. This leads to difficulties for parents in identifying the needs and the health of their infants. This study used deep learning (DL) algorithms such as the convolutional...

Descripción completa

Detalles Bibliográficos
Autores principales: Liang, Yun-Chia, Wijaya, Iven, Yang, Ming-Tao, Cuevas Juarez, Josue Rodolfo, Chang, Hou-Tai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9141218/
https://www.ncbi.nlm.nih.gov/pubmed/35627847
http://dx.doi.org/10.3390/ijerph19106311
_version_ 1784715291022852096
author Liang, Yun-Chia
Wijaya, Iven
Yang, Ming-Tao
Cuevas Juarez, Josue Rodolfo
Chang, Hou-Tai
author_facet Liang, Yun-Chia
Wijaya, Iven
Yang, Ming-Tao
Cuevas Juarez, Josue Rodolfo
Chang, Hou-Tai
author_sort Liang, Yun-Chia
collection PubMed
description Recognizing why an infant cries is challenging as babies cannot communicate verbally with others to express their wishes or needs. This leads to difficulties for parents in identifying the needs and the health of their infants. This study used deep learning (DL) algorithms such as the convolutional neural network (CNN) and long short-term memory (LSTM) to recognize infants’ necessities such as hunger/thirst, need for a diaper change, emotional needs (e.g., need for touch/holding), and pain caused by medical treatment (e.g., injection). The classical artificial neural network (ANN) was also used for comparison. The inputs of ANN, CNN, and LSTM were the features extracted from 1607 10 s audio recordings of infants using mel-frequency cepstral coefficients (MFCC). Results showed that CNN and LSTM both provided decent performance, around 95% in accuracy, precision, and recall, in differentiating healthy and sick infants. For recognizing infants’ specific needs, CNN reached up to 60% accuracy, outperforming LSTM and ANN in almost all measures. These results could be applied as indicators for future applications to help parents understand their infant’s condition and needs.
format Online
Article
Text
id pubmed-9141218
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91412182022-05-28 Deep Learning for Infant Cry Recognition Liang, Yun-Chia Wijaya, Iven Yang, Ming-Tao Cuevas Juarez, Josue Rodolfo Chang, Hou-Tai Int J Environ Res Public Health Article Recognizing why an infant cries is challenging as babies cannot communicate verbally with others to express their wishes or needs. This leads to difficulties for parents in identifying the needs and the health of their infants. This study used deep learning (DL) algorithms such as the convolutional neural network (CNN) and long short-term memory (LSTM) to recognize infants’ necessities such as hunger/thirst, need for a diaper change, emotional needs (e.g., need for touch/holding), and pain caused by medical treatment (e.g., injection). The classical artificial neural network (ANN) was also used for comparison. The inputs of ANN, CNN, and LSTM were the features extracted from 1607 10 s audio recordings of infants using mel-frequency cepstral coefficients (MFCC). Results showed that CNN and LSTM both provided decent performance, around 95% in accuracy, precision, and recall, in differentiating healthy and sick infants. For recognizing infants’ specific needs, CNN reached up to 60% accuracy, outperforming LSTM and ANN in almost all measures. These results could be applied as indicators for future applications to help parents understand their infant’s condition and needs. MDPI 2022-05-23 /pmc/articles/PMC9141218/ /pubmed/35627847 http://dx.doi.org/10.3390/ijerph19106311 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Liang, Yun-Chia
Wijaya, Iven
Yang, Ming-Tao
Cuevas Juarez, Josue Rodolfo
Chang, Hou-Tai
Deep Learning for Infant Cry Recognition
title Deep Learning for Infant Cry Recognition
title_full Deep Learning for Infant Cry Recognition
title_fullStr Deep Learning for Infant Cry Recognition
title_full_unstemmed Deep Learning for Infant Cry Recognition
title_short Deep Learning for Infant Cry Recognition
title_sort deep learning for infant cry recognition
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9141218/
https://www.ncbi.nlm.nih.gov/pubmed/35627847
http://dx.doi.org/10.3390/ijerph19106311
work_keys_str_mv AT liangyunchia deeplearningforinfantcryrecognition
AT wijayaiven deeplearningforinfantcryrecognition
AT yangmingtao deeplearningforinfantcryrecognition
AT cuevasjuarezjosuerodolfo deeplearningforinfantcryrecognition
AT changhoutai deeplearningforinfantcryrecognition