End-to-End Model-Based Detection of Infants with Autism Spectrum Disorder Using a Pretrained Model
In this paper, we propose an end-to-end (E2E) neural network model to detect autism spectrum disorder (ASD) from children’s voices without explicitly extracting the deterministic features. In order to obtain the decisions for discriminating between the voices of children with ASD and those with typi...
Main Authors: | Lee, Jung Hyuk; Lee, Geon Woo; Bong, Guiyoung; Yoo, Hee Jeong; Kim, Hong Kook |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2022 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9823402/ https://www.ncbi.nlm.nih.gov/pubmed/36616801 http://dx.doi.org/10.3390/s23010202 |
_version_ | 1784866150757171200 |
---|---|
author | Lee, Jung Hyuk; Lee, Geon Woo; Bong, Guiyoung; Yoo, Hee Jeong; Kim, Hong Kook |
author_facet | Lee, Jung Hyuk; Lee, Geon Woo; Bong, Guiyoung; Yoo, Hee Jeong; Kim, Hong Kook |
author_sort | Lee, Jung Hyuk |
collection | PubMed |
description | In this paper, we propose an end-to-end (E2E) neural network model to detect autism spectrum disorder (ASD) from children’s voices without explicitly extracting the deterministic features. In order to obtain the decisions for discriminating between the voices of children with ASD and those with typical development (TD), we combined two different feature-extraction models and a bidirectional long short-term memory (BLSTM)-based classifier to obtain the ASD/TD classification in the form of probability. We realized one of the feature extractors as the bottleneck feature from an autoencoder using the extended version of the Geneva minimalistic acoustic parameter set (eGeMAPS) input. The other feature extractor is the context vector from a pretrained wav2vec2.0-based model directly applied to the waveform input. In addition, we optimized the E2E models in two different ways: (1) fine-tuning and (2) joint optimization. To evaluate the performance of the proposed E2E models, we prepared two datasets from video recordings of ASD diagnoses collected between 2016 and 2018 at Seoul National University Bundang Hospital (SNUBH), and between 2019 and 2021 at a Living Lab. According to the experimental results, the proposed wav2vec2.0-based E2E model with joint optimization achieved significant improvements in the accuracy and unweighted average recall, from 64.74% to 71.66% and from 65.04% to 70.81%, respectively, compared with a conventional model using autoencoder-based BLSTM and the deterministic features of the eGeMAPS. |
format | Online Article Text |
id | pubmed-9823402 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-9823402 2023-01-08 End-to-End Model-Based Detection of Infants with Autism Spectrum Disorder Using a Pretrained Model Lee, Jung Hyuk; Lee, Geon Woo; Bong, Guiyoung; Yoo, Hee Jeong; Kim, Hong Kook Sensors (Basel) Article In this paper, we propose an end-to-end (E2E) neural network model to detect autism spectrum disorder (ASD) from children’s voices without explicitly extracting the deterministic features. In order to obtain the decisions for discriminating between the voices of children with ASD and those with typical development (TD), we combined two different feature-extraction models and a bidirectional long short-term memory (BLSTM)-based classifier to obtain the ASD/TD classification in the form of probability. We realized one of the feature extractors as the bottleneck feature from an autoencoder using the extended version of the Geneva minimalistic acoustic parameter set (eGeMAPS) input. The other feature extractor is the context vector from a pretrained wav2vec2.0-based model directly applied to the waveform input. In addition, we optimized the E2E models in two different ways: (1) fine-tuning and (2) joint optimization. To evaluate the performance of the proposed E2E models, we prepared two datasets from video recordings of ASD diagnoses collected between 2016 and 2018 at Seoul National University Bundang Hospital (SNUBH), and between 2019 and 2021 at a Living Lab. According to the experimental results, the proposed wav2vec2.0-based E2E model with joint optimization achieved significant improvements in the accuracy and unweighted average recall, from 64.74% to 71.66% and from 65.04% to 70.81%, respectively, compared with a conventional model using autoencoder-based BLSTM and the deterministic features of the eGeMAPS. MDPI 2022-12-25 /pmc/articles/PMC9823402/ /pubmed/36616801 http://dx.doi.org/10.3390/s23010202 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Lee, Jung Hyuk Lee, Geon Woo Bong, Guiyoung Yoo, Hee Jeong Kim, Hong Kook End-to-End Model-Based Detection of Infants with Autism Spectrum Disorder Using a Pretrained Model |
title | End-to-End Model-Based Detection of Infants with Autism Spectrum Disorder Using a Pretrained Model |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9823402/ https://www.ncbi.nlm.nih.gov/pubmed/36616801 http://dx.doi.org/10.3390/s23010202 |
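
The description above reports results as accuracy and unweighted average recall (UAR). As a minimal sketch of how the two metrics differ, assuming the standard definition of UAR as the mean of per-class recalls, the following Python computes both on a small imbalanced toy example. The labels and predictions are invented for illustration and are not data from the article; the function names are hypothetical.

```python
from collections import defaultdict


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def unweighted_average_recall(y_true, y_pred):
    """Mean of per-class recalls; every class counts equally, whatever its size."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    hits = defaultdict(int)    # correctly predicted samples per true class
    totals = defaultdict(int)  # number of samples per true class
    for t, p in zip(y_true, y_pred):
        totals[t] += 1
        if t == p:
            hits[t] += 1
    return sum(hits[c] / totals[c] for c in totals) / len(totals)


# Toy data (assumption, not from the article): 2 ASD and 8 TD samples,
# with one of the two ASD samples misclassified as TD.
y_true = ["ASD"] * 2 + ["TD"] * 8
y_pred = ["ASD", "TD"] + ["TD"] * 8

acc = accuracy(y_true, y_pred)                   # 9 of 10 correct
uar = unweighted_average_recall(y_true, y_pred)  # (1/2 + 8/8) / 2
print(f"accuracy={acc:.2f}  UAR={uar:.2f}")
```

On this toy data the accuracy is 0.90 while the UAR is 0.75, because the single error falls in the minority class. That sensitivity to minority-class errors is one common reason for reporting UAR alongside accuracy when class sizes differ.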