Cargando…

Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia

Children learn and develop their abilities at their own pace. One of the most basic skills that they acquire is reading. However, some children struggle with reading longer than their friends, and in such a case, it is possible that they have a learning disorder known as dyslexia. The paper aims to...

Descripción completa

Detalles Bibliográficos
Autores principales: Atkar, Geeta, Jayaraju, Priyadarshini
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer London 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7883547/
https://www.ncbi.nlm.nih.gov/pubmed/33612979
http://dx.doi.org/10.1007/s00521-021-05695-3
_version_ 1783651231571378176
author Atkar, Geeta
Jayaraju, Priyadarshini
author_facet Atkar, Geeta
Jayaraju, Priyadarshini
author_sort Atkar, Geeta
collection PubMed
description Children learn and develop their abilities at their own pace. One of the most basic skills that they acquire is reading. However, some children struggle with reading longer than their friends, and in such a case, it is possible that they have a learning disorder known as dyslexia. The paper aims to use neural networks, namely generative neural networks, for generating raw audio data of two- or three-letter Hindi words. Using the generated data, a system will be built that will pronounce generated words for children recuperating from dyslexia. The system aims to be an effective helping tool for teachers to speed up the recuperation process by making the child repeat the correct pronunciation of the word. The system uses advance Mel-generative adversarial network neural network for working with Mel-spectrograms of the raw audio, by which the system will model its own audio iteratively, until a satisfactory result is achieved. Generated audio sample contains the Hindi words which will be taught to children. Mel-generative adversarial network will be used to generate audio samples since it provides better results compared to other existing models. 300 basic two- or three-letter Hindi words are taken as an input for assisting 5- to 8-year children. Minimum opinion score is calculated for comparison.
format Online
Article
Text
id pubmed-7883547
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer London
record_format MEDLINE/PubMed
spelling pubmed-78835472021-02-16 Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia Atkar, Geeta Jayaraju, Priyadarshini Neural Comput Appl Original Article Children learn and develop their abilities at their own pace. One of the most basic skills that they acquire is reading. However, some children struggle with reading longer than their friends, and in such a case, it is possible that they have a learning disorder known as dyslexia. The paper aims to use neural networks, namely generative neural networks, for generating raw audio data of two- or three-letter Hindi words. Using the generated data, a system will be built that will pronounce generated words for children recuperating from dyslexia. The system aims to be an effective helping tool for teachers to speed up the recuperation process by making the child repeat the correct pronunciation of the word. The system uses advance Mel-generative adversarial network neural network for working with Mel-spectrograms of the raw audio, by which the system will model its own audio iteratively, until a satisfactory result is achieved. Generated audio sample contains the Hindi words which will be taught to children. Mel-generative adversarial network will be used to generate audio samples since it provides better results compared to other existing models. 300 basic two- or three-letter Hindi words are taken as an input for assisting 5- to 8-year children. Minimum opinion score is calculated for comparison. Springer London 2021-02-15 2021 /pmc/articles/PMC7883547/ /pubmed/33612979 http://dx.doi.org/10.1007/s00521-021-05695-3 Text en © The Author(s), under exclusive licence to Springer-Verlag London Ltd. part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Article
Atkar, Geeta
Jayaraju, Priyadarshini
Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia
title Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia
title_full Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia
title_fullStr Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia
title_full_unstemmed Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia
title_short Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia
title_sort speech synthesis using generative adversarial network for improving readability of hindi words to recuperate from dyslexia
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7883547/
https://www.ncbi.nlm.nih.gov/pubmed/33612979
http://dx.doi.org/10.1007/s00521-021-05695-3
work_keys_str_mv AT atkargeeta speechsynthesisusinggenerativeadversarialnetworkforimprovingreadabilityofhindiwordstorecuperatefromdyslexia
AT jayarajupriyadarshini speechsynthesisusinggenerativeadversarialnetworkforimprovingreadabilityofhindiwordstorecuperatefromdyslexia