Cargando…
Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia
Children learn and develop their abilities at their own pace. One of the most basic skills that they acquire is reading. However, some children struggle with reading longer than their friends, and in such a case, it is possible that they have a learning disorder known as dyslexia. The paper aims to...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer London
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7883547/ https://www.ncbi.nlm.nih.gov/pubmed/33612979 http://dx.doi.org/10.1007/s00521-021-05695-3 |
_version_ | 1783651231571378176 |
---|---|
author | Atkar, Geeta Jayaraju, Priyadarshini |
author_facet | Atkar, Geeta Jayaraju, Priyadarshini |
author_sort | Atkar, Geeta |
collection | PubMed |
description | Children learn and develop their abilities at their own pace. One of the most basic skills that they acquire is reading. However, some children struggle with reading longer than their friends, and in such a case, it is possible that they have a learning disorder known as dyslexia. The paper aims to use neural networks, namely generative neural networks, for generating raw audio data of two- or three-letter Hindi words. Using the generated data, a system will be built that will pronounce generated words for children recuperating from dyslexia. The system aims to be an effective helping tool for teachers to speed up the recuperation process by making the child repeat the correct pronunciation of the word. The system uses advance Mel-generative adversarial network neural network for working with Mel-spectrograms of the raw audio, by which the system will model its own audio iteratively, until a satisfactory result is achieved. Generated audio sample contains the Hindi words which will be taught to children. Mel-generative adversarial network will be used to generate audio samples since it provides better results compared to other existing models. 300 basic two- or three-letter Hindi words are taken as an input for assisting 5- to 8-year children. Minimum opinion score is calculated for comparison. |
format | Online Article Text |
id | pubmed-7883547 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer London |
record_format | MEDLINE/PubMed |
spelling | pubmed-78835472021-02-16 Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia Atkar, Geeta Jayaraju, Priyadarshini Neural Comput Appl Original Article Children learn and develop their abilities at their own pace. One of the most basic skills that they acquire is reading. However, some children struggle with reading longer than their friends, and in such a case, it is possible that they have a learning disorder known as dyslexia. The paper aims to use neural networks, namely generative neural networks, for generating raw audio data of two- or three-letter Hindi words. Using the generated data, a system will be built that will pronounce generated words for children recuperating from dyslexia. The system aims to be an effective helping tool for teachers to speed up the recuperation process by making the child repeat the correct pronunciation of the word. The system uses advance Mel-generative adversarial network neural network for working with Mel-spectrograms of the raw audio, by which the system will model its own audio iteratively, until a satisfactory result is achieved. Generated audio sample contains the Hindi words which will be taught to children. Mel-generative adversarial network will be used to generate audio samples since it provides better results compared to other existing models. 300 basic two- or three-letter Hindi words are taken as an input for assisting 5- to 8-year children. Minimum opinion score is calculated for comparison. Springer London 2021-02-15 2021 /pmc/articles/PMC7883547/ /pubmed/33612979 http://dx.doi.org/10.1007/s00521-021-05695-3 Text en © The Author(s), under exclusive licence to Springer-Verlag London Ltd. part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Original Article Atkar, Geeta Jayaraju, Priyadarshini Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia |
title | Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia |
title_full | Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia |
title_fullStr | Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia |
title_full_unstemmed | Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia |
title_short | Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia |
title_sort | speech synthesis using generative adversarial network for improving readability of hindi words to recuperate from dyslexia |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7883547/ https://www.ncbi.nlm.nih.gov/pubmed/33612979 http://dx.doi.org/10.1007/s00521-021-05695-3 |
work_keys_str_mv | AT atkargeeta speechsynthesisusinggenerativeadversarialnetworkforimprovingreadabilityofhindiwordstorecuperatefromdyslexia AT jayarajupriyadarshini speechsynthesisusinggenerativeadversarialnetworkforimprovingreadabilityofhindiwordstorecuperatefromdyslexia |