Cargando…
Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods
This article discusses the idea of developing an intelligent and customizable automated system for real-time text and voice dialogs with the user. This system can be used for almost any subject area, for example, to create an automated robot - a call center operator or smart chat bots, assistants, a...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7198241/ http://dx.doi.org/10.1007/978-3-030-47240-5_10 |
_version_ | 1783528961724121088 |
---|---|
author | Tarasiev, Andrey Filippova, Margarita Aksyonov, Konstantin Aksyonova, Olga Antonova, Anna |
author_facet | Tarasiev, Andrey Filippova, Margarita Aksyonov, Konstantin Aksyonova, Olga Antonova, Anna |
author_sort | Tarasiev, Andrey |
collection | PubMed |
description | This article discusses the idea of developing an intelligent and customizable automated system for real-time text and voice dialogs with the user. This system can be used for almost any subject area, for example, to create an automated robot - a call center operator or smart chat bots, assistants, and so on. This article presents the developed flexible architecture of the proposed system. The system has many independent submodules. These modules work as interacting microservices and use several speech recognition schemes, including a decision support submodule, third-party speech recognition systems and a post-processing subsystem. In this paper, the post-processing module of the recognized text is presented in detail on the example of Russian and English dictionary models. The proposed submodule also uses several processing steps, including the use of various stemming methods, the use of word stop-lists or other lexical structures, the use of stochastic keyword ranking using a weight table, etc. |
format | Online Article Text |
id | pubmed-7198241 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-71982412020-05-05 Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods Tarasiev, Andrey Filippova, Margarita Aksyonov, Konstantin Aksyonova, Olga Antonova, Anna Open Source Systems Article This article discusses the idea of developing an intelligent and customizable automated system for real-time text and voice dialogs with the user. This system can be used for almost any subject area, for example, to create an automated robot - a call center operator or smart chat bots, assistants, and so on. This article presents the developed flexible architecture of the proposed system. The system has many independent submodules. These modules work as interacting microservices and use several speech recognition schemes, including a decision support submodule, third-party speech recognition systems and a post-processing subsystem. In this paper, the post-processing module of the recognized text is presented in detail on the example of Russian and English dictionary models. The proposed submodule also uses several processing steps, including the use of various stemming methods, the use of word stop-lists or other lexical structures, the use of stochastic keyword ranking using a weight table, etc. 2020-05-05 /pmc/articles/PMC7198241/ http://dx.doi.org/10.1007/978-3-030-47240-5_10 Text en © IFIP International Federation for Information Processing 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Tarasiev, Andrey Filippova, Margarita Aksyonov, Konstantin Aksyonova, Olga Antonova, Anna Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods |
title | Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods |
title_full | Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods |
title_fullStr | Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods |
title_full_unstemmed | Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods |
title_short | Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods |
title_sort | using of open-source technologies for the design and development of a speech processing system based on stemming methods |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7198241/ http://dx.doi.org/10.1007/978-3-030-47240-5_10 |
work_keys_str_mv | AT tarasievandrey usingofopensourcetechnologiesforthedesignanddevelopmentofaspeechprocessingsystembasedonstemmingmethods AT filippovamargarita usingofopensourcetechnologiesforthedesignanddevelopmentofaspeechprocessingsystembasedonstemmingmethods AT aksyonovkonstantin usingofopensourcetechnologiesforthedesignanddevelopmentofaspeechprocessingsystembasedonstemmingmethods AT aksyonovaolga usingofopensourcetechnologiesforthedesignanddevelopmentofaspeechprocessingsystembasedonstemmingmethods AT antonovaanna usingofopensourcetechnologiesforthedesignanddevelopmentofaspeechprocessingsystembasedonstemmingmethods |