Cargando…

Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods

This article discusses the idea of developing an intelligent and customizable automated system for real-time text and voice dialogs with the user. This system can be used for almost any subject area, for example, to create an automated robot - a call center operator or smart chat bots, assistants, a...

Descripción completa

Detalles Bibliográficos
Autores principales: Tarasiev, Andrey, Filippova, Margarita, Aksyonov, Konstantin, Aksyonova, Olga, Antonova, Anna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7198241/
http://dx.doi.org/10.1007/978-3-030-47240-5_10
_version_ 1783528961724121088
author Tarasiev, Andrey
Filippova, Margarita
Aksyonov, Konstantin
Aksyonova, Olga
Antonova, Anna
author_facet Tarasiev, Andrey
Filippova, Margarita
Aksyonov, Konstantin
Aksyonova, Olga
Antonova, Anna
author_sort Tarasiev, Andrey
collection PubMed
description This article discusses the idea of developing an intelligent and customizable automated system for real-time text and voice dialogs with the user. This system can be used for almost any subject area, for example, to create an automated robot - a call center operator or smart chat bots, assistants, and so on. This article presents the developed flexible architecture of the proposed system. The system has many independent submodules. These modules work as interacting microservices and use several speech recognition schemes, including a decision support submodule, third-party speech recognition systems and a post-processing subsystem. In this paper, the post-processing module of the recognized text is presented in detail on the example of Russian and English dictionary models. The proposed submodule also uses several processing steps, including the use of various stemming methods, the use of word stop-lists or other lexical structures, the use of stochastic keyword ranking using a weight table, etc.
format Online
Article
Text
id pubmed-7198241
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-71982412020-05-05 Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods Tarasiev, Andrey Filippova, Margarita Aksyonov, Konstantin Aksyonova, Olga Antonova, Anna Open Source Systems Article This article discusses the idea of developing an intelligent and customizable automated system for real-time text and voice dialogs with the user. This system can be used for almost any subject area, for example, to create an automated robot - a call center operator or smart chat bots, assistants, and so on. This article presents the developed flexible architecture of the proposed system. The system has many independent submodules. These modules work as interacting microservices and use several speech recognition schemes, including a decision support submodule, third-party speech recognition systems and a post-processing subsystem. In this paper, the post-processing module of the recognized text is presented in detail on the example of Russian and English dictionary models. The proposed submodule also uses several processing steps, including the use of various stemming methods, the use of word stop-lists or other lexical structures, the use of stochastic keyword ranking using a weight table, etc. 2020-05-05 /pmc/articles/PMC7198241/ http://dx.doi.org/10.1007/978-3-030-47240-5_10 Text en © IFIP International Federation for Information Processing 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Tarasiev, Andrey
Filippova, Margarita
Aksyonov, Konstantin
Aksyonova, Olga
Antonova, Anna
Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods
title Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods
title_full Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods
title_fullStr Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods
title_full_unstemmed Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods
title_short Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods
title_sort using of open-source technologies for the design and development of a speech processing system based on stemming methods
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7198241/
http://dx.doi.org/10.1007/978-3-030-47240-5_10
work_keys_str_mv AT tarasievandrey usingofopensourcetechnologiesforthedesignanddevelopmentofaspeechprocessingsystembasedonstemmingmethods
AT filippovamargarita usingofopensourcetechnologiesforthedesignanddevelopmentofaspeechprocessingsystembasedonstemmingmethods
AT aksyonovkonstantin usingofopensourcetechnologiesforthedesignanddevelopmentofaspeechprocessingsystembasedonstemmingmethods
AT aksyonovaolga usingofopensourcetechnologiesforthedesignanddevelopmentofaspeechprocessingsystembasedonstemmingmethods
AT antonovaanna usingofopensourcetechnologiesforthedesignanddevelopmentofaspeechprocessingsystembasedonstemmingmethods