Cargando…
Improving Hybrid CTC/Attention Architecture for Agglutinative Language Speech Recognition
Unlike the traditional model, the end-to-end (E2E) ASR model does not require speech information such as a pronunciation dictionary, and its system is built through a single neural network and obtains performance comparable to that of traditional methods. However, the model requires massive amounts...
Autores principales: | Ren, Zeyu, Yolwas, Nurmemet, Slamu, Wushour, Cao, Ronghe, Wang, Huiru |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9571619/ https://www.ncbi.nlm.nih.gov/pubmed/36236419 http://dx.doi.org/10.3390/s22197319 |
Ejemplares similares
-
A Study of Speech Recognition for Kazakh Based on Unsupervised Pre-Training
por: Meng, Weijing, et al.
Publicado: (2023) -
Multi-Attribute Decision Making with Einstein Aggregation Operators in Complex Q-Rung Orthopair Fuzzy Hypersoft Environments
por: Ying, Changyan, et al.
Publicado: (2022) -
End-to-End Automatic Pronunciation Error Detection Based on Improved Hybrid CTC/Attention Architecture
por: Zhang, Long, et al.
Publicado: (2020) -
Frontier Research on Low-Resource Speech Recognition Technology
por: Slam, Wushour, et al.
Publicado: (2023) -
Regional Language Speech Recognition from Bone-Conducted Speech Signals through Different Deep Learning Architectures
por: Putta, Venkata Subbaiah, et al.
Publicado: (2022)