Cargando…

An Improved Transformer-Based Neural Machine Translation Strategy: Interacting-Head Attention

Transformer-based models have gained significant advances in neural machine translation (NMT). The main component of the transformer is the multihead attention layer. In theory, more heads enhance the expressive power of the NMT model. But this is not always the case in practice. On the one hand, th...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Dongxing, Luo, Zuying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9239798/
https://www.ncbi.nlm.nih.gov/pubmed/35774445
http://dx.doi.org/10.1155/2022/2998242