Cargando…
An Improved Transformer-Based Neural Machine Translation Strategy: Interacting-Head Attention
Transformer-based models have gained significant advances in neural machine translation (NMT). The main component of the transformer is the multihead attention layer. In theory, more heads enhance the expressive power of the NMT model. But this is not always the case in practice. On the one hand, th...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9239798/ https://www.ncbi.nlm.nih.gov/pubmed/35774445 http://dx.doi.org/10.1155/2022/2998242 |