Cargando…

Decoding and synthesizing tonal language speech from brain activity

Recent studies have shown that the feasibility of speech brain-computer interfaces (BCIs) as a clinically valid treatment in helping nontonal language patients with communication disorders restore their speech ability. However, tonal language speech BCI is challenging because additional precise cont...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yan, Zhao, Zehao, Xu, Minpeng, Yu, Haiqing, Zhu, Yanming, Zhang, Jie, Bu, Linghao, Zhang, Xiaoluo, Lu, Junfeng, Li, Yuanning, Ming, Dong, Wu, Jinsong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Association for the Advancement of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256166/
https://www.ncbi.nlm.nih.gov/pubmed/37294753
http://dx.doi.org/10.1126/sciadv.adh0478
Descripción
Sumario:Recent studies have shown that the feasibility of speech brain-computer interfaces (BCIs) as a clinically valid treatment in helping nontonal language patients with communication disorders restore their speech ability. However, tonal language speech BCI is challenging because additional precise control of laryngeal movements to produce lexical tones is required. Thus, the model should emphasize the features from the tonal-related cortex. Here, we designed a modularized multistream neural network that directly synthesizes tonal language speech from intracranial recordings. The network decoded lexical tones and base syllables independently via parallel streams of neural network modules inspired by neuroscience findings. The speech was synthesized by combining tonal syllable labels with nondiscriminant speech neural activity. Compared to commonly used baseline models, our proposed models achieved higher performance with modest training data and computational costs. These findings raise a potential strategy for approaching tonal language speech restoration.