Cargando…

A Tutorial on Auditory Attention Identification Methods

Auditory attention identification methods attempt to identify the sound source of a listener's interest by analyzing measurements of electrophysiological data. We present a tutorial on the numerous techniques that have been developed in recent decades, and we present an overview of current tren...

Descripción completa

Detalles Bibliográficos
Autores principales: Alickovic, Emina, Lunner, Thomas, Gustafsson, Fredrik, Ljung, Lennart
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6434370/
https://www.ncbi.nlm.nih.gov/pubmed/30941002
http://dx.doi.org/10.3389/fnins.2019.00153
_version_ 1783406468511301632
author Alickovic, Emina
Lunner, Thomas
Gustafsson, Fredrik
Ljung, Lennart
author_facet Alickovic, Emina
Lunner, Thomas
Gustafsson, Fredrik
Ljung, Lennart
author_sort Alickovic, Emina
collection PubMed
description Auditory attention identification methods attempt to identify the sound source of a listener's interest by analyzing measurements of electrophysiological data. We present a tutorial on the numerous techniques that have been developed in recent decades, and we present an overview of current trends in multivariate correlation-based and model-based learning frameworks. The focus is on the use of linear relations between electrophysiological and audio data. The way in which these relations are computed differs. For example, canonical correlation analysis (CCA) finds a linear subset of electrophysiological data that best correlates to audio data and a similar subset of audio data that best correlates to electrophysiological data. Model-based (encoding and decoding) approaches focus on either of these two sets. We investigate the similarities and differences between these linear model philosophies. We focus on (1) correlation-based approaches (CCA), (2) encoding/decoding models based on dense estimation, and (3) (adaptive) encoding/decoding models based on sparse estimation. The specific focus is on sparsity-driven adaptive encoding models and comparing the methodology in state-of-the-art models found in the auditory literature. Furthermore, we outline the main signal processing pipeline for how to identify the attended sound source in a cocktail party environment from the raw electrophysiological data with all the necessary steps, complemented with the necessary MATLAB code and the relevant references for each step. Our main aim is to compare the methodology of the available methods, and provide numerical illustrations to some of them to get a feeling for their potential. A thorough performance comparison is outside the scope of this tutorial.
format Online
Article
Text
id pubmed-6434370
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-64343702019-04-02 A Tutorial on Auditory Attention Identification Methods Alickovic, Emina Lunner, Thomas Gustafsson, Fredrik Ljung, Lennart Front Neurosci Neuroscience Auditory attention identification methods attempt to identify the sound source of a listener's interest by analyzing measurements of electrophysiological data. We present a tutorial on the numerous techniques that have been developed in recent decades, and we present an overview of current trends in multivariate correlation-based and model-based learning frameworks. The focus is on the use of linear relations between electrophysiological and audio data. The way in which these relations are computed differs. For example, canonical correlation analysis (CCA) finds a linear subset of electrophysiological data that best correlates to audio data and a similar subset of audio data that best correlates to electrophysiological data. Model-based (encoding and decoding) approaches focus on either of these two sets. We investigate the similarities and differences between these linear model philosophies. We focus on (1) correlation-based approaches (CCA), (2) encoding/decoding models based on dense estimation, and (3) (adaptive) encoding/decoding models based on sparse estimation. The specific focus is on sparsity-driven adaptive encoding models and comparing the methodology in state-of-the-art models found in the auditory literature. Furthermore, we outline the main signal processing pipeline for how to identify the attended sound source in a cocktail party environment from the raw electrophysiological data with all the necessary steps, complemented with the necessary MATLAB code and the relevant references for each step. Our main aim is to compare the methodology of the available methods, and provide numerical illustrations to some of them to get a feeling for their potential. A thorough performance comparison is outside the scope of this tutorial. Frontiers Media S.A. 2019-03-19 /pmc/articles/PMC6434370/ /pubmed/30941002 http://dx.doi.org/10.3389/fnins.2019.00153 Text en Copyright © 2019 Alickovic, Lunner, Gustafsson and Ljung. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Alickovic, Emina
Lunner, Thomas
Gustafsson, Fredrik
Ljung, Lennart
A Tutorial on Auditory Attention Identification Methods
title A Tutorial on Auditory Attention Identification Methods
title_full A Tutorial on Auditory Attention Identification Methods
title_fullStr A Tutorial on Auditory Attention Identification Methods
title_full_unstemmed A Tutorial on Auditory Attention Identification Methods
title_short A Tutorial on Auditory Attention Identification Methods
title_sort tutorial on auditory attention identification methods
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6434370/
https://www.ncbi.nlm.nih.gov/pubmed/30941002
http://dx.doi.org/10.3389/fnins.2019.00153
work_keys_str_mv AT alickovicemina atutorialonauditoryattentionidentificationmethods
AT lunnerthomas atutorialonauditoryattentionidentificationmethods
AT gustafssonfredrik atutorialonauditoryattentionidentificationmethods
AT ljunglennart atutorialonauditoryattentionidentificationmethods
AT alickovicemina tutorialonauditoryattentionidentificationmethods
AT lunnerthomas tutorialonauditoryattentionidentificationmethods
AT gustafssonfredrik tutorialonauditoryattentionidentificationmethods
AT ljunglennart tutorialonauditoryattentionidentificationmethods