Cargando…
Computational Models of Auditory Scene Analysis: A Review
Auditory scene analysis (ASA) refers to the process (es) of parsing the complex acoustic input into auditory perceptual objects representing either physical sources or temporal sound patterns, such as melodies, which contributed to the sound waves reaching the ears. A number of new computational mod...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5108797/ https://www.ncbi.nlm.nih.gov/pubmed/27895552 http://dx.doi.org/10.3389/fnins.2016.00524 |
_version_ | 1782467422267113472 |
---|---|
author | Szabó, Beáta T. Denham, Susan L. Winkler, István |
author_facet | Szabó, Beáta T. Denham, Susan L. Winkler, István |
author_sort | Szabó, Beáta T. |
collection | PubMed |
description | Auditory scene analysis (ASA) refers to the process (es) of parsing the complex acoustic input into auditory perceptual objects representing either physical sources or temporal sound patterns, such as melodies, which contributed to the sound waves reaching the ears. A number of new computational models accounting for some of the perceptual phenomena of ASA have been published recently. Here we provide a theoretically motivated review of these computational models, aiming to relate their guiding principles to the central issues of the theoretical framework of ASA. Specifically, we ask how they achieve the grouping and separation of sound elements and whether they implement some form of competition between alternative interpretations of the sound input. We consider the extent to which they include predictive processes, as important current theories suggest that perception is inherently predictive, and also how they have been evaluated. We conclude that current computational models of ASA are fragmentary in the sense that rather than providing general competing interpretations of ASA, they focus on assessing the utility of specific processes (or algorithms) for finding the causes of the complex acoustic signal. This leaves open the possibility for integrating complementary aspects of the models into a more comprehensive theory of ASA. |
format | Online Article Text |
id | pubmed-5108797 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-51087972016-11-28 Computational Models of Auditory Scene Analysis: A Review Szabó, Beáta T. Denham, Susan L. Winkler, István Front Neurosci Neuroscience Auditory scene analysis (ASA) refers to the process (es) of parsing the complex acoustic input into auditory perceptual objects representing either physical sources or temporal sound patterns, such as melodies, which contributed to the sound waves reaching the ears. A number of new computational models accounting for some of the perceptual phenomena of ASA have been published recently. Here we provide a theoretically motivated review of these computational models, aiming to relate their guiding principles to the central issues of the theoretical framework of ASA. Specifically, we ask how they achieve the grouping and separation of sound elements and whether they implement some form of competition between alternative interpretations of the sound input. We consider the extent to which they include predictive processes, as important current theories suggest that perception is inherently predictive, and also how they have been evaluated. We conclude that current computational models of ASA are fragmentary in the sense that rather than providing general competing interpretations of ASA, they focus on assessing the utility of specific processes (or algorithms) for finding the causes of the complex acoustic signal. This leaves open the possibility for integrating complementary aspects of the models into a more comprehensive theory of ASA. Frontiers Media S.A. 2016-11-15 /pmc/articles/PMC5108797/ /pubmed/27895552 http://dx.doi.org/10.3389/fnins.2016.00524 Text en Copyright © 2016 Szabó, Denham and Winkler. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Szabó, Beáta T. Denham, Susan L. Winkler, István Computational Models of Auditory Scene Analysis: A Review |
title | Computational Models of Auditory Scene Analysis: A Review |
title_full | Computational Models of Auditory Scene Analysis: A Review |
title_fullStr | Computational Models of Auditory Scene Analysis: A Review |
title_full_unstemmed | Computational Models of Auditory Scene Analysis: A Review |
title_short | Computational Models of Auditory Scene Analysis: A Review |
title_sort | computational models of auditory scene analysis: a review |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5108797/ https://www.ncbi.nlm.nih.gov/pubmed/27895552 http://dx.doi.org/10.3389/fnins.2016.00524 |
work_keys_str_mv | AT szabobeatat computationalmodelsofauditorysceneanalysisareview AT denhamsusanl computationalmodelsofauditorysceneanalysisareview AT winkleristvan computationalmodelsofauditorysceneanalysisareview |