Cargando…

Invariant recognition drives neural representations of action sequences

Recognizing the actions of others from visual stimuli is a crucial aspect of human perception that allows individuals to respond to social cues. Humans are able to discriminate between similar actions despite transformations, like changes in viewpoint or actor, that substantially alter the visual ap...

Descripción completa

Detalles Bibliográficos
Autores principales: Tacchetti, Andrea, Isik, Leyla, Poggio, Tomaso
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5749869/
https://www.ncbi.nlm.nih.gov/pubmed/29253864
http://dx.doi.org/10.1371/journal.pcbi.1005859
_version_ 1783289655749246976
author Tacchetti, Andrea
Isik, Leyla
Poggio, Tomaso
author_facet Tacchetti, Andrea
Isik, Leyla
Poggio, Tomaso
author_sort Tacchetti, Andrea
collection PubMed
description Recognizing the actions of others from visual stimuli is a crucial aspect of human perception that allows individuals to respond to social cues. Humans are able to discriminate between similar actions despite transformations, like changes in viewpoint or actor, that substantially alter the visual appearance of a scene. This ability to generalize across complex transformations is a hallmark of human visual intelligence. Advances in understanding action recognition at the neural level have not always translated into precise accounts of the computational principles underlying what representations of action sequences are constructed by human visual cortex. Here we test the hypothesis that invariant action discrimination might fill this gap. Recently, the study of artificial systems for static object perception has produced models, Convolutional Neural Networks (CNNs), that achieve human level performance in complex discriminative tasks. Within this class, architectures that better support invariant object recognition also produce image representations that better match those implied by human and primate neural data. However, whether these models produce representations of action sequences that support recognition across complex transformations and closely follow neural representations of actions remains unknown. Here we show that spatiotemporal CNNs accurately categorize video stimuli into action classes, and that deliberate model modifications that improve performance on an invariant action recognition task lead to data representations that better match human neural recordings. Our results support our hypothesis that performance on invariant discrimination dictates the neural representations of actions computed in the brain. These results broaden the scope of the invariant recognition framework for understanding visual intelligence from perception of inanimate objects and faces in static images to the study of human perception of action sequences.
format Online
Article
Text
id pubmed-5749869
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-57498692018-01-09 Invariant recognition drives neural representations of action sequences Tacchetti, Andrea Isik, Leyla Poggio, Tomaso PLoS Comput Biol Research Article Recognizing the actions of others from visual stimuli is a crucial aspect of human perception that allows individuals to respond to social cues. Humans are able to discriminate between similar actions despite transformations, like changes in viewpoint or actor, that substantially alter the visual appearance of a scene. This ability to generalize across complex transformations is a hallmark of human visual intelligence. Advances in understanding action recognition at the neural level have not always translated into precise accounts of the computational principles underlying what representations of action sequences are constructed by human visual cortex. Here we test the hypothesis that invariant action discrimination might fill this gap. Recently, the study of artificial systems for static object perception has produced models, Convolutional Neural Networks (CNNs), that achieve human level performance in complex discriminative tasks. Within this class, architectures that better support invariant object recognition also produce image representations that better match those implied by human and primate neural data. However, whether these models produce representations of action sequences that support recognition across complex transformations and closely follow neural representations of actions remains unknown. Here we show that spatiotemporal CNNs accurately categorize video stimuli into action classes, and that deliberate model modifications that improve performance on an invariant action recognition task lead to data representations that better match human neural recordings. Our results support our hypothesis that performance on invariant discrimination dictates the neural representations of actions computed in the brain. These results broaden the scope of the invariant recognition framework for understanding visual intelligence from perception of inanimate objects and faces in static images to the study of human perception of action sequences. Public Library of Science 2017-12-18 /pmc/articles/PMC5749869/ /pubmed/29253864 http://dx.doi.org/10.1371/journal.pcbi.1005859 Text en © 2017 Tacchetti et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Tacchetti, Andrea
Isik, Leyla
Poggio, Tomaso
Invariant recognition drives neural representations of action sequences
title Invariant recognition drives neural representations of action sequences
title_full Invariant recognition drives neural representations of action sequences
title_fullStr Invariant recognition drives neural representations of action sequences
title_full_unstemmed Invariant recognition drives neural representations of action sequences
title_short Invariant recognition drives neural representations of action sequences
title_sort invariant recognition drives neural representations of action sequences
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5749869/
https://www.ncbi.nlm.nih.gov/pubmed/29253864
http://dx.doi.org/10.1371/journal.pcbi.1005859
work_keys_str_mv AT tacchettiandrea invariantrecognitiondrivesneuralrepresentationsofactionsequences
AT isikleyla invariantrecognitiondrivesneuralrepresentationsofactionsequences
AT poggiotomaso invariantrecognitiondrivesneuralrepresentationsofactionsequences