Cargando…

The Three Terms Task - an open benchmark to compare human and artificial semantic representations

Word processing entails retrieval of a unitary yet multidimensional semantic representation (e.g., a lemon’s colour, flavour, possible use) and has been investigated in both cognitive neuroscience and artificial intelligence. To enable the direct comparison of human and artificial semantic represent...

Descripción completa

Detalles Bibliográficos
Autores principales:	Borghesani, V., Armoza, J., Hebart, M. N., Bellec, P., Brambati, S. M.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2023
Materias:	Data Descriptor
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9981885/ https://www.ncbi.nlm.nih.gov/pubmed/36864054 http://dx.doi.org/10.1038/s41597-023-02015-3

_version_	1784900202945052672
author	Borghesani, V. Armoza, J. Hebart, M. N. Bellec, P. Brambati, S. M.
author_facet	Borghesani, V. Armoza, J. Hebart, M. N. Bellec, P. Brambati, S. M.
author_sort	Borghesani, V.
collection	PubMed
description	Word processing entails retrieval of a unitary yet multidimensional semantic representation (e.g., a lemon’s colour, flavour, possible use) and has been investigated in both cognitive neuroscience and artificial intelligence. To enable the direct comparison of human and artificial semantic representations, and to support the use of natural language processing (NLP) for computational modelling of human understanding, a critical challenge is the development of benchmarks of appropriate size and complexity. Here we present a dataset probing semantic knowledge with a three-terms semantic associative task: which of two target words is more closely associated with a given anchor (e.g., is lemon closer to squeezer or sour?). The dataset includes both abstract and concrete nouns for a total of 10,107 triplets. For the 2,255 triplets with varying levels of agreement among NLP word embeddings, we additionally collected behavioural similarity judgments from 1,322 human raters. We hope that this openly available, large-scale dataset will be a useful benchmark for both computational and neuroscientific investigations of semantic knowledge.
format	Online Article Text
id	pubmed-9981885
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-99818852023-03-04 The Three Terms Task - an open benchmark to compare human and artificial semantic representations Borghesani, V. Armoza, J. Hebart, M. N. Bellec, P. Brambati, S. M. Sci Data Data Descriptor Word processing entails retrieval of a unitary yet multidimensional semantic representation (e.g., a lemon’s colour, flavour, possible use) and has been investigated in both cognitive neuroscience and artificial intelligence. To enable the direct comparison of human and artificial semantic representations, and to support the use of natural language processing (NLP) for computational modelling of human understanding, a critical challenge is the development of benchmarks of appropriate size and complexity. Here we present a dataset probing semantic knowledge with a three-terms semantic associative task: which of two target words is more closely associated with a given anchor (e.g., is lemon closer to squeezer or sour?). The dataset includes both abstract and concrete nouns for a total of 10,107 triplets. For the 2,255 triplets with varying levels of agreement among NLP word embeddings, we additionally collected behavioural similarity judgments from 1,322 human raters. We hope that this openly available, large-scale dataset will be a useful benchmark for both computational and neuroscientific investigations of semantic knowledge. Nature Publishing Group UK 2023-03-02 /pmc/articles/PMC9981885/ /pubmed/36864054 http://dx.doi.org/10.1038/s41597-023-02015-3 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Data Descriptor Borghesani, V. Armoza, J. Hebart, M. N. Bellec, P. Brambati, S. M. The Three Terms Task - an open benchmark to compare human and artificial semantic representations
title	The Three Terms Task - an open benchmark to compare human and artificial semantic representations
title_full	The Three Terms Task - an open benchmark to compare human and artificial semantic representations
title_fullStr	The Three Terms Task - an open benchmark to compare human and artificial semantic representations
title_full_unstemmed	The Three Terms Task - an open benchmark to compare human and artificial semantic representations
title_short	The Three Terms Task - an open benchmark to compare human and artificial semantic representations
title_sort	three terms task - an open benchmark to compare human and artificial semantic representations
topic	Data Descriptor
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9981885/ https://www.ncbi.nlm.nih.gov/pubmed/36864054 http://dx.doi.org/10.1038/s41597-023-02015-3
work_keys_str_mv	AT borghesaniv thethreetermstaskanopenbenchmarktocomparehumanandartificialsemanticrepresentations AT armozaj thethreetermstaskanopenbenchmarktocomparehumanandartificialsemanticrepresentations AT hebartmn thethreetermstaskanopenbenchmarktocomparehumanandartificialsemanticrepresentations AT bellecp thethreetermstaskanopenbenchmarktocomparehumanandartificialsemanticrepresentations AT brambatism thethreetermstaskanopenbenchmarktocomparehumanandartificialsemanticrepresentations AT borghesaniv threetermstaskanopenbenchmarktocomparehumanandartificialsemanticrepresentations AT armozaj threetermstaskanopenbenchmarktocomparehumanandartificialsemanticrepresentations AT hebartmn threetermstaskanopenbenchmarktocomparehumanandartificialsemanticrepresentations AT bellecp threetermstaskanopenbenchmarktocomparehumanandartificialsemanticrepresentations AT brambatism threetermstaskanopenbenchmarktocomparehumanandartificialsemanticrepresentations

The Three Terms Task - an open benchmark to compare human and artificial semantic representations

Ejemplares similares