Cargando…

Confusion2Vec: towards enriching vector space word representations with representational ambiguities

Word vector representations are a crucial part of natural language processing (NLP) and human computer interaction. In this paper, we propose a novel word vector representation, Confusion2Vec, motivated from the human speech production and perception that encodes representational ambiguity. Humans e...

Descripción completa

Detalles Bibliográficos
Autores principales:	Gurunath Shivakumar, Prashanth, Georgiou, Panayiotis
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	PeerJ Inc. 2019
Materias:	Artificial Intelligence
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924673/ https://www.ncbi.nlm.nih.gov/pubmed/33816848 http://dx.doi.org/10.7717/peerj-cs.195

_version_	1783659138370240512
author	Gurunath Shivakumar, Prashanth Georgiou, Panayiotis
author_facet	Gurunath Shivakumar, Prashanth Georgiou, Panayiotis
author_sort	Gurunath Shivakumar, Prashanth
collection	PubMed
description	Word vector representations are a crucial part of natural language processing (NLP) and human computer interaction. In this paper, we propose a novel word vector representation, Confusion2Vec, motivated from the human speech production and perception that encodes representational ambiguity. Humans employ both acoustic similarity cues and contextual cues to decode information and we focus on a model that incorporates both sources of information. The representational ambiguity of acoustics, which manifests itself in word confusions, is often resolved by both humans and machines through contextual cues. A range of representational ambiguities can emerge in various domains further to acoustic perception, such as morphological transformations, word segmentation, paraphrasing for NLP tasks like machine translation, etc. In this work, we present a case study in application to automatic speech recognition (ASR) task, where the word representational ambiguities/confusions are related to acoustic similarity. We present several techniques to train an acoustic perceptual similarity representation ambiguity. We term this Confusion2Vec and learn on unsupervised-generated data from ASR confusion networks or lattice-like structures. Appropriate evaluations for the Confusion2Vec are formulated for gauging acoustic similarity in addition to semantic–syntactic and word similarity evaluations. The Confusion2Vec is able to model word confusions efficiently, without compromising on the semantic-syntactic word relations, thus effectively enriching the word vector space with extra task relevant ambiguity information. We provide an intuitive exploration of the two-dimensional Confusion2Vec space using principal component analysis of the embedding and relate to semantic relationships, syntactic relationships, and acoustic relationships. We show through this that the new space preserves the semantic/syntactic relationships while robustly encoding acoustic similarities. The potential of the new vector representation and its ability in the utilization of uncertainty information associated with the lattice is demonstrated through small examples relating to the task of ASR error correction.
format	Online Article Text
id	pubmed-7924673
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	PeerJ Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-79246732021-04-02 Confusion2Vec: towards enriching vector space word representations with representational ambiguities Gurunath Shivakumar, Prashanth Georgiou, Panayiotis PeerJ Comput Sci Artificial Intelligence Word vector representations are a crucial part of natural language processing (NLP) and human computer interaction. In this paper, we propose a novel word vector representation, Confusion2Vec, motivated from the human speech production and perception that encodes representational ambiguity. Humans employ both acoustic similarity cues and contextual cues to decode information and we focus on a model that incorporates both sources of information. The representational ambiguity of acoustics, which manifests itself in word confusions, is often resolved by both humans and machines through contextual cues. A range of representational ambiguities can emerge in various domains further to acoustic perception, such as morphological transformations, word segmentation, paraphrasing for NLP tasks like machine translation, etc. In this work, we present a case study in application to automatic speech recognition (ASR) task, where the word representational ambiguities/confusions are related to acoustic similarity. We present several techniques to train an acoustic perceptual similarity representation ambiguity. We term this Confusion2Vec and learn on unsupervised-generated data from ASR confusion networks or lattice-like structures. Appropriate evaluations for the Confusion2Vec are formulated for gauging acoustic similarity in addition to semantic–syntactic and word similarity evaluations. The Confusion2Vec is able to model word confusions efficiently, without compromising on the semantic-syntactic word relations, thus effectively enriching the word vector space with extra task relevant ambiguity information. We provide an intuitive exploration of the two-dimensional Confusion2Vec space using principal component analysis of the embedding and relate to semantic relationships, syntactic relationships, and acoustic relationships. We show through this that the new space preserves the semantic/syntactic relationships while robustly encoding acoustic similarities. The potential of the new vector representation and its ability in the utilization of uncertainty information associated with the lattice is demonstrated through small examples relating to the task of ASR error correction. PeerJ Inc. 2019-06-10 /pmc/articles/PMC7924673/ /pubmed/33816848 http://dx.doi.org/10.7717/peerj-cs.195 Text en © 2019 Gurunath Shivakumar and Georgiou http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle	Artificial Intelligence Gurunath Shivakumar, Prashanth Georgiou, Panayiotis Confusion2Vec: towards enriching vector space word representations with representational ambiguities
title	Confusion2Vec: towards enriching vector space word representations with representational ambiguities
title_full	Confusion2Vec: towards enriching vector space word representations with representational ambiguities
title_fullStr	Confusion2Vec: towards enriching vector space word representations with representational ambiguities
title_full_unstemmed	Confusion2Vec: towards enriching vector space word representations with representational ambiguities
title_short	Confusion2Vec: towards enriching vector space word representations with representational ambiguities
title_sort	confusion2vec: towards enriching vector space word representations with representational ambiguities
topic	Artificial Intelligence
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924673/ https://www.ncbi.nlm.nih.gov/pubmed/33816848 http://dx.doi.org/10.7717/peerj-cs.195
work_keys_str_mv	AT gurunathshivakumarprashanth confusion2vectowardsenrichingvectorspacewordrepresentationswithrepresentationalambiguities AT georgioupanayiotis confusion2vectowardsenrichingvectorspacewordrepresentationswithrepresentationalambiguities

Confusion2Vec: towards enriching vector space word representations with representational ambiguities

Ejemplares similares