Cargando…

Learning linear transformations between counting-based and prediction-based word embeddings

Despite the growing interest in prediction-based word embedding learning methods, it remains unclear as to how the vector spaces learnt by the prediction-based methods differ from that of the counting-based methods, or whether one can be transformed into the other. To study the relationship between...

Descripción completa

Detalles Bibliográficos
Autores principales: Bollegala, Danushka, Hayashi, Kohei, Kawarabayashi, Ken-ichi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5604957/
https://www.ncbi.nlm.nih.gov/pubmed/28926629
http://dx.doi.org/10.1371/journal.pone.0184544
_version_ 1783264930347089920
author Bollegala, Danushka
Hayashi, Kohei
Kawarabayashi, Ken-ichi
author_facet Bollegala, Danushka
Hayashi, Kohei
Kawarabayashi, Ken-ichi
author_sort Bollegala, Danushka
collection PubMed
description Despite the growing interest in prediction-based word embedding learning methods, it remains unclear as to how the vector spaces learnt by the prediction-based methods differ from that of the counting-based methods, or whether one can be transformed into the other. To study the relationship between counting-based and prediction-based embeddings, we propose a method for learning a linear transformation between two given sets of word embeddings. Our proposal contributes to the word embedding learning research in three ways: (a) we propose an efficient method to learn a linear transformation between two sets of word embeddings, (b) using the transformation learnt in (a), we empirically show that it is possible to predict distributed word embeddings for novel unseen words, and (c) empirically it is possible to linearly transform counting-based embeddings to prediction-based embeddings, for frequent words, different POS categories, and varying degrees of ambiguities.
format Online
Article
Text
id pubmed-5604957
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-56049572017-09-28 Learning linear transformations between counting-based and prediction-based word embeddings Bollegala, Danushka Hayashi, Kohei Kawarabayashi, Ken-ichi PLoS One Research Article Despite the growing interest in prediction-based word embedding learning methods, it remains unclear as to how the vector spaces learnt by the prediction-based methods differ from that of the counting-based methods, or whether one can be transformed into the other. To study the relationship between counting-based and prediction-based embeddings, we propose a method for learning a linear transformation between two given sets of word embeddings. Our proposal contributes to the word embedding learning research in three ways: (a) we propose an efficient method to learn a linear transformation between two sets of word embeddings, (b) using the transformation learnt in (a), we empirically show that it is possible to predict distributed word embeddings for novel unseen words, and (c) empirically it is possible to linearly transform counting-based embeddings to prediction-based embeddings, for frequent words, different POS categories, and varying degrees of ambiguities. Public Library of Science 2017-09-19 /pmc/articles/PMC5604957/ /pubmed/28926629 http://dx.doi.org/10.1371/journal.pone.0184544 Text en © 2017 Bollegala et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Bollegala, Danushka
Hayashi, Kohei
Kawarabayashi, Ken-ichi
Learning linear transformations between counting-based and prediction-based word embeddings
title Learning linear transformations between counting-based and prediction-based word embeddings
title_full Learning linear transformations between counting-based and prediction-based word embeddings
title_fullStr Learning linear transformations between counting-based and prediction-based word embeddings
title_full_unstemmed Learning linear transformations between counting-based and prediction-based word embeddings
title_short Learning linear transformations between counting-based and prediction-based word embeddings
title_sort learning linear transformations between counting-based and prediction-based word embeddings
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5604957/
https://www.ncbi.nlm.nih.gov/pubmed/28926629
http://dx.doi.org/10.1371/journal.pone.0184544
work_keys_str_mv AT bollegaladanushka learninglineartransformationsbetweencountingbasedandpredictionbasedwordembeddings
AT hayashikohei learninglineartransformationsbetweencountingbasedandpredictionbasedwordembeddings
AT kawarabayashikenichi learninglineartransformationsbetweencountingbasedandpredictionbasedwordembeddings