Cargando…
Learning linear transformations between counting-based and prediction-based word embeddings
Despite the growing interest in prediction-based word embedding learning methods, it remains unclear as to how the vector spaces learnt by the prediction-based methods differ from that of the counting-based methods, or whether one can be transformed into the other. To study the relationship between...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5604957/ https://www.ncbi.nlm.nih.gov/pubmed/28926629 http://dx.doi.org/10.1371/journal.pone.0184544 |
_version_ | 1783264930347089920 |
---|---|
author | Bollegala, Danushka Hayashi, Kohei Kawarabayashi, Ken-ichi |
author_facet | Bollegala, Danushka Hayashi, Kohei Kawarabayashi, Ken-ichi |
author_sort | Bollegala, Danushka |
collection | PubMed |
description | Despite the growing interest in prediction-based word embedding learning methods, it remains unclear as to how the vector spaces learnt by the prediction-based methods differ from that of the counting-based methods, or whether one can be transformed into the other. To study the relationship between counting-based and prediction-based embeddings, we propose a method for learning a linear transformation between two given sets of word embeddings. Our proposal contributes to the word embedding learning research in three ways: (a) we propose an efficient method to learn a linear transformation between two sets of word embeddings, (b) using the transformation learnt in (a), we empirically show that it is possible to predict distributed word embeddings for novel unseen words, and (c) empirically it is possible to linearly transform counting-based embeddings to prediction-based embeddings, for frequent words, different POS categories, and varying degrees of ambiguities. |
format | Online Article Text |
id | pubmed-5604957 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-56049572017-09-28 Learning linear transformations between counting-based and prediction-based word embeddings Bollegala, Danushka Hayashi, Kohei Kawarabayashi, Ken-ichi PLoS One Research Article Despite the growing interest in prediction-based word embedding learning methods, it remains unclear as to how the vector spaces learnt by the prediction-based methods differ from that of the counting-based methods, or whether one can be transformed into the other. To study the relationship between counting-based and prediction-based embeddings, we propose a method for learning a linear transformation between two given sets of word embeddings. Our proposal contributes to the word embedding learning research in three ways: (a) we propose an efficient method to learn a linear transformation between two sets of word embeddings, (b) using the transformation learnt in (a), we empirically show that it is possible to predict distributed word embeddings for novel unseen words, and (c) empirically it is possible to linearly transform counting-based embeddings to prediction-based embeddings, for frequent words, different POS categories, and varying degrees of ambiguities. Public Library of Science 2017-09-19 /pmc/articles/PMC5604957/ /pubmed/28926629 http://dx.doi.org/10.1371/journal.pone.0184544 Text en © 2017 Bollegala et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Bollegala, Danushka Hayashi, Kohei Kawarabayashi, Ken-ichi Learning linear transformations between counting-based and prediction-based word embeddings |
title | Learning linear transformations between counting-based and prediction-based word embeddings |
title_full | Learning linear transformations between counting-based and prediction-based word embeddings |
title_fullStr | Learning linear transformations between counting-based and prediction-based word embeddings |
title_full_unstemmed | Learning linear transformations between counting-based and prediction-based word embeddings |
title_short | Learning linear transformations between counting-based and prediction-based word embeddings |
title_sort | learning linear transformations between counting-based and prediction-based word embeddings |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5604957/ https://www.ncbi.nlm.nih.gov/pubmed/28926629 http://dx.doi.org/10.1371/journal.pone.0184544 |
work_keys_str_mv | AT bollegaladanushka learninglineartransformationsbetweencountingbasedandpredictionbasedwordembeddings AT hayashikohei learninglineartransformationsbetweencountingbasedandpredictionbasedwordembeddings AT kawarabayashikenichi learninglineartransformationsbetweencountingbasedandpredictionbasedwordembeddings |