Cargando…

Funding map using paragraph embedding based on semantic diversity

Maps of science representing the structure of science can help us understand science and technology (S&T) development. Studies have thus developed techniques for analyzing research activities’ relationships; however, ongoing research projects and recently published papers have difficulty in appl...

Descripción completa

Detalles Bibliográficos
Autores principales: Kawamura, Takahiro, Watanabe, Katsutaro, Matsumoto, Naoya, Egami, Shusaku, Jibu, Mari
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6096681/
https://www.ncbi.nlm.nih.gov/pubmed/30147200
http://dx.doi.org/10.1007/s11192-018-2783-x
_version_ 1783348150564552704
author Kawamura, Takahiro
Watanabe, Katsutaro
Matsumoto, Naoya
Egami, Shusaku
Jibu, Mari
author_facet Kawamura, Takahiro
Watanabe, Katsutaro
Matsumoto, Naoya
Egami, Shusaku
Jibu, Mari
author_sort Kawamura, Takahiro
collection PubMed
description Maps of science representing the structure of science can help us understand science and technology (S&T) development. Studies have thus developed techniques for analyzing research activities’ relationships; however, ongoing research projects and recently published papers have difficulty in applying inter-citation and co-citation analysis. Therefore, in order to characterize what is currently being attempted in the scientific landscape, this paper proposes a new content-based method of locating research projects in a multi-dimensional space using the recent word/paragraph embedding techniques. Specifically, for addressing an unclustered problem associated with the original paragraph vectors, we introduce paragraph vectors based on the information entropies of concepts in an S&T thesaurus. The experimental results show that the proposed method successfully formed a clustered map from 25,607 project descriptions of the 7th Framework Programme of EU from 2006 to 2016 and 34,192 project descriptions of the National Science Foundation from 2012 to 2016.
format Online
Article
Text
id pubmed-6096681
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-60966812018-08-24 Funding map using paragraph embedding based on semantic diversity Kawamura, Takahiro Watanabe, Katsutaro Matsumoto, Naoya Egami, Shusaku Jibu, Mari Scientometrics Article Maps of science representing the structure of science can help us understand science and technology (S&T) development. Studies have thus developed techniques for analyzing research activities’ relationships; however, ongoing research projects and recently published papers have difficulty in applying inter-citation and co-citation analysis. Therefore, in order to characterize what is currently being attempted in the scientific landscape, this paper proposes a new content-based method of locating research projects in a multi-dimensional space using the recent word/paragraph embedding techniques. Specifically, for addressing an unclustered problem associated with the original paragraph vectors, we introduce paragraph vectors based on the information entropies of concepts in an S&T thesaurus. The experimental results show that the proposed method successfully formed a clustered map from 25,607 project descriptions of the 7th Framework Programme of EU from 2006 to 2016 and 34,192 project descriptions of the National Science Foundation from 2012 to 2016. Springer International Publishing 2018-05-28 2018 /pmc/articles/PMC6096681/ /pubmed/30147200 http://dx.doi.org/10.1007/s11192-018-2783-x Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Article
Kawamura, Takahiro
Watanabe, Katsutaro
Matsumoto, Naoya
Egami, Shusaku
Jibu, Mari
Funding map using paragraph embedding based on semantic diversity
title Funding map using paragraph embedding based on semantic diversity
title_full Funding map using paragraph embedding based on semantic diversity
title_fullStr Funding map using paragraph embedding based on semantic diversity
title_full_unstemmed Funding map using paragraph embedding based on semantic diversity
title_short Funding map using paragraph embedding based on semantic diversity
title_sort funding map using paragraph embedding based on semantic diversity
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6096681/
https://www.ncbi.nlm.nih.gov/pubmed/30147200
http://dx.doi.org/10.1007/s11192-018-2783-x
work_keys_str_mv AT kawamuratakahiro fundingmapusingparagraphembeddingbasedonsemanticdiversity
AT watanabekatsutaro fundingmapusingparagraphembeddingbasedonsemanticdiversity
AT matsumotonaoya fundingmapusingparagraphembeddingbasedonsemanticdiversity
AT egamishusaku fundingmapusingparagraphembeddingbasedonsemanticdiversity
AT jibumari fundingmapusingparagraphembeddingbasedonsemanticdiversity