Cargando…

Extracting relations from texts using vector language models and a neural network classifier

The article investigates the possibility of identifying the presence of SKOS (Simple Knowledge Organization System) relations between concepts represented by terms on the base of their vector representation in general natural language models. Several language models of the Word2Vec and GloVe familie...

Descripción completa

Detalles Bibliográficos
Autores principales: Shishaev, Maksim, Dikovitsky, Vladimir, Pimeshkov, Vadim, Kuprikov, Nikita, Kuprikov, Mikhail, Shkodyrev, Viacheslav
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588709/
https://www.ncbi.nlm.nih.gov/pubmed/37869460
http://dx.doi.org/10.7717/peerj-cs.1636
_version_ 1785123637319172096
author Shishaev, Maksim
Dikovitsky, Vladimir
Pimeshkov, Vadim
Kuprikov, Nikita
Kuprikov, Mikhail
Shkodyrev, Viacheslav
author_facet Shishaev, Maksim
Dikovitsky, Vladimir
Pimeshkov, Vadim
Kuprikov, Nikita
Kuprikov, Mikhail
Shkodyrev, Viacheslav
author_sort Shishaev, Maksim
collection PubMed
description The article investigates the possibility of identifying the presence of SKOS (Simple Knowledge Organization System) relations between concepts represented by terms on the base of their vector representation in general natural language models. Several language models of the Word2Vec and GloVe families are considered, on the basis of which an artificial neural network (ANN) classifier of SKOS relations is formed. To train and test the efficiency of the classifier, datasets formed on the basis of the DBPedia and EuroVoc thesauri are used. The experiments performed have shown the high efficiency of the classifier trained using GloVe family models, while training it with use of Word2Vec models looks impossible in the bounds of considered ANN-based classifier architecture. Based on the results, a conclusion is made about the key role of taking into account the global context of the use of terms in the text for the possibility of identifying SKOS relations.
format Online
Article
Text
id pubmed-10588709
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-105887092023-10-21 Extracting relations from texts using vector language models and a neural network classifier Shishaev, Maksim Dikovitsky, Vladimir Pimeshkov, Vadim Kuprikov, Nikita Kuprikov, Mikhail Shkodyrev, Viacheslav PeerJ Comput Sci Artificial Intelligence The article investigates the possibility of identifying the presence of SKOS (Simple Knowledge Organization System) relations between concepts represented by terms on the base of their vector representation in general natural language models. Several language models of the Word2Vec and GloVe families are considered, on the basis of which an artificial neural network (ANN) classifier of SKOS relations is formed. To train and test the efficiency of the classifier, datasets formed on the basis of the DBPedia and EuroVoc thesauri are used. The experiments performed have shown the high efficiency of the classifier trained using GloVe family models, while training it with use of Word2Vec models looks impossible in the bounds of considered ANN-based classifier architecture. Based on the results, a conclusion is made about the key role of taking into account the global context of the use of terms in the text for the possibility of identifying SKOS relations. PeerJ Inc. 2023-10-11 /pmc/articles/PMC10588709/ /pubmed/37869460 http://dx.doi.org/10.7717/peerj-cs.1636 Text en ©2023 Shishaev et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Artificial Intelligence
Shishaev, Maksim
Dikovitsky, Vladimir
Pimeshkov, Vadim
Kuprikov, Nikita
Kuprikov, Mikhail
Shkodyrev, Viacheslav
Extracting relations from texts using vector language models and a neural network classifier
title Extracting relations from texts using vector language models and a neural network classifier
title_full Extracting relations from texts using vector language models and a neural network classifier
title_fullStr Extracting relations from texts using vector language models and a neural network classifier
title_full_unstemmed Extracting relations from texts using vector language models and a neural network classifier
title_short Extracting relations from texts using vector language models and a neural network classifier
title_sort extracting relations from texts using vector language models and a neural network classifier
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588709/
https://www.ncbi.nlm.nih.gov/pubmed/37869460
http://dx.doi.org/10.7717/peerj-cs.1636
work_keys_str_mv AT shishaevmaksim extractingrelationsfromtextsusingvectorlanguagemodelsandaneuralnetworkclassifier
AT dikovitskyvladimir extractingrelationsfromtextsusingvectorlanguagemodelsandaneuralnetworkclassifier
AT pimeshkovvadim extractingrelationsfromtextsusingvectorlanguagemodelsandaneuralnetworkclassifier
AT kuprikovnikita extractingrelationsfromtextsusingvectorlanguagemodelsandaneuralnetworkclassifier
AT kuprikovmikhail extractingrelationsfromtextsusingvectorlanguagemodelsandaneuralnetworkclassifier
AT shkodyrevviacheslav extractingrelationsfromtextsusingvectorlanguagemodelsandaneuralnetworkclassifier