Cargando…

Marie and BERT—A Knowledge Graph Embedding Based Question Answering System for Chemistry

[Image: see text] This paper presents a novel knowledge graph question answering (KGQA) system for chemistry, which is implemented on hybrid knowledge graph embeddings, aiming to provide fact-oriented information retrieval for chemistry-related research and industrial applications. Unlike other exis...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Xiaochi, Zhang, Shaocong, Agarwal, Mehal, Akroyd, Jethro, Mosbach, Sebastian, Kraft, Markus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10500657/
https://www.ncbi.nlm.nih.gov/pubmed/37720754
http://dx.doi.org/10.1021/acsomega.3c05114
_version_ 1785105954889531392
author Zhou, Xiaochi
Zhang, Shaocong
Agarwal, Mehal
Akroyd, Jethro
Mosbach, Sebastian
Kraft, Markus
author_facet Zhou, Xiaochi
Zhang, Shaocong
Agarwal, Mehal
Akroyd, Jethro
Mosbach, Sebastian
Kraft, Markus
author_sort Zhou, Xiaochi
collection PubMed
description [Image: see text] This paper presents a novel knowledge graph question answering (KGQA) system for chemistry, which is implemented on hybrid knowledge graph embeddings, aiming to provide fact-oriented information retrieval for chemistry-related research and industrial applications. Unlike other existing designs, the system operates on multiple embedding spaces, which use various embedding methods and queries the embedding spaces in parallel. With the answers returned from multiple embedding spaces, the system leverages a score alignment model to adjust the answer scores and rerank the answers. Further, the system implements an algorithm to derive implicit multihop relations to handle the complexities of deep ontologies and improve multihop question answering. The system also implements a BERT-based bidirectional entity-linking model to enhance the robustness and accuracy of the entity-linking module. The system uses a joint numerical embedding model to efficiently handle numerical filtering questions. Further, it can invoke semantic agents to perform dynamic calculations autonomously. Finally, the KGQA system handles numerous chemical reaction mechanisms using semantic parsing supported by a Linked Data Fragment server. This paper evaluates the accuracy of each module within the KGQA system with a chemistry question data set.
format Online
Article
Text
id pubmed-10500657
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-105006572023-09-15 Marie and BERT—A Knowledge Graph Embedding Based Question Answering System for Chemistry Zhou, Xiaochi Zhang, Shaocong Agarwal, Mehal Akroyd, Jethro Mosbach, Sebastian Kraft, Markus ACS Omega [Image: see text] This paper presents a novel knowledge graph question answering (KGQA) system for chemistry, which is implemented on hybrid knowledge graph embeddings, aiming to provide fact-oriented information retrieval for chemistry-related research and industrial applications. Unlike other existing designs, the system operates on multiple embedding spaces, which use various embedding methods and queries the embedding spaces in parallel. With the answers returned from multiple embedding spaces, the system leverages a score alignment model to adjust the answer scores and rerank the answers. Further, the system implements an algorithm to derive implicit multihop relations to handle the complexities of deep ontologies and improve multihop question answering. The system also implements a BERT-based bidirectional entity-linking model to enhance the robustness and accuracy of the entity-linking module. The system uses a joint numerical embedding model to efficiently handle numerical filtering questions. Further, it can invoke semantic agents to perform dynamic calculations autonomously. Finally, the KGQA system handles numerous chemical reaction mechanisms using semantic parsing supported by a Linked Data Fragment server. This paper evaluates the accuracy of each module within the KGQA system with a chemistry question data set. American Chemical Society 2023-08-25 /pmc/articles/PMC10500657/ /pubmed/37720754 http://dx.doi.org/10.1021/acsomega.3c05114 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Zhou, Xiaochi
Zhang, Shaocong
Agarwal, Mehal
Akroyd, Jethro
Mosbach, Sebastian
Kraft, Markus
Marie and BERT—A Knowledge Graph Embedding Based Question Answering System for Chemistry
title Marie and BERT—A Knowledge Graph Embedding Based Question Answering System for Chemistry
title_full Marie and BERT—A Knowledge Graph Embedding Based Question Answering System for Chemistry
title_fullStr Marie and BERT—A Knowledge Graph Embedding Based Question Answering System for Chemistry
title_full_unstemmed Marie and BERT—A Knowledge Graph Embedding Based Question Answering System for Chemistry
title_short Marie and BERT—A Knowledge Graph Embedding Based Question Answering System for Chemistry
title_sort marie and bert—a knowledge graph embedding based question answering system for chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10500657/
https://www.ncbi.nlm.nih.gov/pubmed/37720754
http://dx.doi.org/10.1021/acsomega.3c05114
work_keys_str_mv AT zhouxiaochi marieandbertaknowledgegraphembeddingbasedquestionansweringsystemforchemistry
AT zhangshaocong marieandbertaknowledgegraphembeddingbasedquestionansweringsystemforchemistry
AT agarwalmehal marieandbertaknowledgegraphembeddingbasedquestionansweringsystemforchemistry
AT akroydjethro marieandbertaknowledgegraphembeddingbasedquestionansweringsystemforchemistry
AT mosbachsebastian marieandbertaknowledgegraphembeddingbasedquestionansweringsystemforchemistry
AT kraftmarkus marieandbertaknowledgegraphembeddingbasedquestionansweringsystemforchemistry