The New Version of the ANDDigest Tool with Improved AI-Based Short Names Recognition
The body of scientific literature continues to grow annually. Over 1.5 million abstracts of biomedical publications were added to the PubMed database in 2021. Therefore, developing cognitive systems that provide a specialized search for information in scientific publications based on subject area on...
Main authors: | Ivanisenko, Timofey V.; Demenkov, Pavel S.; Kolchanov, Nikolay A.; Ivanisenko, Vladimir A. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2022 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9738852/ https://www.ncbi.nlm.nih.gov/pubmed/36499269 http://dx.doi.org/10.3390/ijms232314934 |
_version_ | 1784847652967415808 |
---|---|
author | Ivanisenko, Timofey V. Demenkov, Pavel S. Kolchanov, Nikolay A. Ivanisenko, Vladimir A. |
author_facet | Ivanisenko, Timofey V. Demenkov, Pavel S. Kolchanov, Nikolay A. Ivanisenko, Vladimir A. |
author_sort | Ivanisenko, Timofey V. |
collection | PubMed |
description | The body of scientific literature continues to grow annually. Over 1.5 million abstracts of biomedical publications were added to the PubMed database in 2021. Therefore, developing cognitive systems that provide a specialized search for information in scientific publications based on subject area ontology and modern artificial intelligence methods is urgently needed. We previously developed a web-based information retrieval system, ANDDigest, designed to search and analyze information in the PubMed database using a customized domain ontology. This paper presents an improved ANDDigest version that uses fine-tuned PubMedBERT classifiers to enhance the quality of short name recognition for molecular-genetics entities in PubMed abstracts on eight biological object types: cell components, diseases, side effects, genes, proteins, pathways, drugs, and metabolites. This approach increased average short name recognition accuracy by 13%. |
format | Online Article Text |
id | pubmed-9738852 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-9738852 2022-12-11 The New Version of the ANDDigest Tool with Improved AI-Based Short Names Recognition Ivanisenko, Timofey V. Demenkov, Pavel S. Kolchanov, Nikolay A. Ivanisenko, Vladimir A. Int J Mol Sci Article The body of scientific literature continues to grow annually. Over 1.5 million abstracts of biomedical publications were added to the PubMed database in 2021. Therefore, developing cognitive systems that provide a specialized search for information in scientific publications based on subject area ontology and modern artificial intelligence methods is urgently needed. We previously developed a web-based information retrieval system, ANDDigest, designed to search and analyze information in the PubMed database using a customized domain ontology. This paper presents an improved ANDDigest version that uses fine-tuned PubMedBERT classifiers to enhance the quality of short name recognition for molecular-genetics entities in PubMed abstracts on eight biological object types: cell components, diseases, side effects, genes, proteins, pathways, drugs, and metabolites. This approach increased average short name recognition accuracy by 13%. MDPI 2022-11-29 /pmc/articles/PMC9738852/ /pubmed/36499269 http://dx.doi.org/10.3390/ijms232314934 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Ivanisenko, Timofey V. Demenkov, Pavel S. Kolchanov, Nikolay A. Ivanisenko, Vladimir A. The New Version of the ANDDigest Tool with Improved AI-Based Short Names Recognition |
title | The New Version of the ANDDigest Tool with Improved AI-Based Short Names Recognition |
title_full | The New Version of the ANDDigest Tool with Improved AI-Based Short Names Recognition |
title_fullStr | The New Version of the ANDDigest Tool with Improved AI-Based Short Names Recognition |
title_full_unstemmed | The New Version of the ANDDigest Tool with Improved AI-Based Short Names Recognition |
title_short | The New Version of the ANDDigest Tool with Improved AI-Based Short Names Recognition |
title_sort | new version of the anddigest tool with improved ai-based short names recognition |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9738852/ https://www.ncbi.nlm.nih.gov/pubmed/36499269 http://dx.doi.org/10.3390/ijms232314934 |