Cargando…

Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations

Appropriate indexing of resources is necessary for their efficient search, discovery and utilization. Relying solely on manual effort is time-consuming, costly and error prone. On the other hand, the special nature, volume and broadness of biomedical literature pose barriers for automated methods. W...

Descripción completa

Detalles Bibliográficos
Autores principales: Koutsomitropoulos, Dimitrios A., Andriopoulos, Andreas D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7256379/
http://dx.doi.org/10.1007/978-3-030-49161-1_29
_version_ 1783539895027892224
author Koutsomitropoulos, Dimitrios A.
Andriopoulos, Andreas D.
author_facet Koutsomitropoulos, Dimitrios A.
Andriopoulos, Andreas D.
author_sort Koutsomitropoulos, Dimitrios A.
collection PubMed
description Appropriate indexing of resources is necessary for their efficient search, discovery and utilization. Relying solely on manual effort is time-consuming, costly and error prone. On the other hand, the special nature, volume and broadness of biomedical literature pose barriers for automated methods. We argue that current word embedding algorithms can be efficiently used to support the task of biomedical text classification. Both deep- and shallow network approaches are implemented and evaluated. Large datasets of biomedical citations and full texts are harvested for their metadata and used for training and testing. The ontology representation of Medical Subject Headings provides machine-readable labels and specifies the dimensionality of the problem space. These automated approaches are still far from entirely substituting human experts, yet they can be useful as a mechanism for validation and recommendation. Dataset balancing, distributed processing and training parallelization in GPUs, all play an important part regarding the effectiveness and performance of proposed methods.
format Online
Article
Text
id pubmed-7256379
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72563792020-05-29 Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations Koutsomitropoulos, Dimitrios A. Andriopoulos, Andreas D. Artificial Intelligence Applications and Innovations Article Appropriate indexing of resources is necessary for their efficient search, discovery and utilization. Relying solely on manual effort is time-consuming, costly and error prone. On the other hand, the special nature, volume and broadness of biomedical literature pose barriers for automated methods. We argue that current word embedding algorithms can be efficiently used to support the task of biomedical text classification. Both deep- and shallow network approaches are implemented and evaluated. Large datasets of biomedical citations and full texts are harvested for their metadata and used for training and testing. The ontology representation of Medical Subject Headings provides machine-readable labels and specifies the dimensionality of the problem space. These automated approaches are still far from entirely substituting human experts, yet they can be useful as a mechanism for validation and recommendation. Dataset balancing, distributed processing and training parallelization in GPUs, all play an important part regarding the effectiveness and performance of proposed methods. 2020-05-06 /pmc/articles/PMC7256379/ http://dx.doi.org/10.1007/978-3-030-49161-1_29 Text en © IFIP International Federation for Information Processing 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Koutsomitropoulos, Dimitrios A.
Andriopoulos, Andreas D.
Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations
title Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations
title_full Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations
title_fullStr Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations
title_full_unstemmed Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations
title_short Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations
title_sort automated mesh indexing of biomedical literature using contextualized word representations
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7256379/
http://dx.doi.org/10.1007/978-3-030-49161-1_29
work_keys_str_mv AT koutsomitropoulosdimitriosa automatedmeshindexingofbiomedicalliteratureusingcontextualizedwordrepresentations
AT andriopoulosandreasd automatedmeshindexingofbiomedicalliteratureusingcontextualizedwordrepresentations