Cargando…
Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations
Appropriate indexing of resources is necessary for their efficient search, discovery and utilization. Relying solely on manual effort is time-consuming, costly and error prone. On the other hand, the special nature, volume and broadness of biomedical literature pose barriers for automated methods. W...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7256379/ http://dx.doi.org/10.1007/978-3-030-49161-1_29 |
_version_ | 1783539895027892224 |
---|---|
author | Koutsomitropoulos, Dimitrios A. Andriopoulos, Andreas D. |
author_facet | Koutsomitropoulos, Dimitrios A. Andriopoulos, Andreas D. |
author_sort | Koutsomitropoulos, Dimitrios A. |
collection | PubMed |
description | Appropriate indexing of resources is necessary for their efficient search, discovery and utilization. Relying solely on manual effort is time-consuming, costly and error prone. On the other hand, the special nature, volume and broadness of biomedical literature pose barriers for automated methods. We argue that current word embedding algorithms can be efficiently used to support the task of biomedical text classification. Both deep- and shallow network approaches are implemented and evaluated. Large datasets of biomedical citations and full texts are harvested for their metadata and used for training and testing. The ontology representation of Medical Subject Headings provides machine-readable labels and specifies the dimensionality of the problem space. These automated approaches are still far from entirely substituting human experts, yet they can be useful as a mechanism for validation and recommendation. Dataset balancing, distributed processing and training parallelization in GPUs, all play an important part regarding the effectiveness and performance of proposed methods. |
format | Online Article Text |
id | pubmed-7256379 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-72563792020-05-29 Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations Koutsomitropoulos, Dimitrios A. Andriopoulos, Andreas D. Artificial Intelligence Applications and Innovations Article Appropriate indexing of resources is necessary for their efficient search, discovery and utilization. Relying solely on manual effort is time-consuming, costly and error prone. On the other hand, the special nature, volume and broadness of biomedical literature pose barriers for automated methods. We argue that current word embedding algorithms can be efficiently used to support the task of biomedical text classification. Both deep- and shallow network approaches are implemented and evaluated. Large datasets of biomedical citations and full texts are harvested for their metadata and used for training and testing. The ontology representation of Medical Subject Headings provides machine-readable labels and specifies the dimensionality of the problem space. These automated approaches are still far from entirely substituting human experts, yet they can be useful as a mechanism for validation and recommendation. Dataset balancing, distributed processing and training parallelization in GPUs, all play an important part regarding the effectiveness and performance of proposed methods. 2020-05-06 /pmc/articles/PMC7256379/ http://dx.doi.org/10.1007/978-3-030-49161-1_29 Text en © IFIP International Federation for Information Processing 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Koutsomitropoulos, Dimitrios A. Andriopoulos, Andreas D. Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations |
title | Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations |
title_full | Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations |
title_fullStr | Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations |
title_full_unstemmed | Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations |
title_short | Automated MeSH Indexing of Biomedical Literature Using Contextualized Word Representations |
title_sort | automated mesh indexing of biomedical literature using contextualized word representations |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7256379/ http://dx.doi.org/10.1007/978-3-030-49161-1_29 |
work_keys_str_mv | AT koutsomitropoulosdimitriosa automatedmeshindexingofbiomedicalliteratureusingcontextualizedwordrepresentations AT andriopoulosandreasd automatedmeshindexingofbiomedicalliteratureusingcontextualizedwordrepresentations |