Cargando…

Anatomical entity mention recognition at literature scale

Motivation: Anatomical entities ranging from subcellular structures to organ systems are central to biomedical science, and mentions of these entities are essential to understanding the scientific literature. Despite extensive efforts to automatically analyze various aspects of biomedical text, ther...

Descripción completa

Detalles Bibliográficos
Autores principales: Pyysalo, Sampo, Ananiadou, Sophia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3957068/
https://www.ncbi.nlm.nih.gov/pubmed/24162468
http://dx.doi.org/10.1093/bioinformatics/btt580
_version_ 1782307761985421312
author Pyysalo, Sampo
Ananiadou, Sophia
author_facet Pyysalo, Sampo
Ananiadou, Sophia
author_sort Pyysalo, Sampo
collection PubMed
description Motivation: Anatomical entities ranging from subcellular structures to organ systems are central to biomedical science, and mentions of these entities are essential to understanding the scientific literature. Despite extensive efforts to automatically analyze various aspects of biomedical text, there have been only few studies focusing on anatomical entities, and no dedicated methods for learning to automatically recognize anatomical entity mentions in free-form text have been introduced. Results: We present AnatomyTagger, a machine learning-based system for anatomical entity mention recognition. The system incorporates a broad array of approaches proposed to benefit tagging, including the use of Unified Medical Language System (UMLS)- and Open Biomedical Ontologies (OBO)-based lexical resources, word representations induced from unlabeled text, statistical truecasing and non-local features. We train and evaluate the system on a newly introduced corpus that substantially extends on previously available resources, and apply the resulting tagger to automatically annotate the entire open access scientific domain literature. The resulting analyses have been applied to extend services provided by the Europe PubMed Central literature database. Availability and implementation: All tools and resources introduced in this work are available from http://nactem.ac.uk/anatomytagger. Contact: sophia.ananiadou@manchester.ac.uk Supplementary Information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3957068
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-39570682014-03-19 Anatomical entity mention recognition at literature scale Pyysalo, Sampo Ananiadou, Sophia Bioinformatics Original Papers Motivation: Anatomical entities ranging from subcellular structures to organ systems are central to biomedical science, and mentions of these entities are essential to understanding the scientific literature. Despite extensive efforts to automatically analyze various aspects of biomedical text, there have been only few studies focusing on anatomical entities, and no dedicated methods for learning to automatically recognize anatomical entity mentions in free-form text have been introduced. Results: We present AnatomyTagger, a machine learning-based system for anatomical entity mention recognition. The system incorporates a broad array of approaches proposed to benefit tagging, including the use of Unified Medical Language System (UMLS)- and Open Biomedical Ontologies (OBO)-based lexical resources, word representations induced from unlabeled text, statistical truecasing and non-local features. We train and evaluate the system on a newly introduced corpus that substantially extends on previously available resources, and apply the resulting tagger to automatically annotate the entire open access scientific domain literature. The resulting analyses have been applied to extend services provided by the Europe PubMed Central literature database. Availability and implementation: All tools and resources introduced in this work are available from http://nactem.ac.uk/anatomytagger. Contact: sophia.ananiadou@manchester.ac.uk Supplementary Information: Supplementary data are available at Bioinformatics online. Oxford University Press 2014-03-15 2013-10-25 /pmc/articles/PMC3957068/ /pubmed/24162468 http://dx.doi.org/10.1093/bioinformatics/btt580 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Pyysalo, Sampo
Ananiadou, Sophia
Anatomical entity mention recognition at literature scale
title Anatomical entity mention recognition at literature scale
title_full Anatomical entity mention recognition at literature scale
title_fullStr Anatomical entity mention recognition at literature scale
title_full_unstemmed Anatomical entity mention recognition at literature scale
title_short Anatomical entity mention recognition at literature scale
title_sort anatomical entity mention recognition at literature scale
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3957068/
https://www.ncbi.nlm.nih.gov/pubmed/24162468
http://dx.doi.org/10.1093/bioinformatics/btt580
work_keys_str_mv AT pyysalosampo anatomicalentitymentionrecognitionatliteraturescale
AT ananiadousophia anatomicalentitymentionrecognitionatliteraturescale