Cargando…

Markup: A Web-Based Annotation Tool Powered by Active Learning

Across various domains, such as health and social care, law, news, and social media, there are increasing quantities of unstructured texts being produced. These potential data sources often contain rich information that could be used for domain-specific and research purposes. However, the unstructur...

Descripción completa

Detalles Bibliográficos
Autores principales: Dobbie, Samuel, Strafford, Huw, Pickrell, W. Owen, Fonferko-Shadrach, Beata, Jones, Carys, Akbari, Ashley, Thompson, Simon, Lacey, Arron
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8521860/
https://www.ncbi.nlm.nih.gov/pubmed/34713086
http://dx.doi.org/10.3389/fdgth.2021.598916
_version_ 1784584974240841728
author Dobbie, Samuel
Strafford, Huw
Pickrell, W. Owen
Fonferko-Shadrach, Beata
Jones, Carys
Akbari, Ashley
Thompson, Simon
Lacey, Arron
author_facet Dobbie, Samuel
Strafford, Huw
Pickrell, W. Owen
Fonferko-Shadrach, Beata
Jones, Carys
Akbari, Ashley
Thompson, Simon
Lacey, Arron
author_sort Dobbie, Samuel
collection PubMed
description Across various domains, such as health and social care, law, news, and social media, there are increasing quantities of unstructured texts being produced. These potential data sources often contain rich information that could be used for domain-specific and research purposes. However, the unstructured nature of free-text data poses a significant challenge for its utilisation due to the necessity of substantial manual intervention from domain-experts to label embedded information. Annotation tools can assist with this process by providing functionality that enables the accurate capture and transformation of unstructured texts into structured annotations, which can be used individually, or as part of larger Natural Language Processing (NLP) pipelines. We present Markup (https://www.getmarkup.com/) an open-source, web-based annotation tool that is undergoing continued development for use across all domains. Markup incorporates NLP and Active Learning (AL) technologies to enable rapid and accurate annotation using custom user configurations, predictive annotation suggestions, and automated mapping suggestions to both domain-specific ontologies, such as the Unified Medical Language System (UMLS), and custom, user-defined ontologies. We demonstrate a real-world use case of how Markup has been used in a healthcare setting to annotate structured information from unstructured clinic letters, where captured annotations were used to build and test NLP applications.
format Online
Article
Text
id pubmed-8521860
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-85218602021-10-27 Markup: A Web-Based Annotation Tool Powered by Active Learning Dobbie, Samuel Strafford, Huw Pickrell, W. Owen Fonferko-Shadrach, Beata Jones, Carys Akbari, Ashley Thompson, Simon Lacey, Arron Front Digit Health Digital Health Across various domains, such as health and social care, law, news, and social media, there are increasing quantities of unstructured texts being produced. These potential data sources often contain rich information that could be used for domain-specific and research purposes. However, the unstructured nature of free-text data poses a significant challenge for its utilisation due to the necessity of substantial manual intervention from domain-experts to label embedded information. Annotation tools can assist with this process by providing functionality that enables the accurate capture and transformation of unstructured texts into structured annotations, which can be used individually, or as part of larger Natural Language Processing (NLP) pipelines. We present Markup (https://www.getmarkup.com/) an open-source, web-based annotation tool that is undergoing continued development for use across all domains. Markup incorporates NLP and Active Learning (AL) technologies to enable rapid and accurate annotation using custom user configurations, predictive annotation suggestions, and automated mapping suggestions to both domain-specific ontologies, such as the Unified Medical Language System (UMLS), and custom, user-defined ontologies. We demonstrate a real-world use case of how Markup has been used in a healthcare setting to annotate structured information from unstructured clinic letters, where captured annotations were used to build and test NLP applications. Frontiers Media S.A. 2021-07-26 /pmc/articles/PMC8521860/ /pubmed/34713086 http://dx.doi.org/10.3389/fdgth.2021.598916 Text en Copyright © 2021 Dobbie, Strafford, Pickrell, Fonferko-Shadrach, Jones, Akbari, Thompson and Lacey. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Digital Health
Dobbie, Samuel
Strafford, Huw
Pickrell, W. Owen
Fonferko-Shadrach, Beata
Jones, Carys
Akbari, Ashley
Thompson, Simon
Lacey, Arron
Markup: A Web-Based Annotation Tool Powered by Active Learning
title Markup: A Web-Based Annotation Tool Powered by Active Learning
title_full Markup: A Web-Based Annotation Tool Powered by Active Learning
title_fullStr Markup: A Web-Based Annotation Tool Powered by Active Learning
title_full_unstemmed Markup: A Web-Based Annotation Tool Powered by Active Learning
title_short Markup: A Web-Based Annotation Tool Powered by Active Learning
title_sort markup: a web-based annotation tool powered by active learning
topic Digital Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8521860/
https://www.ncbi.nlm.nih.gov/pubmed/34713086
http://dx.doi.org/10.3389/fdgth.2021.598916
work_keys_str_mv AT dobbiesamuel markupawebbasedannotationtoolpoweredbyactivelearning
AT straffordhuw markupawebbasedannotationtoolpoweredbyactivelearning
AT pickrellwowen markupawebbasedannotationtoolpoweredbyactivelearning
AT fonferkoshadrachbeata markupawebbasedannotationtoolpoweredbyactivelearning
AT jonescarys markupawebbasedannotationtoolpoweredbyactivelearning
AT akbariashley markupawebbasedannotationtoolpoweredbyactivelearning
AT thompsonsimon markupawebbasedannotationtoolpoweredbyactivelearning
AT laceyarron markupawebbasedannotationtoolpoweredbyactivelearning