Cargando…

Argo: an integrative, interactive, text mining-based workbench supporting curation

Curation of biomedical literature is often supported by the automatic analysis of textual content that generally involves a sequence of individual processing components. Text mining (TM) has been used to enhance the process of manual biocuration, but has been focused on specific databases and tasks...

Descripción completa

Detalles Bibliográficos
Autores principales:	Rak, Rafal, Rowley, Andrew, Black, William, Ananiadou, Sophia
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2012
Materias:	Original Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3308166/ https://www.ncbi.nlm.nih.gov/pubmed/22434844 http://dx.doi.org/10.1093/database/bas010

_version_	1782227406416773120
author	Rak, Rafal Rowley, Andrew Black, William Ananiadou, Sophia
author_facet	Rak, Rafal Rowley, Andrew Black, William Ananiadou, Sophia
author_sort	Rak, Rafal
collection	PubMed
description	Curation of biomedical literature is often supported by the automatic analysis of textual content that generally involves a sequence of individual processing components. Text mining (TM) has been used to enhance the process of manual biocuration, but has been focused on specific databases and tasks rather than an environment integrating TM tools into the curation pipeline, catering for a variety of tasks, types of information and applications. Processing components usually come from different sources and often lack interoperability. The well established Unstructured Information Management Architecture is a framework that addresses interoperability by defining common data structures and interfaces. However, most of the efforts are targeted towards software developers and are not suitable for curators, or are otherwise inconvenient to use on a higher level of abstraction. To overcome these issues we introduce Argo, an interoperable, integrative, interactive and collaborative system for text analysis with a convenient graphic user interface to ease the development of processing workflows and boost productivity in labour-intensive manual curation. Robust, scalable text analytics follow a modular approach, adopting component modules for distinct levels of text analysis. The user interface is available entirely through a web browser that saves the user from going through often complicated and platform-dependent installation procedures. Argo comes with a predefined set of processing components commonly used in text analysis, while giving the users the ability to deposit their own components. The system accommodates various areas and levels of user expertise, from TM and computational linguistics to ontology-based curation. One of the key functionalities of Argo is its ability to seamlessly incorporate user-interactive components, such as manual annotation editors, into otherwise completely automatic pipelines. As a use case, we demonstrate the functionality of an in-built manual annotation editor that is well suited for in-text corpus annotation tasks. Database URL: http://www.nactem.ac.uk/Argo
format	Online Article Text
id	pubmed-3308166
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-33081662012-03-20 Argo: an integrative, interactive, text mining-based workbench supporting curation Rak, Rafal Rowley, Andrew Black, William Ananiadou, Sophia Database (Oxford) Original Articles Curation of biomedical literature is often supported by the automatic analysis of textual content that generally involves a sequence of individual processing components. Text mining (TM) has been used to enhance the process of manual biocuration, but has been focused on specific databases and tasks rather than an environment integrating TM tools into the curation pipeline, catering for a variety of tasks, types of information and applications. Processing components usually come from different sources and often lack interoperability. The well established Unstructured Information Management Architecture is a framework that addresses interoperability by defining common data structures and interfaces. However, most of the efforts are targeted towards software developers and are not suitable for curators, or are otherwise inconvenient to use on a higher level of abstraction. To overcome these issues we introduce Argo, an interoperable, integrative, interactive and collaborative system for text analysis with a convenient graphic user interface to ease the development of processing workflows and boost productivity in labour-intensive manual curation. Robust, scalable text analytics follow a modular approach, adopting component modules for distinct levels of text analysis. The user interface is available entirely through a web browser that saves the user from going through often complicated and platform-dependent installation procedures. Argo comes with a predefined set of processing components commonly used in text analysis, while giving the users the ability to deposit their own components. The system accommodates various areas and levels of user expertise, from TM and computational linguistics to ontology-based curation. One of the key functionalities of Argo is its ability to seamlessly incorporate user-interactive components, such as manual annotation editors, into otherwise completely automatic pipelines. As a use case, we demonstrate the functionality of an in-built manual annotation editor that is well suited for in-text corpus annotation tasks. Database URL: http://www.nactem.ac.uk/Argo Oxford University Press 2012-02-13 /pmc/articles/PMC3308166/ /pubmed/22434844 http://dx.doi.org/10.1093/database/bas010 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Articles Rak, Rafal Rowley, Andrew Black, William Ananiadou, Sophia Argo: an integrative, interactive, text mining-based workbench supporting curation
title	Argo: an integrative, interactive, text mining-based workbench supporting curation
title_full	Argo: an integrative, interactive, text mining-based workbench supporting curation
title_fullStr	Argo: an integrative, interactive, text mining-based workbench supporting curation
title_full_unstemmed	Argo: an integrative, interactive, text mining-based workbench supporting curation
title_short	Argo: an integrative, interactive, text mining-based workbench supporting curation
title_sort	argo: an integrative, interactive, text mining-based workbench supporting curation
topic	Original Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3308166/ https://www.ncbi.nlm.nih.gov/pubmed/22434844 http://dx.doi.org/10.1093/database/bas010
work_keys_str_mv	AT rakrafal argoanintegrativeinteractivetextminingbasedworkbenchsupportingcuration AT rowleyandrew argoanintegrativeinteractivetextminingbasedworkbenchsupportingcuration AT blackwilliam argoanintegrativeinteractivetextminingbasedworkbenchsupportingcuration AT ananiadousophia argoanintegrativeinteractivetextminingbasedworkbenchsupportingcuration

Argo: an integrative, interactive, text mining-based workbench supporting curation

Ejemplares similares