Cargando…

EXTRACT: interactive extraction of environment metadata and term suggestion for metagenomic sample annotation

The microbial and molecular ecology research communities have made substantial progress on developing standards for annotating samples with environment metadata. However, sample manual annotation is a highly labor intensive process and requires familiarity with the terminologies used. We have theref...

Descripción completa

Detalles Bibliográficos
Autores principales: Pafilis, Evangelos, Buttigieg, Pier Luigi, Ferrell, Barbra, Pereira, Emiliano, Schnetzer, Julia, Arvanitidis, Christos, Jensen, Lars Juhl
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4761108/
https://www.ncbi.nlm.nih.gov/pubmed/26896844
http://dx.doi.org/10.1093/database/baw005
_version_ 1782416928271237120
author Pafilis, Evangelos
Buttigieg, Pier Luigi
Ferrell, Barbra
Pereira, Emiliano
Schnetzer, Julia
Arvanitidis, Christos
Jensen, Lars Juhl
author_facet Pafilis, Evangelos
Buttigieg, Pier Luigi
Ferrell, Barbra
Pereira, Emiliano
Schnetzer, Julia
Arvanitidis, Christos
Jensen, Lars Juhl
author_sort Pafilis, Evangelos
collection PubMed
description The microbial and molecular ecology research communities have made substantial progress on developing standards for annotating samples with environment metadata. However, sample manual annotation is a highly labor intensive process and requires familiarity with the terminologies used. We have therefore developed an interactive annotation tool, EXTRACT, which helps curators identify and extract standard-compliant terms for annotation of metagenomic records and other samples. Behind its web-based user interface, the system combines published methods for named entity recognition of environment, organism, tissue and disease terms. The evaluators in the BioCreative V Interactive Annotation Task found the system to be intuitive, useful, well documented and sufficiently accurate to be helpful in spotting relevant text passages and extracting organism and environment terms. Comparison of fully manual and text-mining-assisted curation revealed that EXTRACT speeds up annotation by 15–25% and helps curators to detect terms that would otherwise have been missed. Database URL: https://extract.hcmr.gr/
format Online
Article
Text
id pubmed-4761108
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-47611082016-02-22 EXTRACT: interactive extraction of environment metadata and term suggestion for metagenomic sample annotation Pafilis, Evangelos Buttigieg, Pier Luigi Ferrell, Barbra Pereira, Emiliano Schnetzer, Julia Arvanitidis, Christos Jensen, Lars Juhl Database (Oxford) Original Article The microbial and molecular ecology research communities have made substantial progress on developing standards for annotating samples with environment metadata. However, sample manual annotation is a highly labor intensive process and requires familiarity with the terminologies used. We have therefore developed an interactive annotation tool, EXTRACT, which helps curators identify and extract standard-compliant terms for annotation of metagenomic records and other samples. Behind its web-based user interface, the system combines published methods for named entity recognition of environment, organism, tissue and disease terms. The evaluators in the BioCreative V Interactive Annotation Task found the system to be intuitive, useful, well documented and sufficiently accurate to be helpful in spotting relevant text passages and extracting organism and environment terms. Comparison of fully manual and text-mining-assisted curation revealed that EXTRACT speeds up annotation by 15–25% and helps curators to detect terms that would otherwise have been missed. Database URL: https://extract.hcmr.gr/ Oxford University Press 2016-02-19 /pmc/articles/PMC4761108/ /pubmed/26896844 http://dx.doi.org/10.1093/database/baw005 Text en © The Author(s) 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Pafilis, Evangelos
Buttigieg, Pier Luigi
Ferrell, Barbra
Pereira, Emiliano
Schnetzer, Julia
Arvanitidis, Christos
Jensen, Lars Juhl
EXTRACT: interactive extraction of environment metadata and term suggestion for metagenomic sample annotation
title EXTRACT: interactive extraction of environment metadata and term suggestion for metagenomic sample annotation
title_full EXTRACT: interactive extraction of environment metadata and term suggestion for metagenomic sample annotation
title_fullStr EXTRACT: interactive extraction of environment metadata and term suggestion for metagenomic sample annotation
title_full_unstemmed EXTRACT: interactive extraction of environment metadata and term suggestion for metagenomic sample annotation
title_short EXTRACT: interactive extraction of environment metadata and term suggestion for metagenomic sample annotation
title_sort extract: interactive extraction of environment metadata and term suggestion for metagenomic sample annotation
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4761108/
https://www.ncbi.nlm.nih.gov/pubmed/26896844
http://dx.doi.org/10.1093/database/baw005
work_keys_str_mv AT pafilisevangelos extractinteractiveextractionofenvironmentmetadataandtermsuggestionformetagenomicsampleannotation
AT buttigiegpierluigi extractinteractiveextractionofenvironmentmetadataandtermsuggestionformetagenomicsampleannotation
AT ferrellbarbra extractinteractiveextractionofenvironmentmetadataandtermsuggestionformetagenomicsampleannotation
AT pereiraemiliano extractinteractiveextractionofenvironmentmetadataandtermsuggestionformetagenomicsampleannotation
AT schnetzerjulia extractinteractiveextractionofenvironmentmetadataandtermsuggestionformetagenomicsampleannotation
AT arvanitidischristos extractinteractiveextractionofenvironmentmetadataandtermsuggestionformetagenomicsampleannotation
AT jensenlarsjuhl extractinteractiveextractionofenvironmentmetadataandtermsuggestionformetagenomicsampleannotation