
Text mining in the biocuration workflow: applications for literature curation at WormBase, dictyBase and TAIR

WormBase, dictyBase and The Arabidopsis Information Resource (TAIR) are model organism databases containing information about Caenorhabditis elegans and other nematodes, the social amoeba Dictyostelium discoideum and related Dictyostelids and the flowering plant Arabidopsis thaliana, respectively. E...


Bibliographic Details
Main authors: Van Auken, Kimberly, Fey, Petra, Berardini, Tanya Z., Dodson, Robert, Cooper, Laurel, Li, Donghui, Chan, Juancarlos, Li, Yuling, Basu, Siddhartha, Muller, Hans-Michael, Chisholm, Rex, Huala, Eva, Sternberg, Paul W.
Format: Online Article Text
Language: English
Published: Oxford University Press 2012
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3500519/
https://www.ncbi.nlm.nih.gov/pubmed/23160413
http://dx.doi.org/10.1093/database/bas040
author Van Auken, Kimberly
Fey, Petra
Berardini, Tanya Z.
Dodson, Robert
Cooper, Laurel
Li, Donghui
Chan, Juancarlos
Li, Yuling
Basu, Siddhartha
Muller, Hans-Michael
Chisholm, Rex
Huala, Eva
Sternberg, Paul W.
collection PubMed
description WormBase, dictyBase and The Arabidopsis Information Resource (TAIR) are model organism databases containing information about Caenorhabditis elegans and other nematodes, the social amoeba Dictyostelium discoideum and related Dictyostelids and the flowering plant Arabidopsis thaliana, respectively. Each database curates multiple data types from the primary research literature. In this article, we describe the curation workflow at WormBase, with particular emphasis on our use of text-mining tools (BioCreative 2012, Workshop Track II). We then describe the application of a specific component of that workflow, Textpresso for Cellular Component Curation (CCC), to Gene Ontology (GO) curation at dictyBase and TAIR (BioCreative 2012, Workshop Track III). We find that, with organism-specific modifications, Textpresso can be used by dictyBase and TAIR to annotate gene products to GO's Cellular Component (CC) ontology.
format Online
Article
Text
id pubmed-3500519
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-3500519 2012-11-19 Text mining in the biocuration workflow: applications for literature curation at WormBase, dictyBase and TAIR Van Auken, Kimberly Fey, Petra Berardini, Tanya Z. Dodson, Robert Cooper, Laurel Li, Donghui Chan, Juancarlos Li, Yuling Basu, Siddhartha Muller, Hans-Michael Chisholm, Rex Huala, Eva Sternberg, Paul W. Database (Oxford) BioCreative Virtual Issue WormBase, dictyBase and The Arabidopsis Information Resource (TAIR) are model organism databases containing information about Caenorhabditis elegans and other nematodes, the social amoeba Dictyostelium discoideum and related Dictyostelids and the flowering plant Arabidopsis thaliana, respectively. Each database curates multiple data types from the primary research literature. In this article, we describe the curation workflow at WormBase, with particular emphasis on our use of text-mining tools (BioCreative 2012, Workshop Track II). We then describe the application of a specific component of that workflow, Textpresso for Cellular Component Curation (CCC), to Gene Ontology (GO) curation at dictyBase and TAIR (BioCreative 2012, Workshop Track III). We find that, with organism-specific modifications, Textpresso can be used by dictyBase and TAIR to annotate gene products to GO's Cellular Component (CC) ontology. Oxford University Press 2012-11-15 /pmc/articles/PMC3500519/ /pubmed/23160413 http://dx.doi.org/10.1093/database/bas040 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com.
title Text mining in the biocuration workflow: applications for literature curation at WormBase, dictyBase and TAIR
topic BioCreative Virtual Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3500519/
https://www.ncbi.nlm.nih.gov/pubmed/23160413
http://dx.doi.org/10.1093/database/bas040