Cargando…

BioC implementations in Go, Perl, Python and Ruby

As part of a communitywide effort for evaluating text mining and information extraction systems applied to the biomedical domain, BioC is focused on the goal of interoperability, currently a major barrier to wide-scale adoption of text mining tools. BioC is a simple XML format, specified by DTD, for...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Wanli, Islamaj Doğan, Rezarta, Kwon, Dongseop, Marques, Hernani, Rinaldi, Fabio, Wilbur, W. John, Comeau, Donald C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4067548/
https://www.ncbi.nlm.nih.gov/pubmed/24961236
http://dx.doi.org/10.1093/database/bau059
_version_ 1782322307518169088
author Liu, Wanli
Islamaj Doğan, Rezarta
Kwon, Dongseop
Marques, Hernani
Rinaldi, Fabio
Wilbur, W. John
Comeau, Donald C.
author_facet Liu, Wanli
Islamaj Doğan, Rezarta
Kwon, Dongseop
Marques, Hernani
Rinaldi, Fabio
Wilbur, W. John
Comeau, Donald C.
author_sort Liu, Wanli
collection PubMed
description As part of a communitywide effort for evaluating text mining and information extraction systems applied to the biomedical domain, BioC is focused on the goal of interoperability, currently a major barrier to wide-scale adoption of text mining tools. BioC is a simple XML format, specified by DTD, for exchanging data for biomedical natural language processing. With initial implementations in C++ and Java, BioC provides libraries of code for reading and writing BioC text documents and annotations. We extend BioC to Perl, Python, Go and Ruby. We used SWIG to extend the C++ implementation for Perl and one Python implementation. A second Python implementation and the Ruby implementation use native data structures and libraries. BioC is also implemented in the Google language Go. BioC modules are functional in all of these languages, which can facilitate text mining tasks. BioC implementations are freely available through the BioC site: http://bioc.sourceforge.net. Database URL: http://bioc.sourceforge.net/
format Online
Article
Text
id pubmed-4067548
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40675482014-06-25 BioC implementations in Go, Perl, Python and Ruby Liu, Wanli Islamaj Doğan, Rezarta Kwon, Dongseop Marques, Hernani Rinaldi, Fabio Wilbur, W. John Comeau, Donald C. Database (Oxford) Original Article As part of a communitywide effort for evaluating text mining and information extraction systems applied to the biomedical domain, BioC is focused on the goal of interoperability, currently a major barrier to wide-scale adoption of text mining tools. BioC is a simple XML format, specified by DTD, for exchanging data for biomedical natural language processing. With initial implementations in C++ and Java, BioC provides libraries of code for reading and writing BioC text documents and annotations. We extend BioC to Perl, Python, Go and Ruby. We used SWIG to extend the C++ implementation for Perl and one Python implementation. A second Python implementation and the Ruby implementation use native data structures and libraries. BioC is also implemented in the Google language Go. BioC modules are functional in all of these languages, which can facilitate text mining tasks. BioC implementations are freely available through the BioC site: http://bioc.sourceforge.net. Database URL: http://bioc.sourceforge.net/ Oxford University Press 2014-06-23 /pmc/articles/PMC4067548/ /pubmed/24961236 http://dx.doi.org/10.1093/database/bau059 Text en Published by Oxford University Press 2014. This work is written by US Government employees and is in the public domain in the US.
spellingShingle Original Article
Liu, Wanli
Islamaj Doğan, Rezarta
Kwon, Dongseop
Marques, Hernani
Rinaldi, Fabio
Wilbur, W. John
Comeau, Donald C.
BioC implementations in Go, Perl, Python and Ruby
title BioC implementations in Go, Perl, Python and Ruby
title_full BioC implementations in Go, Perl, Python and Ruby
title_fullStr BioC implementations in Go, Perl, Python and Ruby
title_full_unstemmed BioC implementations in Go, Perl, Python and Ruby
title_short BioC implementations in Go, Perl, Python and Ruby
title_sort bioc implementations in go, perl, python and ruby
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4067548/
https://www.ncbi.nlm.nih.gov/pubmed/24961236
http://dx.doi.org/10.1093/database/bau059
work_keys_str_mv AT liuwanli biocimplementationsingoperlpythonandruby
AT islamajdoganrezarta biocimplementationsingoperlpythonandruby
AT kwondongseop biocimplementationsingoperlpythonandruby
AT marqueshernani biocimplementationsingoperlpythonandruby
AT rinaldifabio biocimplementationsingoperlpythonandruby
AT wilburwjohn biocimplementationsingoperlpythonandruby
AT comeaudonaldc biocimplementationsingoperlpythonandruby