Cargando…
BioC implementations in Go, Perl, Python and Ruby
As part of a communitywide effort for evaluating text mining and information extraction systems applied to the biomedical domain, BioC is focused on the goal of interoperability, currently a major barrier to wide-scale adoption of text mining tools. BioC is a simple XML format, specified by DTD, for...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4067548/ https://www.ncbi.nlm.nih.gov/pubmed/24961236 http://dx.doi.org/10.1093/database/bau059 |
_version_ | 1782322307518169088 |
---|---|
author | Liu, Wanli Islamaj Doğan, Rezarta Kwon, Dongseop Marques, Hernani Rinaldi, Fabio Wilbur, W. John Comeau, Donald C. |
author_facet | Liu, Wanli Islamaj Doğan, Rezarta Kwon, Dongseop Marques, Hernani Rinaldi, Fabio Wilbur, W. John Comeau, Donald C. |
author_sort | Liu, Wanli |
collection | PubMed |
description | As part of a communitywide effort for evaluating text mining and information extraction systems applied to the biomedical domain, BioC is focused on the goal of interoperability, currently a major barrier to wide-scale adoption of text mining tools. BioC is a simple XML format, specified by DTD, for exchanging data for biomedical natural language processing. With initial implementations in C++ and Java, BioC provides libraries of code for reading and writing BioC text documents and annotations. We extend BioC to Perl, Python, Go and Ruby. We used SWIG to extend the C++ implementation for Perl and one Python implementation. A second Python implementation and the Ruby implementation use native data structures and libraries. BioC is also implemented in the Google language Go. BioC modules are functional in all of these languages, which can facilitate text mining tasks. BioC implementations are freely available through the BioC site: http://bioc.sourceforge.net. Database URL: http://bioc.sourceforge.net/ |
format | Online Article Text |
id | pubmed-4067548 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-40675482014-06-25 BioC implementations in Go, Perl, Python and Ruby Liu, Wanli Islamaj Doğan, Rezarta Kwon, Dongseop Marques, Hernani Rinaldi, Fabio Wilbur, W. John Comeau, Donald C. Database (Oxford) Original Article As part of a communitywide effort for evaluating text mining and information extraction systems applied to the biomedical domain, BioC is focused on the goal of interoperability, currently a major barrier to wide-scale adoption of text mining tools. BioC is a simple XML format, specified by DTD, for exchanging data for biomedical natural language processing. With initial implementations in C++ and Java, BioC provides libraries of code for reading and writing BioC text documents and annotations. We extend BioC to Perl, Python, Go and Ruby. We used SWIG to extend the C++ implementation for Perl and one Python implementation. A second Python implementation and the Ruby implementation use native data structures and libraries. BioC is also implemented in the Google language Go. BioC modules are functional in all of these languages, which can facilitate text mining tasks. BioC implementations are freely available through the BioC site: http://bioc.sourceforge.net. Database URL: http://bioc.sourceforge.net/ Oxford University Press 2014-06-23 /pmc/articles/PMC4067548/ /pubmed/24961236 http://dx.doi.org/10.1093/database/bau059 Text en Published by Oxford University Press 2014. This work is written by US Government employees and is in the public domain in the US. |
spellingShingle | Original Article Liu, Wanli Islamaj Doğan, Rezarta Kwon, Dongseop Marques, Hernani Rinaldi, Fabio Wilbur, W. John Comeau, Donald C. BioC implementations in Go, Perl, Python and Ruby |
title | BioC implementations in Go, Perl, Python and Ruby |
title_full | BioC implementations in Go, Perl, Python and Ruby |
title_fullStr | BioC implementations in Go, Perl, Python and Ruby |
title_full_unstemmed | BioC implementations in Go, Perl, Python and Ruby |
title_short | BioC implementations in Go, Perl, Python and Ruby |
title_sort | bioc implementations in go, perl, python and ruby |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4067548/ https://www.ncbi.nlm.nih.gov/pubmed/24961236 http://dx.doi.org/10.1093/database/bau059 |
work_keys_str_mv | AT liuwanli biocimplementationsingoperlpythonandruby AT islamajdoganrezarta biocimplementationsingoperlpythonandruby AT kwondongseop biocimplementationsingoperlpythonandruby AT marqueshernani biocimplementationsingoperlpythonandruby AT rinaldifabio biocimplementationsingoperlpythonandruby AT wilburwjohn biocimplementationsingoperlpythonandruby AT comeaudonaldc biocimplementationsingoperlpythonandruby |