Cargando…

BSQA: integrated text mining using entity relation semantics extracted from biological literature of insects

Text mining is one promising way of extracting information automatically from the vast biological literature. To maximize its potential, the knowledge encoded in the text should be translated to some semantic representation such as entities and relations, which could be analyzed by machines. But lar...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Xin, Li, Yanen, Khetani, Radhika, Sanders, Barry, Lu, Yue, Ling, Xu, Zhai, ChengXiang, Schatz, Bruce
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2896161/
https://www.ncbi.nlm.nih.gov/pubmed/20576702
http://dx.doi.org/10.1093/nar/gkq544
_version_ 1782183324580577280
author He, Xin
Li, Yanen
Khetani, Radhika
Sanders, Barry
Lu, Yue
Ling, Xu
Zhai, ChengXiang
Schatz, Bruce
author_facet He, Xin
Li, Yanen
Khetani, Radhika
Sanders, Barry
Lu, Yue
Ling, Xu
Zhai, ChengXiang
Schatz, Bruce
author_sort He, Xin
collection PubMed
description Text mining is one promising way of extracting information automatically from the vast biological literature. To maximize its potential, the knowledge encoded in the text should be translated to some semantic representation such as entities and relations, which could be analyzed by machines. But large-scale practical systems for this purpose are rare. We present BeeSpace question/answering (BSQA) system that performs integrated text mining for insect biology, covering diverse aspects from molecular interactions of genes to insect behavior. BSQA recognizes a number of entities and relations in Medline documents about the model insect, Drosophila melanogaster. For any text query, BSQA exploits entity annotation of retrieved documents to identify important concepts in different categories. By utilizing the extracted relations, BSQA is also able to answer many biologically motivated questions, from simple ones such as, which anatomical part is a gene expressed in, to more complex ones involving multiple types of relations. BSQA is freely available at http://www.beespace.uiuc.edu/QuestionAnswer.
format Text
id pubmed-2896161
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28961612010-07-02 BSQA: integrated text mining using entity relation semantics extracted from biological literature of insects He, Xin Li, Yanen Khetani, Radhika Sanders, Barry Lu, Yue Ling, Xu Zhai, ChengXiang Schatz, Bruce Nucleic Acids Res Articles Text mining is one promising way of extracting information automatically from the vast biological literature. To maximize its potential, the knowledge encoded in the text should be translated to some semantic representation such as entities and relations, which could be analyzed by machines. But large-scale practical systems for this purpose are rare. We present BeeSpace question/answering (BSQA) system that performs integrated text mining for insect biology, covering diverse aspects from molecular interactions of genes to insect behavior. BSQA recognizes a number of entities and relations in Medline documents about the model insect, Drosophila melanogaster. For any text query, BSQA exploits entity annotation of retrieved documents to identify important concepts in different categories. By utilizing the extracted relations, BSQA is also able to answer many biologically motivated questions, from simple ones such as, which anatomical part is a gene expressed in, to more complex ones involving multiple types of relations. BSQA is freely available at http://www.beespace.uiuc.edu/QuestionAnswer. Oxford University Press 2010-07-01 2010-06-21 /pmc/articles/PMC2896161/ /pubmed/20576702 http://dx.doi.org/10.1093/nar/gkq544 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
He, Xin
Li, Yanen
Khetani, Radhika
Sanders, Barry
Lu, Yue
Ling, Xu
Zhai, ChengXiang
Schatz, Bruce
BSQA: integrated text mining using entity relation semantics extracted from biological literature of insects
title BSQA: integrated text mining using entity relation semantics extracted from biological literature of insects
title_full BSQA: integrated text mining using entity relation semantics extracted from biological literature of insects
title_fullStr BSQA: integrated text mining using entity relation semantics extracted from biological literature of insects
title_full_unstemmed BSQA: integrated text mining using entity relation semantics extracted from biological literature of insects
title_short BSQA: integrated text mining using entity relation semantics extracted from biological literature of insects
title_sort bsqa: integrated text mining using entity relation semantics extracted from biological literature of insects
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2896161/
https://www.ncbi.nlm.nih.gov/pubmed/20576702
http://dx.doi.org/10.1093/nar/gkq544
work_keys_str_mv AT hexin bsqaintegratedtextminingusingentityrelationsemanticsextractedfrombiologicalliteratureofinsects
AT liyanen bsqaintegratedtextminingusingentityrelationsemanticsextractedfrombiologicalliteratureofinsects
AT khetaniradhika bsqaintegratedtextminingusingentityrelationsemanticsextractedfrombiologicalliteratureofinsects
AT sandersbarry bsqaintegratedtextminingusingentityrelationsemanticsextractedfrombiologicalliteratureofinsects
AT luyue bsqaintegratedtextminingusingentityrelationsemanticsextractedfrombiologicalliteratureofinsects
AT lingxu bsqaintegratedtextminingusingentityrelationsemanticsextractedfrombiologicalliteratureofinsects
AT zhaichengxiang bsqaintegratedtextminingusingentityrelationsemanticsextractedfrombiologicalliteratureofinsects
AT schatzbruce bsqaintegratedtextminingusingentityrelationsemanticsextractedfrombiologicalliteratureofinsects