Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text
BACKGROUND: Pharmacogenomics studies the relationship between genetic variation and the variation in drug response phenotypes. The field is rapidly gaining importance: it promises drugs targeted to particular subpopulations based on genetic background. The pharmacogenomics literature has expanded ra...
Main authors: | Garten, Yael; Altman, Russ B |
---|---|
Format: | Text |
Language: | English |
Published: | BioMed Central, 2009 |
Subjects: | Proceedings |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2646239/ https://www.ncbi.nlm.nih.gov/pubmed/19208194 http://dx.doi.org/10.1186/1471-2105-10-S2-S6 |
_version_ | 1782164829956472832 |
---|---|
author | Garten, Yael Altman, Russ B |
author_facet | Garten, Yael Altman, Russ B |
author_sort | Garten, Yael |
collection | PubMed |
description | BACKGROUND: Pharmacogenomics studies the relationship between genetic variation and the variation in drug response phenotypes. The field is rapidly gaining importance: it promises drugs targeted to particular subpopulations based on genetic background. The pharmacogenomics literature has expanded rapidly, but is dispersed in many journals. It is challenging, therefore, to identify important associations between drugs and molecular entities – particularly genes and gene variants, and thus these critical connections are often lost. Text mining techniques can allow us to convert the free-style text to a computable, searchable format in which pharmacogenomic concepts (such as genes, drugs, polymorphisms, and diseases) are identified, and important links between these concepts are recorded. Availability of full text articles as input into text mining engines is key, as literature abstracts often do not contain sufficient information to identify these pharmacogenomic associations. RESULTS: Thus, building on a tool called Textpresso, we have created the Pharmspresso tool to assist in identifying important pharmacogenomic facts in full text articles. Pharmspresso parses text to find references to human genes, polymorphisms, drugs and diseases and their relationships. It presents these as a series of marked-up text fragments, in which key concepts are visually highlighted. To evaluate Pharmspresso, we used a gold standard of 45 human-curated articles. Pharmspresso identified 78%, 61%, and 74% of target gene, polymorphism, and drug concepts, respectively. CONCLUSION: Pharmspresso is a text analysis tool that extracts pharmacogenomic concepts from the literature automatically and thus captures our current understanding of gene-drug interactions in a computable form. We have made Pharmspresso available at . |
format | Text |
id | pubmed-2646239 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-2646239 2009-02-23 Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text Garten, Yael Altman, Russ B BMC Bioinformatics Proceedings BACKGROUND: Pharmacogenomics studies the relationship between genetic variation and the variation in drug response phenotypes. The field is rapidly gaining importance: it promises drugs targeted to particular subpopulations based on genetic background. The pharmacogenomics literature has expanded rapidly, but is dispersed in many journals. It is challenging, therefore, to identify important associations between drugs and molecular entities – particularly genes and gene variants, and thus these critical connections are often lost. Text mining techniques can allow us to convert the free-style text to a computable, searchable format in which pharmacogenomic concepts (such as genes, drugs, polymorphisms, and diseases) are identified, and important links between these concepts are recorded. Availability of full text articles as input into text mining engines is key, as literature abstracts often do not contain sufficient information to identify these pharmacogenomic associations. RESULTS: Thus, building on a tool called Textpresso, we have created the Pharmspresso tool to assist in identifying important pharmacogenomic facts in full text articles. Pharmspresso parses text to find references to human genes, polymorphisms, drugs and diseases and their relationships. It presents these as a series of marked-up text fragments, in which key concepts are visually highlighted. To evaluate Pharmspresso, we used a gold standard of 45 human-curated articles. Pharmspresso identified 78%, 61%, and 74% of target gene, polymorphism, and drug concepts, respectively. CONCLUSION: Pharmspresso is a text analysis tool that extracts pharmacogenomic concepts from the literature automatically and thus captures our current understanding of gene-drug interactions in a computable form. We have made Pharmspresso available at . BioMed Central 2009-02-05 /pmc/articles/PMC2646239/ /pubmed/19208194 http://dx.doi.org/10.1186/1471-2105-10-S2-S6 Text en Copyright © 2009 Garten and Altman; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Garten, Yael Altman, Russ B Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text |
title | Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text |
title_full | Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text |
title_fullStr | Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text |
title_full_unstemmed | Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text |
title_short | Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text |
title_sort | pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2646239/ https://www.ncbi.nlm.nih.gov/pubmed/19208194 http://dx.doi.org/10.1186/1471-2105-10-S2-S6 |
work_keys_str_mv | AT gartenyael pharmspressoatextminingtoolforextractionofpharmacogenomicconceptsandrelationshipsfromfulltext AT altmanrussb pharmspressoatextminingtoolforextractionofpharmacogenomicconceptsandrelationshipsfromfulltext |