Cargando…

PGxMine: Text mining for curation of PharmGKB

Precision medicine tailors treatment to individuals personal data including differences in their genome. The Pharmacogenomics Knowledgebase (PharmGKB) provides highly curated information on the effect of genetic variation on drug response and side effects for a wide range of drugs. PharmGKB’s scient...

Descripción completa

Detalles Bibliográficos
Autores principales: Lever, Jake, Barbarino, Julia M., Gong, Li, Huddart, Rachel, Sangkuhl, Katrin, Whaley, Ryan, Whirl-Carrillo, Michelle, Woon, Mark, Klein, Teri E., Altman, Russ B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6917032/
https://www.ncbi.nlm.nih.gov/pubmed/31797632
_version_ 1783480340461912064
author Lever, Jake
Barbarino, Julia M.
Gong, Li
Huddart, Rachel
Sangkuhl, Katrin
Whaley, Ryan
Whirl-Carrillo, Michelle
Woon, Mark
Klein, Teri E.
Altman, Russ B.
author_facet Lever, Jake
Barbarino, Julia M.
Gong, Li
Huddart, Rachel
Sangkuhl, Katrin
Whaley, Ryan
Whirl-Carrillo, Michelle
Woon, Mark
Klein, Teri E.
Altman, Russ B.
author_sort Lever, Jake
collection PubMed
description Precision medicine tailors treatment to individuals personal data including differences in their genome. The Pharmacogenomics Knowledgebase (PharmGKB) provides highly curated information on the effect of genetic variation on drug response and side effects for a wide range of drugs. PharmGKB’s scientific curators triage, review and annotate a large number of papers each year but the task is challenging. We present the PGxMine resource, a text-mined resource of pharmacogenomic associations from all accessible published literature to assist in the curation of PharmGKB. We developed a supervised machine learning pipeline to extract associations between a variant (DNA and protein changes, star alleles and dbSNP identifiers) and a chemical. PGxMine covers 452 chemicals and 2,426 variants and contains 19,930 mentions of pharmacogenomic associations across 7,170 papers. An evaluation by PharmGKB curators found that 57 of the top 100 associations not found in PharmGKB led to 83 curatable papers and a further 24 associations would likely lead to curatable papers through citations. The results can be viewed at https://pgxmine.pharmgkb.org/ and code can be downloaded at https://github.com/jakelever/pgxmine.
format Online
Article
Text
id pubmed-6917032
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-69170322020-01-01 PGxMine: Text mining for curation of PharmGKB Lever, Jake Barbarino, Julia M. Gong, Li Huddart, Rachel Sangkuhl, Katrin Whaley, Ryan Whirl-Carrillo, Michelle Woon, Mark Klein, Teri E. Altman, Russ B. Pac Symp Biocomput Article Precision medicine tailors treatment to individuals personal data including differences in their genome. The Pharmacogenomics Knowledgebase (PharmGKB) provides highly curated information on the effect of genetic variation on drug response and side effects for a wide range of drugs. PharmGKB’s scientific curators triage, review and annotate a large number of papers each year but the task is challenging. We present the PGxMine resource, a text-mined resource of pharmacogenomic associations from all accessible published literature to assist in the curation of PharmGKB. We developed a supervised machine learning pipeline to extract associations between a variant (DNA and protein changes, star alleles and dbSNP identifiers) and a chemical. PGxMine covers 452 chemicals and 2,426 variants and contains 19,930 mentions of pharmacogenomic associations across 7,170 papers. An evaluation by PharmGKB curators found that 57 of the top 100 associations not found in PharmGKB led to 83 curatable papers and a further 24 associations would likely lead to curatable papers through citations. The results can be viewed at https://pgxmine.pharmgkb.org/ and code can be downloaded at https://github.com/jakelever/pgxmine. 2020 /pmc/articles/PMC6917032/ /pubmed/31797632 Text en http://creativecommons.org/licenses/by/4.0/ Open Access chapter published by World Scientific Publishing Company and distributed under the terms of the Creative Commons Attribution Non-Commercial (CC BY-NC) 4.0 License.
spellingShingle Article
Lever, Jake
Barbarino, Julia M.
Gong, Li
Huddart, Rachel
Sangkuhl, Katrin
Whaley, Ryan
Whirl-Carrillo, Michelle
Woon, Mark
Klein, Teri E.
Altman, Russ B.
PGxMine: Text mining for curation of PharmGKB
title PGxMine: Text mining for curation of PharmGKB
title_full PGxMine: Text mining for curation of PharmGKB
title_fullStr PGxMine: Text mining for curation of PharmGKB
title_full_unstemmed PGxMine: Text mining for curation of PharmGKB
title_short PGxMine: Text mining for curation of PharmGKB
title_sort pgxmine: text mining for curation of pharmgkb
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6917032/
https://www.ncbi.nlm.nih.gov/pubmed/31797632
work_keys_str_mv AT leverjake pgxminetextminingforcurationofpharmgkb
AT barbarinojuliam pgxminetextminingforcurationofpharmgkb
AT gongli pgxminetextminingforcurationofpharmgkb
AT huddartrachel pgxminetextminingforcurationofpharmgkb
AT sangkuhlkatrin pgxminetextminingforcurationofpharmgkb
AT whaleyryan pgxminetextminingforcurationofpharmgkb
AT whirlcarrillomichelle pgxminetextminingforcurationofpharmgkb
AT woonmark pgxminetextminingforcurationofpharmgkb
AT kleinterie pgxminetextminingforcurationofpharmgkb
AT altmanrussb pgxminetextminingforcurationofpharmgkb