Cargando…

Translational drug–interaction corpus

The discovery of drug–drug interactions (DDIs) that have a translational impact among in vitro pharmacokinetics (PK), in vivo PK and clinical outcomes depends largely on the quality of the annotated corpus available for text mining. We have developed a new DDI corpus based on an annotation scheme th...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Shijun, Wu, Hengyi, Wang, Lei, Zhang, Gongbo, Rocha, Luis M, Shatkay, Hagit, Li, Lang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9216474/
https://www.ncbi.nlm.nih.gov/pubmed/35616099
http://dx.doi.org/10.1093/database/baac031
_version_ 1784731430044041216
author Zhang, Shijun
Wu, Hengyi
Wang, Lei
Zhang, Gongbo
Rocha, Luis M
Shatkay, Hagit
Li, Lang
author_facet Zhang, Shijun
Wu, Hengyi
Wang, Lei
Zhang, Gongbo
Rocha, Luis M
Shatkay, Hagit
Li, Lang
author_sort Zhang, Shijun
collection PubMed
description The discovery of drug–drug interactions (DDIs) that have a translational impact among in vitro pharmacokinetics (PK), in vivo PK and clinical outcomes depends largely on the quality of the annotated corpus available for text mining. We have developed a new DDI corpus based on an annotation scheme that builds upon and extends previous ones, where an abstract is fragmented and each fragment is then annotated along eight dimensions, namely, focus, polarity, certainty, evidence, directionality, study type, interaction type and mechanism. The guideline for defining these dimensions has undergone refinement during the annotation process. Our DDI corpus comprises 900 positive DDI abstracts and 750 that are not directly relevant to DDI. The abstracts in corpus are separated into eight categories of DDI or non-DDI evidence: DDI with pharmacokinetic (PK) mechanism, in vivo DDI PK, DDI clinical, drug–nutrition interaction, single drug, not drug related, in vitro pharmacodynamic (PD) and case report. Seven annotators, three annotators with drug–interaction research experience and four annotators with less drug–interaction research experience independently annotated the DDI corpus, where two researchers independently annotated each abstract. After two rounds of annotations with additional training in between, agreement improved from (0.79, 0.96, 0.86, 0.70, 0.91, 0.65, 0.78, 0.90) to (0.93, 0.99, 0.96, 0.94, 0.95, 0.93, 0.96, 0.97) for focus, certainty, evidence, study type, interaction type, mechanisms, polarity and direction, respectively. The novice-level annotators improved from 0.83 to 0.96, while the expert-level annotators stayed in high performance with some improvement, from 0.90 to 0.96. In summary, we achieved 96% agreement among each pair of annotators with regard to the eight dimensions. The annotated corpus is now available to the community for inclusion in their text-mining pipelines. Database URL https://github.com/zha204/DDI-Corpus-Database/tree/master/DDI%20corpus
format Online
Article
Text
id pubmed-9216474
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-92164742022-06-23 Translational drug–interaction corpus Zhang, Shijun Wu, Hengyi Wang, Lei Zhang, Gongbo Rocha, Luis M Shatkay, Hagit Li, Lang Database (Oxford) Original Article The discovery of drug–drug interactions (DDIs) that have a translational impact among in vitro pharmacokinetics (PK), in vivo PK and clinical outcomes depends largely on the quality of the annotated corpus available for text mining. We have developed a new DDI corpus based on an annotation scheme that builds upon and extends previous ones, where an abstract is fragmented and each fragment is then annotated along eight dimensions, namely, focus, polarity, certainty, evidence, directionality, study type, interaction type and mechanism. The guideline for defining these dimensions has undergone refinement during the annotation process. Our DDI corpus comprises 900 positive DDI abstracts and 750 that are not directly relevant to DDI. The abstracts in corpus are separated into eight categories of DDI or non-DDI evidence: DDI with pharmacokinetic (PK) mechanism, in vivo DDI PK, DDI clinical, drug–nutrition interaction, single drug, not drug related, in vitro pharmacodynamic (PD) and case report. Seven annotators, three annotators with drug–interaction research experience and four annotators with less drug–interaction research experience independently annotated the DDI corpus, where two researchers independently annotated each abstract. After two rounds of annotations with additional training in between, agreement improved from (0.79, 0.96, 0.86, 0.70, 0.91, 0.65, 0.78, 0.90) to (0.93, 0.99, 0.96, 0.94, 0.95, 0.93, 0.96, 0.97) for focus, certainty, evidence, study type, interaction type, mechanisms, polarity and direction, respectively. The novice-level annotators improved from 0.83 to 0.96, while the expert-level annotators stayed in high performance with some improvement, from 0.90 to 0.96. In summary, we achieved 96% agreement among each pair of annotators with regard to the eight dimensions. The annotated corpus is now available to the community for inclusion in their text-mining pipelines. Database URL https://github.com/zha204/DDI-Corpus-Database/tree/master/DDI%20corpus Oxford University Press 2022-05-25 /pmc/articles/PMC9216474/ /pubmed/35616099 http://dx.doi.org/10.1093/database/baac031 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Article
Zhang, Shijun
Wu, Hengyi
Wang, Lei
Zhang, Gongbo
Rocha, Luis M
Shatkay, Hagit
Li, Lang
Translational drug–interaction corpus
title Translational drug–interaction corpus
title_full Translational drug–interaction corpus
title_fullStr Translational drug–interaction corpus
title_full_unstemmed Translational drug–interaction corpus
title_short Translational drug–interaction corpus
title_sort translational drug–interaction corpus
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9216474/
https://www.ncbi.nlm.nih.gov/pubmed/35616099
http://dx.doi.org/10.1093/database/baac031
work_keys_str_mv AT zhangshijun translationaldruginteractioncorpus
AT wuhengyi translationaldruginteractioncorpus
AT wanglei translationaldruginteractioncorpus
AT zhanggongbo translationaldruginteractioncorpus
AT rochaluism translationaldruginteractioncorpus
AT shatkayhagit translationaldruginteractioncorpus
AT lilang translationaldruginteractioncorpus