Cargando…

A Novel Metric to Quantify the Effect of Pathway Enrichment Evaluation With Respect to Biomedical Text-Mined Terms: Development and Feasibility Study

BACKGROUND: Natural language processing has long been applied in various applications for biomedical knowledge inference and discovery. Enrichment analysis based on named entity recognition is a classic application for inferring enriched associations in terms of specific biomedical entities such as...

Descripción completa

Detalles Bibliográficos
Autores principales: Qin, Xuan, Yao, Xinzhi, Xia, Jingbo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8277388/
https://www.ncbi.nlm.nih.gov/pubmed/34142969
http://dx.doi.org/10.2196/28247
_version_ 1783722063993765888
author Qin, Xuan
Yao, Xinzhi
Xia, Jingbo
author_facet Qin, Xuan
Yao, Xinzhi
Xia, Jingbo
author_sort Qin, Xuan
collection PubMed
description BACKGROUND: Natural language processing has long been applied in various applications for biomedical knowledge inference and discovery. Enrichment analysis based on named entity recognition is a classic application for inferring enriched associations in terms of specific biomedical entities such as gene, chemical, and mutation. OBJECTIVE: The aim of this study was to investigate the effect of pathway enrichment evaluation with respect to biomedical text-mining results and to develop a novel metric to quantify the effect. METHODS: Four biomedical text mining methods were selected to represent natural language processing methods on drug-related gene mining. Subsequently, a pathway enrichment experiment was performed by using the mined genes, and a series of inverse pathway frequency (IPF) metrics was proposed accordingly to evaluate the effect of pathway enrichment. Thereafter, 7 IPF metrics and traditional P value metrics were compared in simulation experiments to test the robustness of the proposed metrics. RESULTS: IPF metrics were evaluated in a case study of rapamycin-related gene set. By applying the best IPF metrics in a pathway enrichment simulation test, a novel discovery of drug efficacy of rapamycin for breast cancer was replicated from the data chosen prior to the year 2000. Our findings show the effectiveness of the best IPF metric in support of knowledge discovery in new drug use. Further, the mechanism underlying the drug-disease association was visualized by Cytoscape. CONCLUSIONS: The results of this study suggest the effectiveness of the proposed IPF metrics in pathway enrichment evaluation as well as its application in drug use discovery.
format Online
Article
Text
id pubmed-8277388
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-82773882021-07-26 A Novel Metric to Quantify the Effect of Pathway Enrichment Evaluation With Respect to Biomedical Text-Mined Terms: Development and Feasibility Study Qin, Xuan Yao, Xinzhi Xia, Jingbo JMIR Med Inform Original Paper BACKGROUND: Natural language processing has long been applied in various applications for biomedical knowledge inference and discovery. Enrichment analysis based on named entity recognition is a classic application for inferring enriched associations in terms of specific biomedical entities such as gene, chemical, and mutation. OBJECTIVE: The aim of this study was to investigate the effect of pathway enrichment evaluation with respect to biomedical text-mining results and to develop a novel metric to quantify the effect. METHODS: Four biomedical text mining methods were selected to represent natural language processing methods on drug-related gene mining. Subsequently, a pathway enrichment experiment was performed by using the mined genes, and a series of inverse pathway frequency (IPF) metrics was proposed accordingly to evaluate the effect of pathway enrichment. Thereafter, 7 IPF metrics and traditional P value metrics were compared in simulation experiments to test the robustness of the proposed metrics. RESULTS: IPF metrics were evaluated in a case study of rapamycin-related gene set. By applying the best IPF metrics in a pathway enrichment simulation test, a novel discovery of drug efficacy of rapamycin for breast cancer was replicated from the data chosen prior to the year 2000. Our findings show the effectiveness of the best IPF metric in support of knowledge discovery in new drug use. Further, the mechanism underlying the drug-disease association was visualized by Cytoscape. CONCLUSIONS: The results of this study suggest the effectiveness of the proposed IPF metrics in pathway enrichment evaluation as well as its application in drug use discovery. JMIR Publications 2021-06-18 /pmc/articles/PMC8277388/ /pubmed/34142969 http://dx.doi.org/10.2196/28247 Text en ©Xuan Qin, Xinzhi Yao, Jingbo Xia. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 18.06.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Qin, Xuan
Yao, Xinzhi
Xia, Jingbo
A Novel Metric to Quantify the Effect of Pathway Enrichment Evaluation With Respect to Biomedical Text-Mined Terms: Development and Feasibility Study
title A Novel Metric to Quantify the Effect of Pathway Enrichment Evaluation With Respect to Biomedical Text-Mined Terms: Development and Feasibility Study
title_full A Novel Metric to Quantify the Effect of Pathway Enrichment Evaluation With Respect to Biomedical Text-Mined Terms: Development and Feasibility Study
title_fullStr A Novel Metric to Quantify the Effect of Pathway Enrichment Evaluation With Respect to Biomedical Text-Mined Terms: Development and Feasibility Study
title_full_unstemmed A Novel Metric to Quantify the Effect of Pathway Enrichment Evaluation With Respect to Biomedical Text-Mined Terms: Development and Feasibility Study
title_short A Novel Metric to Quantify the Effect of Pathway Enrichment Evaluation With Respect to Biomedical Text-Mined Terms: Development and Feasibility Study
title_sort novel metric to quantify the effect of pathway enrichment evaluation with respect to biomedical text-mined terms: development and feasibility study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8277388/
https://www.ncbi.nlm.nih.gov/pubmed/34142969
http://dx.doi.org/10.2196/28247
work_keys_str_mv AT qinxuan anovelmetrictoquantifytheeffectofpathwayenrichmentevaluationwithrespecttobiomedicaltextminedtermsdevelopmentandfeasibilitystudy
AT yaoxinzhi anovelmetrictoquantifytheeffectofpathwayenrichmentevaluationwithrespecttobiomedicaltextminedtermsdevelopmentandfeasibilitystudy
AT xiajingbo anovelmetrictoquantifytheeffectofpathwayenrichmentevaluationwithrespecttobiomedicaltextminedtermsdevelopmentandfeasibilitystudy
AT qinxuan novelmetrictoquantifytheeffectofpathwayenrichmentevaluationwithrespecttobiomedicaltextminedtermsdevelopmentandfeasibilitystudy
AT yaoxinzhi novelmetrictoquantifytheeffectofpathwayenrichmentevaluationwithrespecttobiomedicaltextminedtermsdevelopmentandfeasibilitystudy
AT xiajingbo novelmetrictoquantifytheeffectofpathwayenrichmentevaluationwithrespecttobiomedicaltextminedtermsdevelopmentandfeasibilitystudy