Cargando…

Artificial Intelligence Clinical Evidence Engine for Automatic Identification, Prioritization, and Extraction of Relevant Clinical Oncology Research

We developed a system to automate analysis of the clinical oncology scientific literature from bibliographic databases and match articles to specific patient cohorts to answer specific questions regarding the efficacy of a treatment. The approach attempts to replicate a clinician’s mental processes...

Descripción completa

Detalles Bibliográficos
Autores principales: Saiz, Fernando Suarez, Sanders, Corey, Stevens, Rick, Nielsen, Robert, Britt, Michael, Yuravlivker, Leemor, Preininger, Anita M., Jackson, Gretchen P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society of Clinical Oncology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8140792/
https://www.ncbi.nlm.nih.gov/pubmed/33439724
http://dx.doi.org/10.1200/CCI.20.00087
_version_ 1783696245730050048
author Saiz, Fernando Suarez
Sanders, Corey
Stevens, Rick
Nielsen, Robert
Britt, Michael
Yuravlivker, Leemor
Preininger, Anita M.
Jackson, Gretchen P.
author_facet Saiz, Fernando Suarez
Sanders, Corey
Stevens, Rick
Nielsen, Robert
Britt, Michael
Yuravlivker, Leemor
Preininger, Anita M.
Jackson, Gretchen P.
author_sort Saiz, Fernando Suarez
collection PubMed
description We developed a system to automate analysis of the clinical oncology scientific literature from bibliographic databases and match articles to specific patient cohorts to answer specific questions regarding the efficacy of a treatment. The approach attempts to replicate a clinician’s mental processes when reviewing published literature in the context of a patient case. We describe the system and evaluate its performance. METHODS: We developed separate ground truth data sets for each of the tasks described in the paper. The first ground truth was used to measure the natural language processing (NLP) accuracy from approximately 1,300 papers covering approximately 3,100 statements and approximately 25 concepts; performance was evaluated using a standard F1 score. The ground truth for the expert classifier model was generated by dividing papers cited in clinical guidelines into a training set and a test set in an 80:20 ratio, and performance was evaluated for accuracy, sensitivity, and specificity. RESULTS: The NLP models were able to identify individual attributes with a 0.7-0.9 F1 score, depending on the attribute of interest. The expert classifier machine learning model was able to classify the individual records with a 0.93 accuracy (95% CI, 0.9 to 0.96, P < .0001), and sensitivity and specificity of 0.95 and 0.91, respectively. Using a decision boundary of 0.5 for the positive (expert) label, the classifier demonstrated an F1 score of 0.92. CONCLUSION: The system identified and extracted evidence from the oncology literature with a high degree of accuracy, sensitivity, and specificity. This tool enables timely access to the most relevant biomedical literature, providing critical support to evidence-based practice in areas of rapidly evolving science.
format Online
Article
Text
id pubmed-8140792
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Society of Clinical Oncology
record_format MEDLINE/PubMed
spelling pubmed-81407922022-01-13 Artificial Intelligence Clinical Evidence Engine for Automatic Identification, Prioritization, and Extraction of Relevant Clinical Oncology Research Saiz, Fernando Suarez Sanders, Corey Stevens, Rick Nielsen, Robert Britt, Michael Yuravlivker, Leemor Preininger, Anita M. Jackson, Gretchen P. JCO Clin Cancer Inform ORIGINAL REPORTS We developed a system to automate analysis of the clinical oncology scientific literature from bibliographic databases and match articles to specific patient cohorts to answer specific questions regarding the efficacy of a treatment. The approach attempts to replicate a clinician’s mental processes when reviewing published literature in the context of a patient case. We describe the system and evaluate its performance. METHODS: We developed separate ground truth data sets for each of the tasks described in the paper. The first ground truth was used to measure the natural language processing (NLP) accuracy from approximately 1,300 papers covering approximately 3,100 statements and approximately 25 concepts; performance was evaluated using a standard F1 score. The ground truth for the expert classifier model was generated by dividing papers cited in clinical guidelines into a training set and a test set in an 80:20 ratio, and performance was evaluated for accuracy, sensitivity, and specificity. RESULTS: The NLP models were able to identify individual attributes with a 0.7-0.9 F1 score, depending on the attribute of interest. The expert classifier machine learning model was able to classify the individual records with a 0.93 accuracy (95% CI, 0.9 to 0.96, P < .0001), and sensitivity and specificity of 0.95 and 0.91, respectively. Using a decision boundary of 0.5 for the positive (expert) label, the classifier demonstrated an F1 score of 0.92. CONCLUSION: The system identified and extracted evidence from the oncology literature with a high degree of accuracy, sensitivity, and specificity. This tool enables timely access to the most relevant biomedical literature, providing critical support to evidence-based practice in areas of rapidly evolving science. American Society of Clinical Oncology 2021-01-13 /pmc/articles/PMC8140792/ /pubmed/33439724 http://dx.doi.org/10.1200/CCI.20.00087 Text en © 2021 by American Society of Clinical Oncology https://creativecommons.org/licenses/by-nc-nd/4.0/Creative Commons Attribution Non-Commercial No Derivatives 4.0 License: https://creativecommons.org/licenses/by-nc-nd/4.0/
spellingShingle ORIGINAL REPORTS
Saiz, Fernando Suarez
Sanders, Corey
Stevens, Rick
Nielsen, Robert
Britt, Michael
Yuravlivker, Leemor
Preininger, Anita M.
Jackson, Gretchen P.
Artificial Intelligence Clinical Evidence Engine for Automatic Identification, Prioritization, and Extraction of Relevant Clinical Oncology Research
title Artificial Intelligence Clinical Evidence Engine for Automatic Identification, Prioritization, and Extraction of Relevant Clinical Oncology Research
title_full Artificial Intelligence Clinical Evidence Engine for Automatic Identification, Prioritization, and Extraction of Relevant Clinical Oncology Research
title_fullStr Artificial Intelligence Clinical Evidence Engine for Automatic Identification, Prioritization, and Extraction of Relevant Clinical Oncology Research
title_full_unstemmed Artificial Intelligence Clinical Evidence Engine for Automatic Identification, Prioritization, and Extraction of Relevant Clinical Oncology Research
title_short Artificial Intelligence Clinical Evidence Engine for Automatic Identification, Prioritization, and Extraction of Relevant Clinical Oncology Research
title_sort artificial intelligence clinical evidence engine for automatic identification, prioritization, and extraction of relevant clinical oncology research
topic ORIGINAL REPORTS
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8140792/
https://www.ncbi.nlm.nih.gov/pubmed/33439724
http://dx.doi.org/10.1200/CCI.20.00087
work_keys_str_mv AT saizfernandosuarez artificialintelligenceclinicalevidenceengineforautomaticidentificationprioritizationandextractionofrelevantclinicaloncologyresearch
AT sanderscorey artificialintelligenceclinicalevidenceengineforautomaticidentificationprioritizationandextractionofrelevantclinicaloncologyresearch
AT stevensrick artificialintelligenceclinicalevidenceengineforautomaticidentificationprioritizationandextractionofrelevantclinicaloncologyresearch
AT nielsenrobert artificialintelligenceclinicalevidenceengineforautomaticidentificationprioritizationandextractionofrelevantclinicaloncologyresearch
AT brittmichael artificialintelligenceclinicalevidenceengineforautomaticidentificationprioritizationandextractionofrelevantclinicaloncologyresearch
AT yuravlivkerleemor artificialintelligenceclinicalevidenceengineforautomaticidentificationprioritizationandextractionofrelevantclinicaloncologyresearch
AT preiningeranitam artificialintelligenceclinicalevidenceengineforautomaticidentificationprioritizationandextractionofrelevantclinicaloncologyresearch
AT jacksongretchenp artificialintelligenceclinicalevidenceengineforautomaticidentificationprioritizationandextractionofrelevantclinicaloncologyresearch