Cargando…

A method for discovering and inferring appropriate eligibility criteria in clinical trial protocols without labeled data

BACKGROUND: We consider the user task of designing clinical trial protocols and propose a method that discovers and outputs the most appropriate eligibility criteria from a potentially huge set of candidates. Each document d in our collection D is a clinical trial protocol which itself contains a se...

Descripción completa

Detalles Bibliográficos
Autores principales: Restificar, Angelo, Korkontzelos, Ioannis, Ananiadou, Sophia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3618207/
https://www.ncbi.nlm.nih.gov/pubmed/23566239
http://dx.doi.org/10.1186/1472-6947-13-S1-S6
_version_ 1782265376888848384
author Restificar, Angelo
Korkontzelos, Ioannis
Ananiadou, Sophia
author_facet Restificar, Angelo
Korkontzelos, Ioannis
Ananiadou, Sophia
author_sort Restificar, Angelo
collection PubMed
description BACKGROUND: We consider the user task of designing clinical trial protocols and propose a method that discovers and outputs the most appropriate eligibility criteria from a potentially huge set of candidates. Each document d in our collection D is a clinical trial protocol which itself contains a set of eligibility criteria. Given a small set of sample documents [Formula: see text] , a user has initially identified as relevant e.g., via a user query interface, our scoring method automatically suggests eligibility criteria from D, D ⊃ D', by ranking them according to how appropriate they are to the clinical trial protocol currently being designed. The appropriateness is measured by the degree to which they are consistent with the user-supplied sample documents D'. METHOD: We propose a novel three-step method called LDALR which views documents as a mixture of latent topics. First, we infer the latent topics in the sample documents using Latent Dirichlet Allocation (LDA). Next, we use logistic regression models to compute the probability that a given candidate criterion belongs to a particular topic. Lastly, we score each criterion by computing its expected value, the probability-weighted sum of the topic proportions inferred from the set of sample documents. Intuitively, the greater the probability that a candidate criterion belongs to the topics that are dominant in the samples, the higher its expected value or score. RESULTS: Our experiments have shown that LDALR is 8 and 9 times better (resp., for inclusion and exclusion criteria) than randomly choosing from a set of candidates obtained from relevant documents. In user simulation experiments using LDALR, we were able to automatically construct eligibility criteria that are on the average 75% and 70% (resp., for inclusion and exclusion criteria) similar to the correct eligibility criteria. CONCLUSIONS: We have proposed LDALR, a practical method for discovering and inferring appropriate eligibility criteria in clinical trial protocols without labeled data. Results from our experiments suggest that LDALR models can be used to effectively find appropriate eligibility criteria from a large repository of clinical trial protocols.
format Online
Article
Text
id pubmed-3618207
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36182072013-04-10 A method for discovering and inferring appropriate eligibility criteria in clinical trial protocols without labeled data Restificar, Angelo Korkontzelos, Ioannis Ananiadou, Sophia BMC Med Inform Decis Mak Proceedings BACKGROUND: We consider the user task of designing clinical trial protocols and propose a method that discovers and outputs the most appropriate eligibility criteria from a potentially huge set of candidates. Each document d in our collection D is a clinical trial protocol which itself contains a set of eligibility criteria. Given a small set of sample documents [Formula: see text] , a user has initially identified as relevant e.g., via a user query interface, our scoring method automatically suggests eligibility criteria from D, D ⊃ D', by ranking them according to how appropriate they are to the clinical trial protocol currently being designed. The appropriateness is measured by the degree to which they are consistent with the user-supplied sample documents D'. METHOD: We propose a novel three-step method called LDALR which views documents as a mixture of latent topics. First, we infer the latent topics in the sample documents using Latent Dirichlet Allocation (LDA). Next, we use logistic regression models to compute the probability that a given candidate criterion belongs to a particular topic. Lastly, we score each criterion by computing its expected value, the probability-weighted sum of the topic proportions inferred from the set of sample documents. Intuitively, the greater the probability that a candidate criterion belongs to the topics that are dominant in the samples, the higher its expected value or score. RESULTS: Our experiments have shown that LDALR is 8 and 9 times better (resp., for inclusion and exclusion criteria) than randomly choosing from a set of candidates obtained from relevant documents. In user simulation experiments using LDALR, we were able to automatically construct eligibility criteria that are on the average 75% and 70% (resp., for inclusion and exclusion criteria) similar to the correct eligibility criteria. CONCLUSIONS: We have proposed LDALR, a practical method for discovering and inferring appropriate eligibility criteria in clinical trial protocols without labeled data. Results from our experiments suggest that LDALR models can be used to effectively find appropriate eligibility criteria from a large repository of clinical trial protocols. BioMed Central 2013-04-05 /pmc/articles/PMC3618207/ /pubmed/23566239 http://dx.doi.org/10.1186/1472-6947-13-S1-S6 Text en Copyright © 2013 Restificar et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Restificar, Angelo
Korkontzelos, Ioannis
Ananiadou, Sophia
A method for discovering and inferring appropriate eligibility criteria in clinical trial protocols without labeled data
title A method for discovering and inferring appropriate eligibility criteria in clinical trial protocols without labeled data
title_full A method for discovering and inferring appropriate eligibility criteria in clinical trial protocols without labeled data
title_fullStr A method for discovering and inferring appropriate eligibility criteria in clinical trial protocols without labeled data
title_full_unstemmed A method for discovering and inferring appropriate eligibility criteria in clinical trial protocols without labeled data
title_short A method for discovering and inferring appropriate eligibility criteria in clinical trial protocols without labeled data
title_sort method for discovering and inferring appropriate eligibility criteria in clinical trial protocols without labeled data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3618207/
https://www.ncbi.nlm.nih.gov/pubmed/23566239
http://dx.doi.org/10.1186/1472-6947-13-S1-S6
work_keys_str_mv AT restificarangelo amethodfordiscoveringandinferringappropriateeligibilitycriteriainclinicaltrialprotocolswithoutlabeleddata
AT korkontzelosioannis amethodfordiscoveringandinferringappropriateeligibilitycriteriainclinicaltrialprotocolswithoutlabeleddata
AT ananiadousophia amethodfordiscoveringandinferringappropriateeligibilitycriteriainclinicaltrialprotocolswithoutlabeleddata
AT restificarangelo methodfordiscoveringandinferringappropriateeligibilitycriteriainclinicaltrialprotocolswithoutlabeleddata
AT korkontzelosioannis methodfordiscoveringandinferringappropriateeligibilitycriteriainclinicaltrialprotocolswithoutlabeleddata
AT ananiadousophia methodfordiscoveringandinferringappropriateeligibilitycriteriainclinicaltrialprotocolswithoutlabeleddata