Cargando…

Learning relevance models for patient cohort retrieval

OBJECTIVE: We explored how judgements provided by physicians can be used to learn relevance models that enhance the quality of patient cohorts retrieved from Electronic Health Records (EHRs) collections. METHODS: A very large number of features were extracted from patient cohort descriptions as well...

Descripción completa

Detalles Bibliográficos
Autores principales:	Goodwin, Travis R, Harabagiu, Sanda M
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2018
Materias:	Research and Applications
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6241510/ https://www.ncbi.nlm.nih.gov/pubmed/30474078 http://dx.doi.org/10.1093/jamiaopen/ooy010

_version_	1783371775081447424
author	Goodwin, Travis R Harabagiu, Sanda M
author_facet	Goodwin, Travis R Harabagiu, Sanda M
author_sort	Goodwin, Travis R
collection	PubMed
description	OBJECTIVE: We explored how judgements provided by physicians can be used to learn relevance models that enhance the quality of patient cohorts retrieved from Electronic Health Records (EHRs) collections. METHODS: A very large number of features were extracted from patient cohort descriptions as well as EHR collections. The features were used to investigate retrieving (1) neurology-specific patient cohorts from the de-identified Temple University Hospital electroencephalography (EEG) Corpus as well as (2) the more general cohorts evaluated in the TREC Medical Records Track (TRECMed) from the de-identified hospital records provided by the University of Pittsburgh Medical Center. The features informed a learning relevance model (LRM) that took advantage of relevance judgements provided by physicians. The LRM implements a pairwise learning-to-rank framework, which enables our learning patient cohort retrieval (L-PCR) system to learn from physicians’ feedback. RESULTS AND DISCUSSION: We evaluated the L-PCR system against state-of-the-art traditional patient cohort retrieval systems, and observed a 27% improvement when operating on EEGs and a 53% improvement when operating on TRECMed EHRs, showing the promise of the L-PCR system. We also performed extensive feature analyses to reveal the most effective strategies for representing cohort descriptions as queries, encoding EHRs, and measuring cohort relevance. CONCLUSION: The L-PCR system has significant promise for reliably retrieving patient cohorts from EHRs in multiple settings when trained with relevance judgments. When provided with additional cohort descriptions, the L-PCR system will continue to learn, thus offering a potential solution to the performance barriers of current cohort retrieval systems.
format	Online Article Text
id	pubmed-6241510
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-62415102018-11-23 Learning relevance models for patient cohort retrieval Goodwin, Travis R Harabagiu, Sanda M JAMIA Open Research and Applications OBJECTIVE: We explored how judgements provided by physicians can be used to learn relevance models that enhance the quality of patient cohorts retrieved from Electronic Health Records (EHRs) collections. METHODS: A very large number of features were extracted from patient cohort descriptions as well as EHR collections. The features were used to investigate retrieving (1) neurology-specific patient cohorts from the de-identified Temple University Hospital electroencephalography (EEG) Corpus as well as (2) the more general cohorts evaluated in the TREC Medical Records Track (TRECMed) from the de-identified hospital records provided by the University of Pittsburgh Medical Center. The features informed a learning relevance model (LRM) that took advantage of relevance judgements provided by physicians. The LRM implements a pairwise learning-to-rank framework, which enables our learning patient cohort retrieval (L-PCR) system to learn from physicians’ feedback. RESULTS AND DISCUSSION: We evaluated the L-PCR system against state-of-the-art traditional patient cohort retrieval systems, and observed a 27% improvement when operating on EEGs and a 53% improvement when operating on TRECMed EHRs, showing the promise of the L-PCR system. We also performed extensive feature analyses to reveal the most effective strategies for representing cohort descriptions as queries, encoding EHRs, and measuring cohort relevance. CONCLUSION: The L-PCR system has significant promise for reliably retrieving patient cohorts from EHRs in multiple settings when trained with relevance judgments. When provided with additional cohort descriptions, the L-PCR system will continue to learn, thus offering a potential solution to the performance barriers of current cohort retrieval systems. Oxford University Press 2018-09-28 /pmc/articles/PMC6241510/ /pubmed/30474078 http://dx.doi.org/10.1093/jamiaopen/ooy010 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of the American Medical Informatics Association. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle	Research and Applications Goodwin, Travis R Harabagiu, Sanda M Learning relevance models for patient cohort retrieval
title	Learning relevance models for patient cohort retrieval
title_full	Learning relevance models for patient cohort retrieval
title_fullStr	Learning relevance models for patient cohort retrieval
title_full_unstemmed	Learning relevance models for patient cohort retrieval
title_short	Learning relevance models for patient cohort retrieval
title_sort	learning relevance models for patient cohort retrieval
topic	Research and Applications
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6241510/ https://www.ncbi.nlm.nih.gov/pubmed/30474078 http://dx.doi.org/10.1093/jamiaopen/ooy010
work_keys_str_mv	AT goodwintravisr learningrelevancemodelsforpatientcohortretrieval AT harabagiusandam learningrelevancemodelsforpatientcohortretrieval

Learning relevance models for patient cohort retrieval

Ejemplares similares