Cargando…

SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research*

OBJECTIVE: Unlocking the data contained within both structured and unstructured components of electronic health records (EHRs) has the potential to provide a step change in data available for secondary research use, generation of actionable medical insights, hospital management, and trial recruitmen...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Honghan, Toti, Giulia, Morley, Katherine I, Ibrahim, Zina M, Folarin, Amos, Jackson, Richard, Kartoglu, Ismail, Agrawal, Asha, Stringer, Clive, Gale, Darren, Gorrell, Genevieve, Roberts, Angus, Broadbent, Matthew, Stewart, Robert, Dobson, Richard JB
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6019046/
https://www.ncbi.nlm.nih.gov/pubmed/29361077
http://dx.doi.org/10.1093/jamia/ocx160
_version_ 1783335066040008704
author Wu, Honghan
Toti, Giulia
Morley, Katherine I
Ibrahim, Zina M
Folarin, Amos
Jackson, Richard
Kartoglu, Ismail
Agrawal, Asha
Stringer, Clive
Gale, Darren
Gorrell, Genevieve
Roberts, Angus
Broadbent, Matthew
Stewart, Robert
Dobson, Richard JB
author_facet Wu, Honghan
Toti, Giulia
Morley, Katherine I
Ibrahim, Zina M
Folarin, Amos
Jackson, Richard
Kartoglu, Ismail
Agrawal, Asha
Stringer, Clive
Gale, Darren
Gorrell, Genevieve
Roberts, Angus
Broadbent, Matthew
Stewart, Robert
Dobson, Richard JB
author_sort Wu, Honghan
collection PubMed
description OBJECTIVE: Unlocking the data contained within both structured and unstructured components of electronic health records (EHRs) has the potential to provide a step change in data available for secondary research use, generation of actionable medical insights, hospital management, and trial recruitment. To achieve this, we implemented SemEHR, an open source semantic search and analytics tool for EHRs. METHODS: SemEHR implements a generic information extraction (IE) and retrieval infrastructure by identifying contextualized mentions of a wide range of biomedical concepts within EHRs. Natural language processing annotations are further assembled at the patient level and extended with EHR-specific knowledge to generate a timeline for each patient. The semantic data are serviced via ontology-based search and analytics interfaces. RESULTS: SemEHR has been deployed at a number of UK hospitals, including the Clinical Record Interactive Search, an anonymized replica of the EHR of the UK South London and Maudsley National Health Service Foundation Trust, one of Europe’s largest providers of mental health services. In 2 Clinical Record Interactive Search–based studies, SemEHR achieved 93% (hepatitis C) and 99% (HIV) F-measure results in identifying true positive patients. At King’s College Hospital in London, as part of the CogStack program (github.com/cogstack), SemEHR is being used to recruit patients into the UK Department of Health 100 000 Genomes Project (genomicsengland.co.uk). The validation study suggests that the tool can validate previously recruited cases and is very fast at searching phenotypes; time for recruitment criteria checking was reduced from days to minutes. Validated on open intensive care EHR data, Medical Information Mart for Intensive Care III, the vital signs extracted by SemEHR can achieve around 97% accuracy. CONCLUSION: Results from the multiple case studies demonstrate SemEHR’s efficiency: weeks or months of work can be done within hours or minutes in some cases. SemEHR provides a more comprehensive view of patients, bringing in more and unexpected insight compared to study-oriented bespoke IE systems. SemEHR is open source, available at https://github.com/CogStack/SemEHR.
format Online
Article
Text
id pubmed-6019046
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-60190462018-07-05 SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research* Wu, Honghan Toti, Giulia Morley, Katherine I Ibrahim, Zina M Folarin, Amos Jackson, Richard Kartoglu, Ismail Agrawal, Asha Stringer, Clive Gale, Darren Gorrell, Genevieve Roberts, Angus Broadbent, Matthew Stewart, Robert Dobson, Richard JB J Am Med Inform Assoc Research and Applications OBJECTIVE: Unlocking the data contained within both structured and unstructured components of electronic health records (EHRs) has the potential to provide a step change in data available for secondary research use, generation of actionable medical insights, hospital management, and trial recruitment. To achieve this, we implemented SemEHR, an open source semantic search and analytics tool for EHRs. METHODS: SemEHR implements a generic information extraction (IE) and retrieval infrastructure by identifying contextualized mentions of a wide range of biomedical concepts within EHRs. Natural language processing annotations are further assembled at the patient level and extended with EHR-specific knowledge to generate a timeline for each patient. The semantic data are serviced via ontology-based search and analytics interfaces. RESULTS: SemEHR has been deployed at a number of UK hospitals, including the Clinical Record Interactive Search, an anonymized replica of the EHR of the UK South London and Maudsley National Health Service Foundation Trust, one of Europe’s largest providers of mental health services. In 2 Clinical Record Interactive Search–based studies, SemEHR achieved 93% (hepatitis C) and 99% (HIV) F-measure results in identifying true positive patients. At King’s College Hospital in London, as part of the CogStack program (github.com/cogstack), SemEHR is being used to recruit patients into the UK Department of Health 100 000 Genomes Project (genomicsengland.co.uk). The validation study suggests that the tool can validate previously recruited cases and is very fast at searching phenotypes; time for recruitment criteria checking was reduced from days to minutes. Validated on open intensive care EHR data, Medical Information Mart for Intensive Care III, the vital signs extracted by SemEHR can achieve around 97% accuracy. CONCLUSION: Results from the multiple case studies demonstrate SemEHR’s efficiency: weeks or months of work can be done within hours or minutes in some cases. SemEHR provides a more comprehensive view of patients, bringing in more and unexpected insight compared to study-oriented bespoke IE systems. SemEHR is open source, available at https://github.com/CogStack/SemEHR. Oxford University Press 2018-01-19 /pmc/articles/PMC6019046/ /pubmed/29361077 http://dx.doi.org/10.1093/jamia/ocx160 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of the American Medical Informatics Association. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research and Applications
Wu, Honghan
Toti, Giulia
Morley, Katherine I
Ibrahim, Zina M
Folarin, Amos
Jackson, Richard
Kartoglu, Ismail
Agrawal, Asha
Stringer, Clive
Gale, Darren
Gorrell, Genevieve
Roberts, Angus
Broadbent, Matthew
Stewart, Robert
Dobson, Richard JB
SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research*
title SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research*
title_full SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research*
title_fullStr SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research*
title_full_unstemmed SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research*
title_short SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research*
title_sort semehr: a general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research*
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6019046/
https://www.ncbi.nlm.nih.gov/pubmed/29361077
http://dx.doi.org/10.1093/jamia/ocx160
work_keys_str_mv AT wuhonghan semehrageneralpurposesemanticsearchsystemtosurfacesemanticdatafromclinicalnotesfortailoredcaretrialrecruitmentandclinicalresearch
AT totigiulia semehrageneralpurposesemanticsearchsystemtosurfacesemanticdatafromclinicalnotesfortailoredcaretrialrecruitmentandclinicalresearch
AT morleykatherinei semehrageneralpurposesemanticsearchsystemtosurfacesemanticdatafromclinicalnotesfortailoredcaretrialrecruitmentandclinicalresearch
AT ibrahimzinam semehrageneralpurposesemanticsearchsystemtosurfacesemanticdatafromclinicalnotesfortailoredcaretrialrecruitmentandclinicalresearch
AT folarinamos semehrageneralpurposesemanticsearchsystemtosurfacesemanticdatafromclinicalnotesfortailoredcaretrialrecruitmentandclinicalresearch
AT jacksonrichard semehrageneralpurposesemanticsearchsystemtosurfacesemanticdatafromclinicalnotesfortailoredcaretrialrecruitmentandclinicalresearch
AT kartogluismail semehrageneralpurposesemanticsearchsystemtosurfacesemanticdatafromclinicalnotesfortailoredcaretrialrecruitmentandclinicalresearch
AT agrawalasha semehrageneralpurposesemanticsearchsystemtosurfacesemanticdatafromclinicalnotesfortailoredcaretrialrecruitmentandclinicalresearch
AT stringerclive semehrageneralpurposesemanticsearchsystemtosurfacesemanticdatafromclinicalnotesfortailoredcaretrialrecruitmentandclinicalresearch
AT galedarren semehrageneralpurposesemanticsearchsystemtosurfacesemanticdatafromclinicalnotesfortailoredcaretrialrecruitmentandclinicalresearch
AT gorrellgenevieve semehrageneralpurposesemanticsearchsystemtosurfacesemanticdatafromclinicalnotesfortailoredcaretrialrecruitmentandclinicalresearch
AT robertsangus semehrageneralpurposesemanticsearchsystemtosurfacesemanticdatafromclinicalnotesfortailoredcaretrialrecruitmentandclinicalresearch
AT broadbentmatthew semehrageneralpurposesemanticsearchsystemtosurfacesemanticdatafromclinicalnotesfortailoredcaretrialrecruitmentandclinicalresearch
AT stewartrobert semehrageneralpurposesemanticsearchsystemtosurfacesemanticdatafromclinicalnotesfortailoredcaretrialrecruitmentandclinicalresearch
AT dobsonrichardjb semehrageneralpurposesemanticsearchsystemtosurfacesemanticdatafromclinicalnotesfortailoredcaretrialrecruitmentandclinicalresearch