Cargando…

A review of approaches to identifying patient phenotype cohorts using electronic health records

OBJECTIVE: To summarize literature describing approaches aimed at automatically identifying patients with a common phenotype. MATERIALS AND METHODS: We performed a review of studies describing systems or reporting techniques developed for identifying cohorts of patients with specific phenotypes. Eve...

Descripción completa

Detalles Bibliográficos
Autores principales: Shivade, Chaitanya, Raghavan, Preethi, Fosler-Lussier, Eric, Embi, Peter J, Elhadad, Noemie, Johnson, Stephen B, Lai, Albert M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3932460/
https://www.ncbi.nlm.nih.gov/pubmed/24201027
http://dx.doi.org/10.1136/amiajnl-2013-001935
_version_ 1782304798995906560
author Shivade, Chaitanya
Raghavan, Preethi
Fosler-Lussier, Eric
Embi, Peter J
Elhadad, Noemie
Johnson, Stephen B
Lai, Albert M
author_facet Shivade, Chaitanya
Raghavan, Preethi
Fosler-Lussier, Eric
Embi, Peter J
Elhadad, Noemie
Johnson, Stephen B
Lai, Albert M
author_sort Shivade, Chaitanya
collection PubMed
description OBJECTIVE: To summarize literature describing approaches aimed at automatically identifying patients with a common phenotype. MATERIALS AND METHODS: We performed a review of studies describing systems or reporting techniques developed for identifying cohorts of patients with specific phenotypes. Every full text article published in (1) Journal of American Medical Informatics Association, (2) Journal of Biomedical Informatics, (3) Proceedings of the Annual American Medical Informatics Association Symposium, and (4) Proceedings of Clinical Research Informatics Conference within the past 3 years was assessed for inclusion in the review. Only articles using automated techniques were included. RESULTS: Ninety-seven articles met our inclusion criteria. Forty-six used natural language processing (NLP)-based techniques, 24 described rule-based systems, 41 used statistical analyses, data mining, or machine learning techniques, while 22 described hybrid systems. Nine articles described the architecture of large-scale systems developed for determining cohort eligibility of patients. DISCUSSION: We observe that there is a rise in the number of studies associated with cohort identification using electronic medical records. Statistical analyses or machine learning, followed by NLP techniques, are gaining popularity over the years in comparison with rule-based systems. CONCLUSIONS: There are a variety of approaches for classifying patients into a particular phenotype. Different techniques and data sources are used, and good performance is reported on datasets at respective institutions. However, no system makes comprehensive use of electronic medical records addressing all of their known weaknesses.
format Online
Article
Text
id pubmed-3932460
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-39324602014-02-24 A review of approaches to identifying patient phenotype cohorts using electronic health records Shivade, Chaitanya Raghavan, Preethi Fosler-Lussier, Eric Embi, Peter J Elhadad, Noemie Johnson, Stephen B Lai, Albert M J Am Med Inform Assoc Review OBJECTIVE: To summarize literature describing approaches aimed at automatically identifying patients with a common phenotype. MATERIALS AND METHODS: We performed a review of studies describing systems or reporting techniques developed for identifying cohorts of patients with specific phenotypes. Every full text article published in (1) Journal of American Medical Informatics Association, (2) Journal of Biomedical Informatics, (3) Proceedings of the Annual American Medical Informatics Association Symposium, and (4) Proceedings of Clinical Research Informatics Conference within the past 3 years was assessed for inclusion in the review. Only articles using automated techniques were included. RESULTS: Ninety-seven articles met our inclusion criteria. Forty-six used natural language processing (NLP)-based techniques, 24 described rule-based systems, 41 used statistical analyses, data mining, or machine learning techniques, while 22 described hybrid systems. Nine articles described the architecture of large-scale systems developed for determining cohort eligibility of patients. DISCUSSION: We observe that there is a rise in the number of studies associated with cohort identification using electronic medical records. Statistical analyses or machine learning, followed by NLP techniques, are gaining popularity over the years in comparison with rule-based systems. CONCLUSIONS: There are a variety of approaches for classifying patients into a particular phenotype. Different techniques and data sources are used, and good performance is reported on datasets at respective institutions. However, no system makes comprehensive use of electronic medical records addressing all of their known weaknesses. BMJ Publishing Group 2014-03 2013-11-07 /pmc/articles/PMC3932460/ /pubmed/24201027 http://dx.doi.org/10.1136/amiajnl-2013-001935 Text en Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 3.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Review
Shivade, Chaitanya
Raghavan, Preethi
Fosler-Lussier, Eric
Embi, Peter J
Elhadad, Noemie
Johnson, Stephen B
Lai, Albert M
A review of approaches to identifying patient phenotype cohorts using electronic health records
title A review of approaches to identifying patient phenotype cohorts using electronic health records
title_full A review of approaches to identifying patient phenotype cohorts using electronic health records
title_fullStr A review of approaches to identifying patient phenotype cohorts using electronic health records
title_full_unstemmed A review of approaches to identifying patient phenotype cohorts using electronic health records
title_short A review of approaches to identifying patient phenotype cohorts using electronic health records
title_sort review of approaches to identifying patient phenotype cohorts using electronic health records
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3932460/
https://www.ncbi.nlm.nih.gov/pubmed/24201027
http://dx.doi.org/10.1136/amiajnl-2013-001935
work_keys_str_mv AT shivadechaitanya areviewofapproachestoidentifyingpatientphenotypecohortsusingelectronichealthrecords
AT raghavanpreethi areviewofapproachestoidentifyingpatientphenotypecohortsusingelectronichealthrecords
AT foslerlussiereric areviewofapproachestoidentifyingpatientphenotypecohortsusingelectronichealthrecords
AT embipeterj areviewofapproachestoidentifyingpatientphenotypecohortsusingelectronichealthrecords
AT elhadadnoemie areviewofapproachestoidentifyingpatientphenotypecohortsusingelectronichealthrecords
AT johnsonstephenb areviewofapproachestoidentifyingpatientphenotypecohortsusingelectronichealthrecords
AT laialbertm areviewofapproachestoidentifyingpatientphenotypecohortsusingelectronichealthrecords
AT shivadechaitanya reviewofapproachestoidentifyingpatientphenotypecohortsusingelectronichealthrecords
AT raghavanpreethi reviewofapproachestoidentifyingpatientphenotypecohortsusingelectronichealthrecords
AT foslerlussiereric reviewofapproachestoidentifyingpatientphenotypecohortsusingelectronichealthrecords
AT embipeterj reviewofapproachestoidentifyingpatientphenotypecohortsusingelectronichealthrecords
AT elhadadnoemie reviewofapproachestoidentifyingpatientphenotypecohortsusingelectronichealthrecords
AT johnsonstephenb reviewofapproachestoidentifyingpatientphenotypecohortsusingelectronichealthrecords
AT laialbertm reviewofapproachestoidentifyingpatientphenotypecohortsusingelectronichealthrecords