Cargando…

Development and Evaluation of a Natural Language Processing Annotation Tool to Facilitate Phenotyping of Cognitive Status in Electronic Health Records: Diagnostic Study

BACKGROUND: Electronic health records (EHRs) with large sample sizes and rich information offer great potential for dementia research, but current methods of phenotyping cognitive status are not scalable. OBJECTIVE: The aim of this study was to evaluate whether natural language processing (NLP)–powe...

Descripción completa

Detalles Bibliográficos
Autores principales: Noori, Ayush, Magdamo, Colin, Liu, Xiao, Tyagi, Tanish, Li, Zhaozhi, Kondepudi, Akhil, Alabsi, Haitham, Rudmann, Emily, Wilcox, Douglas, Brenner, Laura, Robbins, Gregory K, Moura, Lidia, Zafar, Sahar, Benson, Nicole M, Hsu, John, R Dickson, John, Serrano-Pozo, Alberto, Hyman, Bradley T, Blacker, Deborah, Westover, M Brandon, Mukerji, Shibani S, Das, Sudeshna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9472045/
https://www.ncbi.nlm.nih.gov/pubmed/36040790
http://dx.doi.org/10.2196/40384
_version_ 1784789220812914688
author Noori, Ayush
Magdamo, Colin
Liu, Xiao
Tyagi, Tanish
Li, Zhaozhi
Kondepudi, Akhil
Alabsi, Haitham
Rudmann, Emily
Wilcox, Douglas
Brenner, Laura
Robbins, Gregory K
Moura, Lidia
Zafar, Sahar
Benson, Nicole M
Hsu, John
R Dickson, John
Serrano-Pozo, Alberto
Hyman, Bradley T
Blacker, Deborah
Westover, M Brandon
Mukerji, Shibani S
Das, Sudeshna
author_facet Noori, Ayush
Magdamo, Colin
Liu, Xiao
Tyagi, Tanish
Li, Zhaozhi
Kondepudi, Akhil
Alabsi, Haitham
Rudmann, Emily
Wilcox, Douglas
Brenner, Laura
Robbins, Gregory K
Moura, Lidia
Zafar, Sahar
Benson, Nicole M
Hsu, John
R Dickson, John
Serrano-Pozo, Alberto
Hyman, Bradley T
Blacker, Deborah
Westover, M Brandon
Mukerji, Shibani S
Das, Sudeshna
author_sort Noori, Ayush
collection PubMed
description BACKGROUND: Electronic health records (EHRs) with large sample sizes and rich information offer great potential for dementia research, but current methods of phenotyping cognitive status are not scalable. OBJECTIVE: The aim of this study was to evaluate whether natural language processing (NLP)–powered semiautomated annotation can improve the speed and interrater reliability of chart reviews for phenotyping cognitive status. METHODS: In this diagnostic study, we developed and evaluated a semiautomated NLP-powered annotation tool (NAT) to facilitate phenotyping of cognitive status. Clinical experts adjudicated the cognitive status of 627 patients at Mass General Brigham (MGB) health care, using NAT or traditional chart reviews. Patient charts contained EHR data from two data sets: (1) records from January 1, 2017, to December 31, 2018, for 100 Medicare beneficiaries from the MGB Accountable Care Organization and (2) records from 2 years prior to COVID-19 diagnosis to the date of COVID-19 diagnosis for 527 MGB patients. All EHR data from the relevant period were extracted; diagnosis codes, medications, and laboratory test values were processed and summarized; clinical notes were processed through an NLP pipeline; and a web tool was developed to present an integrated view of all data. Cognitive status was rated as cognitively normal, cognitively impaired, or undetermined. Assessment time and interrater agreement of NAT compared to manual chart reviews for cognitive status phenotyping was evaluated. RESULTS: NAT adjudication provided higher interrater agreement (Cohen κ=0.89 vs κ=0.80) and significant speed up (time difference mean 1.4, SD 1.3 minutes; P<.001; ratio median 2.2, min-max 0.4-20) over manual chart reviews. There was moderate agreement with manual chart reviews (Cohen κ=0.67). In the cases that exhibited disagreement with manual chart reviews, NAT adjudication was able to produce assessments that had broader clinical consensus due to its integrated view of highlighted relevant information and semiautomated NLP features. CONCLUSIONS: NAT adjudication improves the speed and interrater reliability for phenotyping cognitive status compared to manual chart reviews. This study underscores the potential of an NLP-based clinically adjudicated method to build large-scale dementia research cohorts from EHRs.
format Online
Article
Text
id pubmed-9472045
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-94720452022-09-15 Development and Evaluation of a Natural Language Processing Annotation Tool to Facilitate Phenotyping of Cognitive Status in Electronic Health Records: Diagnostic Study Noori, Ayush Magdamo, Colin Liu, Xiao Tyagi, Tanish Li, Zhaozhi Kondepudi, Akhil Alabsi, Haitham Rudmann, Emily Wilcox, Douglas Brenner, Laura Robbins, Gregory K Moura, Lidia Zafar, Sahar Benson, Nicole M Hsu, John R Dickson, John Serrano-Pozo, Alberto Hyman, Bradley T Blacker, Deborah Westover, M Brandon Mukerji, Shibani S Das, Sudeshna J Med Internet Res Original Paper BACKGROUND: Electronic health records (EHRs) with large sample sizes and rich information offer great potential for dementia research, but current methods of phenotyping cognitive status are not scalable. OBJECTIVE: The aim of this study was to evaluate whether natural language processing (NLP)–powered semiautomated annotation can improve the speed and interrater reliability of chart reviews for phenotyping cognitive status. METHODS: In this diagnostic study, we developed and evaluated a semiautomated NLP-powered annotation tool (NAT) to facilitate phenotyping of cognitive status. Clinical experts adjudicated the cognitive status of 627 patients at Mass General Brigham (MGB) health care, using NAT or traditional chart reviews. Patient charts contained EHR data from two data sets: (1) records from January 1, 2017, to December 31, 2018, for 100 Medicare beneficiaries from the MGB Accountable Care Organization and (2) records from 2 years prior to COVID-19 diagnosis to the date of COVID-19 diagnosis for 527 MGB patients. All EHR data from the relevant period were extracted; diagnosis codes, medications, and laboratory test values were processed and summarized; clinical notes were processed through an NLP pipeline; and a web tool was developed to present an integrated view of all data. Cognitive status was rated as cognitively normal, cognitively impaired, or undetermined. Assessment time and interrater agreement of NAT compared to manual chart reviews for cognitive status phenotyping was evaluated. RESULTS: NAT adjudication provided higher interrater agreement (Cohen κ=0.89 vs κ=0.80) and significant speed up (time difference mean 1.4, SD 1.3 minutes; P<.001; ratio median 2.2, min-max 0.4-20) over manual chart reviews. There was moderate agreement with manual chart reviews (Cohen κ=0.67). In the cases that exhibited disagreement with manual chart reviews, NAT adjudication was able to produce assessments that had broader clinical consensus due to its integrated view of highlighted relevant information and semiautomated NLP features. CONCLUSIONS: NAT adjudication improves the speed and interrater reliability for phenotyping cognitive status compared to manual chart reviews. This study underscores the potential of an NLP-based clinically adjudicated method to build large-scale dementia research cohorts from EHRs. JMIR Publications 2022-08-30 /pmc/articles/PMC9472045/ /pubmed/36040790 http://dx.doi.org/10.2196/40384 Text en ©Ayush Noori, Colin Magdamo, Xiao Liu, Tanish Tyagi, Zhaozhi Li, Akhil Kondepudi, Haitham Alabsi, Emily Rudmann, Douglas Wilcox, Laura Brenner, Gregory K Robbins, Lidia Moura, Sahar Zafar, Nicole M Benson, John Hsu, John R Dickson, Alberto Serrano-Pozo, Bradley T Hyman, Deborah Blacker, M Brandon Westover, Shibani S Mukerji, Sudeshna Das. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 30.08.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Noori, Ayush
Magdamo, Colin
Liu, Xiao
Tyagi, Tanish
Li, Zhaozhi
Kondepudi, Akhil
Alabsi, Haitham
Rudmann, Emily
Wilcox, Douglas
Brenner, Laura
Robbins, Gregory K
Moura, Lidia
Zafar, Sahar
Benson, Nicole M
Hsu, John
R Dickson, John
Serrano-Pozo, Alberto
Hyman, Bradley T
Blacker, Deborah
Westover, M Brandon
Mukerji, Shibani S
Das, Sudeshna
Development and Evaluation of a Natural Language Processing Annotation Tool to Facilitate Phenotyping of Cognitive Status in Electronic Health Records: Diagnostic Study
title Development and Evaluation of a Natural Language Processing Annotation Tool to Facilitate Phenotyping of Cognitive Status in Electronic Health Records: Diagnostic Study
title_full Development and Evaluation of a Natural Language Processing Annotation Tool to Facilitate Phenotyping of Cognitive Status in Electronic Health Records: Diagnostic Study
title_fullStr Development and Evaluation of a Natural Language Processing Annotation Tool to Facilitate Phenotyping of Cognitive Status in Electronic Health Records: Diagnostic Study
title_full_unstemmed Development and Evaluation of a Natural Language Processing Annotation Tool to Facilitate Phenotyping of Cognitive Status in Electronic Health Records: Diagnostic Study
title_short Development and Evaluation of a Natural Language Processing Annotation Tool to Facilitate Phenotyping of Cognitive Status in Electronic Health Records: Diagnostic Study
title_sort development and evaluation of a natural language processing annotation tool to facilitate phenotyping of cognitive status in electronic health records: diagnostic study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9472045/
https://www.ncbi.nlm.nih.gov/pubmed/36040790
http://dx.doi.org/10.2196/40384
work_keys_str_mv AT nooriayush developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT magdamocolin developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT liuxiao developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT tyagitanish developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT lizhaozhi developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT kondepudiakhil developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT alabsihaitham developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT rudmannemily developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT wilcoxdouglas developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT brennerlaura developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT robbinsgregoryk developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT mouralidia developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT zafarsahar developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT bensonnicolem developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT hsujohn developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT rdicksonjohn developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT serranopozoalberto developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT hymanbradleyt developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT blackerdeborah developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT westovermbrandon developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT mukerjishibanis developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy
AT dassudeshna developmentandevaluationofanaturallanguageprocessingannotationtooltofacilitatephenotypingofcognitivestatusinelectronichealthrecordsdiagnosticstudy