Cargando…
Non-negative matrix factorization temporal topic models and clinical text data identify COVID-19 pandemic effects on primary healthcare and community health in Toronto, Canada
OBJECTIVE: To demonstrate how non-negative matrix factorization can be used to learn a temporal topic model over a large collection of primary care clinical notes, characterizing diverse COVID-19 pandemic effects on the physical/mental/social health of residents of Toronto, Canada. MATERIALS AND MET...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Published by Elsevier Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8861144/ https://www.ncbi.nlm.nih.gov/pubmed/35202844 http://dx.doi.org/10.1016/j.jbi.2022.104034 |
_version_ | 1784654824107671552 |
---|---|
author | Meaney, Christopher Escobar, Michael Moineddin, Rahim Stukel, Therese A. Kalia, Sumeet Aliarzadeh, Babak Chen, Tao O'Neill, Braden Greiver, Michelle |
author_facet | Meaney, Christopher Escobar, Michael Moineddin, Rahim Stukel, Therese A. Kalia, Sumeet Aliarzadeh, Babak Chen, Tao O'Neill, Braden Greiver, Michelle |
author_sort | Meaney, Christopher |
collection | PubMed |
description | OBJECTIVE: To demonstrate how non-negative matrix factorization can be used to learn a temporal topic model over a large collection of primary care clinical notes, characterizing diverse COVID-19 pandemic effects on the physical/mental/social health of residents of Toronto, Canada. MATERIALS AND METHODS: The study employs a retrospective open cohort design, consisting of 382,666 primary care progress notes from 44,828 patients, 54 physicians, and 12 clinics collected 01/01/2017 through 31/12/2020. Non-negative matrix factorization uncovers a meaningful latent topical structure permeating the corpus of primary care notes. The learned latent topical basis is transformed into a multivariate time series data structure. Time series methods and plots showcase the evolution/dynamics of learned topics over the study period and allow the identification of COVID-19 pandemic effects. We perform several post-hoc checks of model robustness to increase trust that descriptive/unsupervised inferences are stable over hyper-parameter configurations and/or data perturbations. RESULTS: Temporal topic modelling uncovers a myriad of pandemic-related effects from the expressive clinical text data. In terms of direct effects on patient-health, topics encoding respiratory disease symptoms display altered dynamics during the pandemic year. Further, the pandemic was associated with a multitude of indirect patient-level effects on topical domains representing mental health, sleep, social and familial dynamics, measurement of vitals/labs, uptake of prevention/screening maneuvers, and referrals to medical specialists. Finally, topic models capture changes in primary care practice patterns resulting from the pandemic, including changes in EMR documentation strategies and the uptake of telemedicine. CONCLUSION: Temporal topic modelling applied to a large corpus of rich primary care clinical text data, can identify a meaningful topical/thematic summarization which can provide policymakers and public health stakeholders a passive, cost-effective, technology for understanding holistic impacts of the COVID-19 pandemic on the primary healthcare system and community/public-health. |
format | Online Article Text |
id | pubmed-8861144 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Published by Elsevier Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-88611442022-02-22 Non-negative matrix factorization temporal topic models and clinical text data identify COVID-19 pandemic effects on primary healthcare and community health in Toronto, Canada Meaney, Christopher Escobar, Michael Moineddin, Rahim Stukel, Therese A. Kalia, Sumeet Aliarzadeh, Babak Chen, Tao O'Neill, Braden Greiver, Michelle J Biomed Inform Article OBJECTIVE: To demonstrate how non-negative matrix factorization can be used to learn a temporal topic model over a large collection of primary care clinical notes, characterizing diverse COVID-19 pandemic effects on the physical/mental/social health of residents of Toronto, Canada. MATERIALS AND METHODS: The study employs a retrospective open cohort design, consisting of 382,666 primary care progress notes from 44,828 patients, 54 physicians, and 12 clinics collected 01/01/2017 through 31/12/2020. Non-negative matrix factorization uncovers a meaningful latent topical structure permeating the corpus of primary care notes. The learned latent topical basis is transformed into a multivariate time series data structure. Time series methods and plots showcase the evolution/dynamics of learned topics over the study period and allow the identification of COVID-19 pandemic effects. We perform several post-hoc checks of model robustness to increase trust that descriptive/unsupervised inferences are stable over hyper-parameter configurations and/or data perturbations. RESULTS: Temporal topic modelling uncovers a myriad of pandemic-related effects from the expressive clinical text data. In terms of direct effects on patient-health, topics encoding respiratory disease symptoms display altered dynamics during the pandemic year. Further, the pandemic was associated with a multitude of indirect patient-level effects on topical domains representing mental health, sleep, social and familial dynamics, measurement of vitals/labs, uptake of prevention/screening maneuvers, and referrals to medical specialists. Finally, topic models capture changes in primary care practice patterns resulting from the pandemic, including changes in EMR documentation strategies and the uptake of telemedicine. CONCLUSION: Temporal topic modelling applied to a large corpus of rich primary care clinical text data, can identify a meaningful topical/thematic summarization which can provide policymakers and public health stakeholders a passive, cost-effective, technology for understanding holistic impacts of the COVID-19 pandemic on the primary healthcare system and community/public-health. Published by Elsevier Inc. 2022-04 2022-02-22 /pmc/articles/PMC8861144/ /pubmed/35202844 http://dx.doi.org/10.1016/j.jbi.2022.104034 Text en © 2022 Published by Elsevier Inc. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Meaney, Christopher Escobar, Michael Moineddin, Rahim Stukel, Therese A. Kalia, Sumeet Aliarzadeh, Babak Chen, Tao O'Neill, Braden Greiver, Michelle Non-negative matrix factorization temporal topic models and clinical text data identify COVID-19 pandemic effects on primary healthcare and community health in Toronto, Canada |
title | Non-negative matrix factorization temporal topic models and clinical text data identify COVID-19 pandemic effects on primary healthcare and community health in Toronto, Canada |
title_full | Non-negative matrix factorization temporal topic models and clinical text data identify COVID-19 pandemic effects on primary healthcare and community health in Toronto, Canada |
title_fullStr | Non-negative matrix factorization temporal topic models and clinical text data identify COVID-19 pandemic effects on primary healthcare and community health in Toronto, Canada |
title_full_unstemmed | Non-negative matrix factorization temporal topic models and clinical text data identify COVID-19 pandemic effects on primary healthcare and community health in Toronto, Canada |
title_short | Non-negative matrix factorization temporal topic models and clinical text data identify COVID-19 pandemic effects on primary healthcare and community health in Toronto, Canada |
title_sort | non-negative matrix factorization temporal topic models and clinical text data identify covid-19 pandemic effects on primary healthcare and community health in toronto, canada |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8861144/ https://www.ncbi.nlm.nih.gov/pubmed/35202844 http://dx.doi.org/10.1016/j.jbi.2022.104034 |
work_keys_str_mv | AT meaneychristopher nonnegativematrixfactorizationtemporaltopicmodelsandclinicaltextdataidentifycovid19pandemiceffectsonprimaryhealthcareandcommunityhealthintorontocanada AT escobarmichael nonnegativematrixfactorizationtemporaltopicmodelsandclinicaltextdataidentifycovid19pandemiceffectsonprimaryhealthcareandcommunityhealthintorontocanada AT moineddinrahim nonnegativematrixfactorizationtemporaltopicmodelsandclinicaltextdataidentifycovid19pandemiceffectsonprimaryhealthcareandcommunityhealthintorontocanada AT stukeltheresea nonnegativematrixfactorizationtemporaltopicmodelsandclinicaltextdataidentifycovid19pandemiceffectsonprimaryhealthcareandcommunityhealthintorontocanada AT kaliasumeet nonnegativematrixfactorizationtemporaltopicmodelsandclinicaltextdataidentifycovid19pandemiceffectsonprimaryhealthcareandcommunityhealthintorontocanada AT aliarzadehbabak nonnegativematrixfactorizationtemporaltopicmodelsandclinicaltextdataidentifycovid19pandemiceffectsonprimaryhealthcareandcommunityhealthintorontocanada AT chentao nonnegativematrixfactorizationtemporaltopicmodelsandclinicaltextdataidentifycovid19pandemiceffectsonprimaryhealthcareandcommunityhealthintorontocanada AT oneillbraden nonnegativematrixfactorizationtemporaltopicmodelsandclinicaltextdataidentifycovid19pandemiceffectsonprimaryhealthcareandcommunityhealthintorontocanada AT greivermichelle nonnegativematrixfactorizationtemporaltopicmodelsandclinicaltextdataidentifycovid19pandemiceffectsonprimaryhealthcareandcommunityhealthintorontocanada |