Cargando…

Electronic Medical Records as Input to Predict Postoperative Immediate Remission of Cushing’s Disease: Application of Word Embedding

BACKGROUND: No existing machine learning (ML)-based models use free text from electronic medical records (EMR) as input to predict immediate remission (IR) of Cushing’s disease (CD) after transsphenoidal surgery. PURPOSE: The aim of the present study is to develop an ML-based model that uses EMR tha...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Wentai, Li, Dongfang, Feng, Ming, Hu, Baotian, Fan, Yanghua, Chen, Qingcai, Wang, Renzhi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8548651/
https://www.ncbi.nlm.nih.gov/pubmed/34722308
http://dx.doi.org/10.3389/fonc.2021.754882
_version_ 1784590620883419136
author Zhang, Wentai
Li, Dongfang
Feng, Ming
Hu, Baotian
Fan, Yanghua
Chen, Qingcai
Wang, Renzhi
author_facet Zhang, Wentai
Li, Dongfang
Feng, Ming
Hu, Baotian
Fan, Yanghua
Chen, Qingcai
Wang, Renzhi
author_sort Zhang, Wentai
collection PubMed
description BACKGROUND: No existing machine learning (ML)-based models use free text from electronic medical records (EMR) as input to predict immediate remission (IR) of Cushing’s disease (CD) after transsphenoidal surgery. PURPOSE: The aim of the present study is to develop an ML-based model that uses EMR that include both structured features and free text as input to preoperatively predict IR after transsphenoidal surgery. METHODS: A total of 419 patients with CD from Peking Union Medical College Hospital were enrolled between January 2014 and August 2020. The EMR of the patients were embedded and transformed into low-dimensional dense vectors that can be included in four ML-based models together with structured features. The area under the curve (AUC) of receiver operating characteristic curves was used to evaluate the performance of the models. RESULTS: The overall remission rate of the 419 patients was 75.7%. From the results of logistic multivariate analysis, operation (p < 0.001), invasion of cavernous sinus from MRI (p = 0.046), and ACTH (p = 0.024) were strongly correlated with IR. The AUC values for the four ML-based models ranged from 0.686 to 0.793. The highest AUC value (0.793) was for logistic regression when 11 structured features and “individual conclusions of the case by doctor” were included. CONCLUSION: An ML-based model was developed using both structured and unstructured features (after being processed using a word embedding method) as input to preoperatively predict postoperative IR.
format Online
Article
Text
id pubmed-8548651
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-85486512021-10-28 Electronic Medical Records as Input to Predict Postoperative Immediate Remission of Cushing’s Disease: Application of Word Embedding Zhang, Wentai Li, Dongfang Feng, Ming Hu, Baotian Fan, Yanghua Chen, Qingcai Wang, Renzhi Front Oncol Oncology BACKGROUND: No existing machine learning (ML)-based models use free text from electronic medical records (EMR) as input to predict immediate remission (IR) of Cushing’s disease (CD) after transsphenoidal surgery. PURPOSE: The aim of the present study is to develop an ML-based model that uses EMR that include both structured features and free text as input to preoperatively predict IR after transsphenoidal surgery. METHODS: A total of 419 patients with CD from Peking Union Medical College Hospital were enrolled between January 2014 and August 2020. The EMR of the patients were embedded and transformed into low-dimensional dense vectors that can be included in four ML-based models together with structured features. The area under the curve (AUC) of receiver operating characteristic curves was used to evaluate the performance of the models. RESULTS: The overall remission rate of the 419 patients was 75.7%. From the results of logistic multivariate analysis, operation (p < 0.001), invasion of cavernous sinus from MRI (p = 0.046), and ACTH (p = 0.024) were strongly correlated with IR. The AUC values for the four ML-based models ranged from 0.686 to 0.793. The highest AUC value (0.793) was for logistic regression when 11 structured features and “individual conclusions of the case by doctor” were included. CONCLUSION: An ML-based model was developed using both structured and unstructured features (after being processed using a word embedding method) as input to preoperatively predict postoperative IR. Frontiers Media S.A. 2021-10-13 /pmc/articles/PMC8548651/ /pubmed/34722308 http://dx.doi.org/10.3389/fonc.2021.754882 Text en Copyright © 2021 Zhang, Li, Feng, Hu, Fan, Chen and Wang https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Oncology
Zhang, Wentai
Li, Dongfang
Feng, Ming
Hu, Baotian
Fan, Yanghua
Chen, Qingcai
Wang, Renzhi
Electronic Medical Records as Input to Predict Postoperative Immediate Remission of Cushing’s Disease: Application of Word Embedding
title Electronic Medical Records as Input to Predict Postoperative Immediate Remission of Cushing’s Disease: Application of Word Embedding
title_full Electronic Medical Records as Input to Predict Postoperative Immediate Remission of Cushing’s Disease: Application of Word Embedding
title_fullStr Electronic Medical Records as Input to Predict Postoperative Immediate Remission of Cushing’s Disease: Application of Word Embedding
title_full_unstemmed Electronic Medical Records as Input to Predict Postoperative Immediate Remission of Cushing’s Disease: Application of Word Embedding
title_short Electronic Medical Records as Input to Predict Postoperative Immediate Remission of Cushing’s Disease: Application of Word Embedding
title_sort electronic medical records as input to predict postoperative immediate remission of cushing’s disease: application of word embedding
topic Oncology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8548651/
https://www.ncbi.nlm.nih.gov/pubmed/34722308
http://dx.doi.org/10.3389/fonc.2021.754882
work_keys_str_mv AT zhangwentai electronicmedicalrecordsasinputtopredictpostoperativeimmediateremissionofcushingsdiseaseapplicationofwordembedding
AT lidongfang electronicmedicalrecordsasinputtopredictpostoperativeimmediateremissionofcushingsdiseaseapplicationofwordembedding
AT fengming electronicmedicalrecordsasinputtopredictpostoperativeimmediateremissionofcushingsdiseaseapplicationofwordembedding
AT hubaotian electronicmedicalrecordsasinputtopredictpostoperativeimmediateremissionofcushingsdiseaseapplicationofwordembedding
AT fanyanghua electronicmedicalrecordsasinputtopredictpostoperativeimmediateremissionofcushingsdiseaseapplicationofwordembedding
AT chenqingcai electronicmedicalrecordsasinputtopredictpostoperativeimmediateremissionofcushingsdiseaseapplicationofwordembedding
AT wangrenzhi electronicmedicalrecordsasinputtopredictpostoperativeimmediateremissionofcushingsdiseaseapplicationofwordembedding