Cargando…

A cross-institutional evaluation on breast cancer phenotyping NLP algorithms on electronic health records

OBJECTIVE: Transformer-based language models are prevailing in the clinical domain due to their excellent performance on clinical NLP tasks. The generalizability of those models is usually ignored during the model development process. This study evaluated the generalizability of CancerBERT, a Transf...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Sicheng, Wang, Nan, Wang, Liwei, Sun, Ju, Blaes, Anne, Liu, Hongfang, Zhang, Rui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10480628/
https://www.ncbi.nlm.nih.gov/pubmed/37680211
http://dx.doi.org/10.1016/j.csbj.2023.08.018
_version_ 1785101830877872128
author Zhou, Sicheng
Wang, Nan
Wang, Liwei
Sun, Ju
Blaes, Anne
Liu, Hongfang
Zhang, Rui
author_facet Zhou, Sicheng
Wang, Nan
Wang, Liwei
Sun, Ju
Blaes, Anne
Liu, Hongfang
Zhang, Rui
author_sort Zhou, Sicheng
collection PubMed
description OBJECTIVE: Transformer-based language models are prevailing in the clinical domain due to their excellent performance on clinical NLP tasks. The generalizability of those models is usually ignored during the model development process. This study evaluated the generalizability of CancerBERT, a Transformer-based clinical NLP model, along with classic machine learning models, i.e., conditional random field (CRF), bi-directional long short-term memory CRF (BiLSTM-CRF), across different clinical institutes through a breast cancer phenotype extraction task. MATERIALS AND METHODS: Two clinical corpora of breast cancer patients were collected from the electronic health records from the University of Minnesota (UMN) and Mayo Clinic (MC), and annotated following the same guideline. We developed three types of NLP models (i.e., CRF, BiLSTM-CRF and CancerBERT) to extract cancer phenotypes from clinical texts. We evaluated the generalizability of models on different test sets with different learning strategies (model transfer vs locally trained). The entity coverage score was assessed with their association with the model performances. RESULTS: We manually annotated 200 and 161 clinical documents at UMN and MC. The corpora of the two institutes were found to have higher similarity between the target entities than the overall corpora. The CancerBERT models obtained the best performances among the independent test sets from two clinical institutes and the permutation test set. The CancerBERT model developed in one institute and further fine-tuned in another institute achieved reasonable performance compared to the model developed on local data (micro-F1: 0.925 vs 0.932). CONCLUSIONS: The results indicate the CancerBERT model has superior learning ability and generalizability among the three types of clinical NLP models for our named entity recognition task. It has the advantage to recognize complex entities, e.g., entities with different labels.
format Online
Article
Text
id pubmed-10480628
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-104806282023-09-07 A cross-institutional evaluation on breast cancer phenotyping NLP algorithms on electronic health records Zhou, Sicheng Wang, Nan Wang, Liwei Sun, Ju Blaes, Anne Liu, Hongfang Zhang, Rui Comput Struct Biotechnol J Research Article OBJECTIVE: Transformer-based language models are prevailing in the clinical domain due to their excellent performance on clinical NLP tasks. The generalizability of those models is usually ignored during the model development process. This study evaluated the generalizability of CancerBERT, a Transformer-based clinical NLP model, along with classic machine learning models, i.e., conditional random field (CRF), bi-directional long short-term memory CRF (BiLSTM-CRF), across different clinical institutes through a breast cancer phenotype extraction task. MATERIALS AND METHODS: Two clinical corpora of breast cancer patients were collected from the electronic health records from the University of Minnesota (UMN) and Mayo Clinic (MC), and annotated following the same guideline. We developed three types of NLP models (i.e., CRF, BiLSTM-CRF and CancerBERT) to extract cancer phenotypes from clinical texts. We evaluated the generalizability of models on different test sets with different learning strategies (model transfer vs locally trained). The entity coverage score was assessed with their association with the model performances. RESULTS: We manually annotated 200 and 161 clinical documents at UMN and MC. The corpora of the two institutes were found to have higher similarity between the target entities than the overall corpora. The CancerBERT models obtained the best performances among the independent test sets from two clinical institutes and the permutation test set. The CancerBERT model developed in one institute and further fine-tuned in another institute achieved reasonable performance compared to the model developed on local data (micro-F1: 0.925 vs 0.932). CONCLUSIONS: The results indicate the CancerBERT model has superior learning ability and generalizability among the three types of clinical NLP models for our named entity recognition task. It has the advantage to recognize complex entities, e.g., entities with different labels. Research Network of Computational and Structural Biotechnology 2023-08-22 /pmc/articles/PMC10480628/ /pubmed/37680211 http://dx.doi.org/10.1016/j.csbj.2023.08.018 Text en © 2023 The Authors. Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Zhou, Sicheng
Wang, Nan
Wang, Liwei
Sun, Ju
Blaes, Anne
Liu, Hongfang
Zhang, Rui
A cross-institutional evaluation on breast cancer phenotyping NLP algorithms on electronic health records
title A cross-institutional evaluation on breast cancer phenotyping NLP algorithms on electronic health records
title_full A cross-institutional evaluation on breast cancer phenotyping NLP algorithms on electronic health records
title_fullStr A cross-institutional evaluation on breast cancer phenotyping NLP algorithms on electronic health records
title_full_unstemmed A cross-institutional evaluation on breast cancer phenotyping NLP algorithms on electronic health records
title_short A cross-institutional evaluation on breast cancer phenotyping NLP algorithms on electronic health records
title_sort cross-institutional evaluation on breast cancer phenotyping nlp algorithms on electronic health records
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10480628/
https://www.ncbi.nlm.nih.gov/pubmed/37680211
http://dx.doi.org/10.1016/j.csbj.2023.08.018
work_keys_str_mv AT zhousicheng acrossinstitutionalevaluationonbreastcancerphenotypingnlpalgorithmsonelectronichealthrecords
AT wangnan acrossinstitutionalevaluationonbreastcancerphenotypingnlpalgorithmsonelectronichealthrecords
AT wangliwei acrossinstitutionalevaluationonbreastcancerphenotypingnlpalgorithmsonelectronichealthrecords
AT sunju acrossinstitutionalevaluationonbreastcancerphenotypingnlpalgorithmsonelectronichealthrecords
AT blaesanne acrossinstitutionalevaluationonbreastcancerphenotypingnlpalgorithmsonelectronichealthrecords
AT liuhongfang acrossinstitutionalevaluationonbreastcancerphenotypingnlpalgorithmsonelectronichealthrecords
AT zhangrui acrossinstitutionalevaluationonbreastcancerphenotypingnlpalgorithmsonelectronichealthrecords
AT zhousicheng crossinstitutionalevaluationonbreastcancerphenotypingnlpalgorithmsonelectronichealthrecords
AT wangnan crossinstitutionalevaluationonbreastcancerphenotypingnlpalgorithmsonelectronichealthrecords
AT wangliwei crossinstitutionalevaluationonbreastcancerphenotypingnlpalgorithmsonelectronichealthrecords
AT sunju crossinstitutionalevaluationonbreastcancerphenotypingnlpalgorithmsonelectronichealthrecords
AT blaesanne crossinstitutionalevaluationonbreastcancerphenotypingnlpalgorithmsonelectronichealthrecords
AT liuhongfang crossinstitutionalevaluationonbreastcancerphenotypingnlpalgorithmsonelectronichealthrecords
AT zhangrui crossinstitutionalevaluationonbreastcancerphenotypingnlpalgorithmsonelectronichealthrecords