Cargando…
A clinical text classification paradigm using weak supervision and deep representation
BACKGROUND: Automatic clinical text classification is a natural language processing (NLP) technology that unlocks information embedded in clinical narratives. Machine learning approaches have been shown to be effective for clinical text classification tasks. However, a successful machine learning mo...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6322223/ https://www.ncbi.nlm.nih.gov/pubmed/30616584 http://dx.doi.org/10.1186/s12911-018-0723-6 |
_version_ | 1783385575530692608 |
---|---|
author | Wang, Yanshan Sohn, Sunghwan Liu, Sijia Shen, Feichen Wang, Liwei Atkinson, Elizabeth J. Amin, Shreyasee Liu, Hongfang |
author_facet | Wang, Yanshan Sohn, Sunghwan Liu, Sijia Shen, Feichen Wang, Liwei Atkinson, Elizabeth J. Amin, Shreyasee Liu, Hongfang |
author_sort | Wang, Yanshan |
collection | PubMed |
description | BACKGROUND: Automatic clinical text classification is a natural language processing (NLP) technology that unlocks information embedded in clinical narratives. Machine learning approaches have been shown to be effective for clinical text classification tasks. However, a successful machine learning model usually requires extensive human efforts to create labeled training data and conduct feature engineering. In this study, we propose a clinical text classification paradigm using weak supervision and deep representation to reduce these human efforts. METHODS: We develop a rule-based NLP algorithm to automatically generate labels for the training data, and then use the pre-trained word embeddings as deep representation features for training machine learning models. Since machine learning is trained on labels generated by the automatic NLP algorithm, this training process is called weak supervision. We evaluat the paradigm effectiveness on two institutional case studies at Mayo Clinic: smoking status classification and proximal femur (hip) fracture classification, and one case study using a public dataset: the i2b2 2006 smoking status classification shared task. We test four widely used machine learning models, namely, Support Vector Machine (SVM), Random Forest (RF), Multilayer Perceptron Neural Networks (MLPNN), and Convolutional Neural Networks (CNN), using this paradigm. Precision, recall, and F1 score are used as metrics to evaluate performance. RESULTS: CNN achieves the best performance in both institutional tasks (F1 score: 0.92 for Mayo Clinic smoking status classification and 0.97 for fracture classification). We show that word embeddings significantly outperform tf-idf and topic modeling features in the paradigm, and that CNN captures additional patterns from the weak supervision compared to the rule-based NLP algorithms. We also observe two drawbacks of the proposed paradigm that CNN is more sensitive to the size of training data, and that the proposed paradigm might not be effective for complex multiclass classification tasks. CONCLUSION: The proposed clinical text classification paradigm could reduce human efforts of labeled training data creation and feature engineering for applying machine learning to clinical text classification by leveraging weak supervision and deep representation. The experimental experiments have validated the effectiveness of paradigm by two institutional and one shared clinical text classification tasks. |
format | Online Article Text |
id | pubmed-6322223 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-63222232019-01-09 A clinical text classification paradigm using weak supervision and deep representation Wang, Yanshan Sohn, Sunghwan Liu, Sijia Shen, Feichen Wang, Liwei Atkinson, Elizabeth J. Amin, Shreyasee Liu, Hongfang BMC Med Inform Decis Mak Research Article BACKGROUND: Automatic clinical text classification is a natural language processing (NLP) technology that unlocks information embedded in clinical narratives. Machine learning approaches have been shown to be effective for clinical text classification tasks. However, a successful machine learning model usually requires extensive human efforts to create labeled training data and conduct feature engineering. In this study, we propose a clinical text classification paradigm using weak supervision and deep representation to reduce these human efforts. METHODS: We develop a rule-based NLP algorithm to automatically generate labels for the training data, and then use the pre-trained word embeddings as deep representation features for training machine learning models. Since machine learning is trained on labels generated by the automatic NLP algorithm, this training process is called weak supervision. We evaluat the paradigm effectiveness on two institutional case studies at Mayo Clinic: smoking status classification and proximal femur (hip) fracture classification, and one case study using a public dataset: the i2b2 2006 smoking status classification shared task. We test four widely used machine learning models, namely, Support Vector Machine (SVM), Random Forest (RF), Multilayer Perceptron Neural Networks (MLPNN), and Convolutional Neural Networks (CNN), using this paradigm. Precision, recall, and F1 score are used as metrics to evaluate performance. RESULTS: CNN achieves the best performance in both institutional tasks (F1 score: 0.92 for Mayo Clinic smoking status classification and 0.97 for fracture classification). We show that word embeddings significantly outperform tf-idf and topic modeling features in the paradigm, and that CNN captures additional patterns from the weak supervision compared to the rule-based NLP algorithms. We also observe two drawbacks of the proposed paradigm that CNN is more sensitive to the size of training data, and that the proposed paradigm might not be effective for complex multiclass classification tasks. CONCLUSION: The proposed clinical text classification paradigm could reduce human efforts of labeled training data creation and feature engineering for applying machine learning to clinical text classification by leveraging weak supervision and deep representation. The experimental experiments have validated the effectiveness of paradigm by two institutional and one shared clinical text classification tasks. BioMed Central 2019-01-07 /pmc/articles/PMC6322223/ /pubmed/30616584 http://dx.doi.org/10.1186/s12911-018-0723-6 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Wang, Yanshan Sohn, Sunghwan Liu, Sijia Shen, Feichen Wang, Liwei Atkinson, Elizabeth J. Amin, Shreyasee Liu, Hongfang A clinical text classification paradigm using weak supervision and deep representation |
title | A clinical text classification paradigm using weak supervision and deep representation |
title_full | A clinical text classification paradigm using weak supervision and deep representation |
title_fullStr | A clinical text classification paradigm using weak supervision and deep representation |
title_full_unstemmed | A clinical text classification paradigm using weak supervision and deep representation |
title_short | A clinical text classification paradigm using weak supervision and deep representation |
title_sort | clinical text classification paradigm using weak supervision and deep representation |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6322223/ https://www.ncbi.nlm.nih.gov/pubmed/30616584 http://dx.doi.org/10.1186/s12911-018-0723-6 |
work_keys_str_mv | AT wangyanshan aclinicaltextclassificationparadigmusingweaksupervisionanddeeprepresentation AT sohnsunghwan aclinicaltextclassificationparadigmusingweaksupervisionanddeeprepresentation AT liusijia aclinicaltextclassificationparadigmusingweaksupervisionanddeeprepresentation AT shenfeichen aclinicaltextclassificationparadigmusingweaksupervisionanddeeprepresentation AT wangliwei aclinicaltextclassificationparadigmusingweaksupervisionanddeeprepresentation AT atkinsonelizabethj aclinicaltextclassificationparadigmusingweaksupervisionanddeeprepresentation AT aminshreyasee aclinicaltextclassificationparadigmusingweaksupervisionanddeeprepresentation AT liuhongfang aclinicaltextclassificationparadigmusingweaksupervisionanddeeprepresentation AT wangyanshan clinicaltextclassificationparadigmusingweaksupervisionanddeeprepresentation AT sohnsunghwan clinicaltextclassificationparadigmusingweaksupervisionanddeeprepresentation AT liusijia clinicaltextclassificationparadigmusingweaksupervisionanddeeprepresentation AT shenfeichen clinicaltextclassificationparadigmusingweaksupervisionanddeeprepresentation AT wangliwei clinicaltextclassificationparadigmusingweaksupervisionanddeeprepresentation AT atkinsonelizabethj clinicaltextclassificationparadigmusingweaksupervisionanddeeprepresentation AT aminshreyasee clinicaltextclassificationparadigmusingweaksupervisionanddeeprepresentation AT liuhongfang clinicaltextclassificationparadigmusingweaksupervisionanddeeprepresentation |