Cargando…

Comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives

In secondary analysis of electronic health records, a crucial task consists in correctly identifying the patient cohort under investigation. In many cases, the most valuable and relevant information for an accurate classification of medical conditions exist only in clinical narratives. Therefore, it...

Descripción completa

Detalles Bibliográficos
Autores principales:	Gehrmann, Sebastian, Dernoncourt, Franck, Li, Yeran, Carlson, Eric T., Wu, Joy T., Welt, Jonathan, Foote, John, Moseley, Edward T., Grant, David W., Tyler, Patrick D., Celi, Leo A.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2018
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5813927/ https://www.ncbi.nlm.nih.gov/pubmed/29447188 http://dx.doi.org/10.1371/journal.pone.0192360

_version_	1783300250375553024
author	Gehrmann, Sebastian Dernoncourt, Franck Li, Yeran Carlson, Eric T. Wu, Joy T. Welt, Jonathan Foote, John Moseley, Edward T. Grant, David W. Tyler, Patrick D. Celi, Leo A.
author_facet	Gehrmann, Sebastian Dernoncourt, Franck Li, Yeran Carlson, Eric T. Wu, Joy T. Welt, Jonathan Foote, John Moseley, Edward T. Grant, David W. Tyler, Patrick D. Celi, Leo A.
author_sort	Gehrmann, Sebastian
collection	PubMed
description	In secondary analysis of electronic health records, a crucial task consists in correctly identifying the patient cohort under investigation. In many cases, the most valuable and relevant information for an accurate classification of medical conditions exist only in clinical narratives. Therefore, it is necessary to use natural language processing (NLP) techniques to extract and evaluate these narratives. The most commonly used approach to this problem relies on extracting a number of clinician-defined medical concepts from text and using machine learning techniques to identify whether a particular patient has a certain condition. However, recent advances in deep learning and NLP enable models to learn a rich representation of (medical) language. Convolutional neural networks (CNN) for text classification can augment the existing techniques by leveraging the representation of language to learn which phrases in a text are relevant for a given medical condition. In this work, we compare concept extraction based methods with CNNs and other commonly used models in NLP in ten phenotyping tasks using 1,610 discharge summaries from the MIMIC-III database. We show that CNNs outperform concept extraction based methods in almost all of the tasks, with an improvement in F1-score of up to 26 and up to 7 percentage points in area under the ROC curve (AUC). We additionally assess the interpretability of both approaches by presenting and evaluating methods that calculate and extract the most salient phrases for a prediction. The results indicate that CNNs are a valid alternative to existing approaches in patient phenotyping and cohort identification, and should be further investigated. Moreover, the deep learning approach presented in this paper can be used to assist clinicians during chart review or support the extraction of billing codes from text by identifying and highlighting relevant phrases for various medical conditions.
format	Online Article Text
id	pubmed-5813927
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-58139272018-03-02 Comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives Gehrmann, Sebastian Dernoncourt, Franck Li, Yeran Carlson, Eric T. Wu, Joy T. Welt, Jonathan Foote, John Moseley, Edward T. Grant, David W. Tyler, Patrick D. Celi, Leo A. PLoS One Research Article In secondary analysis of electronic health records, a crucial task consists in correctly identifying the patient cohort under investigation. In many cases, the most valuable and relevant information for an accurate classification of medical conditions exist only in clinical narratives. Therefore, it is necessary to use natural language processing (NLP) techniques to extract and evaluate these narratives. The most commonly used approach to this problem relies on extracting a number of clinician-defined medical concepts from text and using machine learning techniques to identify whether a particular patient has a certain condition. However, recent advances in deep learning and NLP enable models to learn a rich representation of (medical) language. Convolutional neural networks (CNN) for text classification can augment the existing techniques by leveraging the representation of language to learn which phrases in a text are relevant for a given medical condition. In this work, we compare concept extraction based methods with CNNs and other commonly used models in NLP in ten phenotyping tasks using 1,610 discharge summaries from the MIMIC-III database. We show that CNNs outperform concept extraction based methods in almost all of the tasks, with an improvement in F1-score of up to 26 and up to 7 percentage points in area under the ROC curve (AUC). We additionally assess the interpretability of both approaches by presenting and evaluating methods that calculate and extract the most salient phrases for a prediction. The results indicate that CNNs are a valid alternative to existing approaches in patient phenotyping and cohort identification, and should be further investigated. Moreover, the deep learning approach presented in this paper can be used to assist clinicians during chart review or support the extraction of billing codes from text by identifying and highlighting relevant phrases for various medical conditions. Public Library of Science 2018-02-15 /pmc/articles/PMC5813927/ /pubmed/29447188 http://dx.doi.org/10.1371/journal.pone.0192360 Text en © 2018 Gehrmann et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Gehrmann, Sebastian Dernoncourt, Franck Li, Yeran Carlson, Eric T. Wu, Joy T. Welt, Jonathan Foote, John Moseley, Edward T. Grant, David W. Tyler, Patrick D. Celi, Leo A. Comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives
title	Comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives
title_full	Comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives
title_fullStr	Comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives
title_full_unstemmed	Comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives
title_short	Comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives
title_sort	comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5813927/ https://www.ncbi.nlm.nih.gov/pubmed/29447188 http://dx.doi.org/10.1371/journal.pone.0192360
work_keys_str_mv	AT gehrmannsebastian comparingdeeplearningandconceptextractionbasedmethodsforpatientphenotypingfromclinicalnarratives AT dernoncourtfranck comparingdeeplearningandconceptextractionbasedmethodsforpatientphenotypingfromclinicalnarratives AT liyeran comparingdeeplearningandconceptextractionbasedmethodsforpatientphenotypingfromclinicalnarratives AT carlsonerict comparingdeeplearningandconceptextractionbasedmethodsforpatientphenotypingfromclinicalnarratives AT wujoyt comparingdeeplearningandconceptextractionbasedmethodsforpatientphenotypingfromclinicalnarratives AT weltjonathan comparingdeeplearningandconceptextractionbasedmethodsforpatientphenotypingfromclinicalnarratives AT footejohn comparingdeeplearningandconceptextractionbasedmethodsforpatientphenotypingfromclinicalnarratives AT moseleyedwardt comparingdeeplearningandconceptextractionbasedmethodsforpatientphenotypingfromclinicalnarratives AT grantdavidw comparingdeeplearningandconceptextractionbasedmethodsforpatientphenotypingfromclinicalnarratives AT tylerpatrickd comparingdeeplearningandconceptextractionbasedmethodsforpatientphenotypingfromclinicalnarratives AT celileoa comparingdeeplearningandconceptextractionbasedmethodsforpatientphenotypingfromclinicalnarratives

Comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives

Ejemplares similares